The Korean Association for the Study of English Language and Linguistics
[ Article ]
Korea Journal of English Language and Linguistics - Vol. 22, No. 0, pp.1443-1464
ISSN: 1598-1398 (Print) 2586-7474 (Online)
Print publication date 31 Jan 2022
Received 07 Nov 2022 Revised 18 Dec 2022 Accepted 30 Dec 2022
DOI: https://doi.org/10.15738/kjell.22..202212.1443

Production of Coda Obstruent Clusters by Korean and English Speakers: Acoustical and Dynamic Time Warping Analyses

Hyesun Cho
Associate Professor, Department of Education, Graduate School of Education, Dankook University hscho@dankook.ac.kr


© 2022 KASELL All rights reserved
This is an open-access article distributed under the terms of the Creative Commons License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Coda obstruent clusters in English are known to be difficult for Korean learners of English to produce due to phonotactic differences between Korean and English syllables. In English, the coda obstruent clusters undergo reduction, so the medial consonant in CCC clusters is deleted by native speakers of English. In this study, coda obstruent clusters produced by Korean and English speakers are compared for their acoustic properties (center of gravity and intensity) and the similarity distance obtained by the dynamic time warping (DTW) algorithm. The CoG and intensity was overall lower in Korean speakers than in English speakers. The DTW similarity distance between the clusters produced by English speakers and those produced by Korean speakers was greater than the distance between the clusters produced by English speakers only. In addition, the DTW similarity distance between the clusters produced by English speakers and the error tokens by Korean speakers was greater than the distance between the clusters produced by English speakers and the non-error tokens by Korean speakers. The current study further employed the K-Nearest Neighbors (KNN) classifier for L1 and error detection using the DTW distance measures. The results showed that the DTW similarity distance was an adequate measure to capture the differences due to speakers’ L1 and error production.

Keywords:

coda obstruent clusters, dynamic time warping, intensity, center of gravity

References

  • Altman, Naomi. S. 1992. An introduction to Kernel and Nearest Neighbor nonparametric regression. The American Statistician 46(3), 175-185. [https://doi.org/10.1080/00031305.1992.10475879]
  • Alfeilat, Haneen Arafat Abu, Ahmad B.A. Hassanat, Omar Lasassmeh, Ahmad S. Tarawneh, Mahmoud Bashir Alhasanat, Hamzeh S. Eyal Salman and V. B. Surya Prasath. 2019. Effects of distance measure choice on K-Nearest Neighbor Classifier performance: A review. Big Data 7(4), 221-248. [https://doi.org/10.1089/big.2018.0175]
  • Balise, Raymond and Randy Diehl. 1994. Some distributional facts about fricatives and a perceptual explanation. Phonetica 51, 99-110. [https://doi.org/10.1159/000261962]
  • Beckers, Gabriel, J. L. 2002. RMS equalize Praat scripts. http://wwwbio.leidenuniv.nl/~eew/G6/staff/beckers/beckers.html
  • Boersma, Paul and David Weenink. 1992-2020. Praat (version 6.1.35): Doing phonetics by computer.
  • Browman, Catherine P. and Louis Goldstein. 1989. Articulatory gestures as phonological units. Phonology 6, 201-251. [https://doi.org/10.1017/S0952675700001019]
  • Bugdol, Marcin, Zuzanna Miodonska and Micha Kręcichwost. 2014. Pronunciation error detection using dynamic time warping algorithm. In E. Pietka, J. Kawa and W. Wieclawek, eds., Information Technologies in Biomedicine 4, 345-354. DOI: 10.1007/978-3-319-06596-0_32 [https://doi.org/10.1007/978-3-319-06596-0_32]
  • Celce-Muercia, Marianne, Donna M. Brinton and Janet M. Goodwin. 2010. Teaching Pronunciation: A Course Book and Reference Guide, 2nd ed. Cambridge: Cambridge University Press.
  • Cho, Mi-Hui. 2005. Cluster reduction by Korean EFL students: insertion vs. deletion strategies. The Journal of the Korea Contents Association 6(1), 80-84.
  • Cho, Mi-Hui and Shinsook Lee. 2005. Repair strategies of English biconsonantal coda clusters: An optimality-theoretic account in conjunction with P-map. Studies in Phonetics, Phonology and Morphology 11(2), 191-214.
  • Cho, Sunghye, Naomi Nevler, Natalia Parjane, Christopher Cieri, Mark Liberman, Murray Grossman and Katheryn A. Q. Cousins. 2021. Automated analysis of digitized letter fluency data. Frontiers in Psychology 12, Article 654214. [https://doi.org/10.3389/fpsyg.2021.654214]
  • Clements, George. N. 1990. The role of the sonority cycle in core syllabification. In John Kingston and Mary E. Beckman, eds., Papers in Laboratory Phonology I: Between the grammar and the physics of speech, 283-333. Cambridge: Cambridge University Press. [https://doi.org/10.1017/CBO9780511627736.017]
  • Cover, Thomas M. and Peter E. Hart. 1967. Nearest Neighbor pattern classification. IEEE Transactions on Information Theory 13(1), 21-27. [https://doi.org/10.1109/TIT.1967.1053964]
  • Cunningham, Pádraig and Sarah Jane Delany. 2007. K-Nearest Neighbour Classifiers. Technical Report UD-CSI-2007-4. March 27, 2007.
  • Cunningham, Pádraig and Sarah Jane Delany. 2020. K-Nearest Neighbor Classifiers, 2nd ed. (with Python Examples). arXiv:2004.04523 [cs.LG]
  • Davidson, Lisa. 2006. Phonology, phonetics, or frequency: Influences on the production of non-native sequences. Journal of Phonetics 34(1), 104-137. [https://doi.org/10.1016/j.wocn.2005.03.004]
  • Fix, Evelyn and J. L. Hodges, Jr. 1951. Discriminatory Analysis, Nonparametric Discrimination: Consistency Properties. Technical Report 4, USAF School of Aviation Medicine, Randolph Field. [https://doi.org/10.1037/e471672008-001]
  • Gimson, Alfred C. 1989. An Introduction to the Pronunciation of English, 4th ed. London: Edward Arnold.
  • Giorgino, Toni. 2009. Computing and visualizing dynamic time warping alignments in R: The dtw package. Journal of Statistical Software 31(7), 1-24. [https://doi.org/10.18637/jss.v031.i07]
  • Gordon, Matthew, Paul Barthmaier and Kathy Sands. 2002. A cross-linguistic acoustic study of voiceless fricatives. Journal of the International Phonetic Association 32, 141-174. [https://doi.org/10.1017/S0025100302001020]
  • Gussenhoven, Carlos amd Jacobs Haike. 2011. Understanding Phonology: Understanding Language, 3rd ed. London: Hodder Education.
  • Hong, Hyejin, Sunhee Kim and Minhwa Chung. 2014. A corpus-based analysis of English segments produced by Korean learners. Journal of Phonetics 46, 52-67. [https://doi.org/10.1016/j.wocn.2014.06.002]
  • Kanters, Sandra, Catia Cucchiarini, Helmer Strik. 2009. The goodness of pronunciation algorithm: A detailed performance study. In Proceedings of Speech and Language Technology in Education (SLaTE 2009), 49-52
  • Kim, Jungsun. 2015. The effect of word frequency on the reduction of English CVCC syllables in spontaneous speech. Journal of the Korean Society of Speech Sciences 7(3), 45-53. [https://doi.org/10.13064/KSSS.2015.7.3.045]
  • Kim. Jungyeon. 2018. Production of English final stops by Korean speakers. Phonetics Speech Sciences 10(4),11-17. [https://doi.org/10.13064/KSSS.2018.10.4.011]
  • Kreidler, Charles W. 2004. The Pronunciation of English: A Course Book, 2nd ed. Malde:Blackwell Publishing. [https://doi.org/10.1002/9780470759264]
  • Kwon, Bo-Young. 2008. The effects of gestural overlap on the acquisition of English word-final obstruent clusters. English Language and Linguistics 25, 149-170. [https://doi.org/10.17960/ell.2008..25.008]
  • Ladefoged, Peter. 2005. Vowels and Consonants, 2nd ed. Malden:Blackwell Publishing.
  • Lee, Ho-Young. 2000. The pronunciation of English consonant clusters by Koreans. Malsori 40, 79-89.
  • Lee, Shinsook. 2003. A comparison of cluster realizations in first and second language. The Journal of Studies in Language 19(2), 341-357.
  • Lee, Shinsook, Jeongsoon Joh and Mi-Hui Cho. 2002. Acquisition of English consonant clusters among Korean EFL learners. Korean Journal of Linguistics 27(3), 439-472.
  • Lim, Youngshin. 2021. Repair strategies employed by Korean speakers in producing English final consonant clusters. Journal of Linguistic Studies 26(1), 205-221. [https://doi.org/10.21291/jkals.2021.26.1.10]
  • Muda, Lindasalwa, Mumtaj Begam, and I. Elamvazuth. 2010. Voice recognition algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) techniques. Journal of Computing 2(3), 138-143.
  • Park, Jin-Sook. 2021. Acoustic and spectral characteristics of English voiceless fricatives. The Journal of Studies in Language 37(1), 7-19.
  • Park, Sunwoo. 2022. Towards grammar research based on data science. Grammar Education 44, 1-28.
  • Pater, Joe. 2019. Generative linguisitcs and neural networks at 60: Foundation, frication, and fusion. Language 95(1), e41-e74. [https://doi.org/10.1353/lan.2019.0009]
  • Prator, Jr., Clifford Holmes and Robinett Betty Wallace. 2009. Manual of American English Pronunciation, 4th ed. Seoul: Heinle Cengage Learning.
  • Roca, Iggy and Wyn Johnson. 1999. A Course in Phonology. Malden: Blackwell Publishers.
  • R Core Team. 2022. R: Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/
  • Sakoe, Hiroaki and Seibi Chiba. 1978. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing 26(1), 43-49. [https://doi.org/10.1109/TASSP.1978.1163055]
  • Sung, Eunkyung and Yunjeong Cho. 2010. An acoustic study of Korean and English voiceless sibilant fricatives. Phonetics and Speech Sciences 2(3), 37-46.
  • Silbert, Noah and Kenneth de Jong. 2008. Focus, prosodic context, and phonological feature specification: Patterns of variation in fricative production. Journal of the Acoustical Society of America 1.2.4.5, 2769-2779. [https://doi.org/10.1121/1.2890736]
  • Shin, Dong-Jin and Paul Iverson. 2014. Phonetic investigation of epenthetic vowels produced by Korean learners of English. Journal of the Korean Society of Speech Sciences 6(4), 17-26. [https://doi.org/10.13064/KSSS.2014.6.4.017]
  • Steinbach, Michael and Tan, Pang-Ning. 2009. kNN: k-Nearest Neighbors. In Xindong Wu and Vipin Kumar, eds., The Top Ten ALgorithms in Data Mining, 151-161. Boca Raton: Chapman and Hall/CRC. [https://doi.org/10.1201/9781420089653.ch8]
  • Sun, Xihao, Yoshikazu Miyanaga and Baiko Sai. 2014. Dynamic time warping for speech recognition with training part to reduce the computation. Journal of Signal Processing 18(2), 89-96, [https://doi.org/10.2299/jsp.18.89]
  • Sung, Eunkyung and Yunjeong Cho. 2010. An acoustic study of Korean and English voiceless sibilant fricatives. Phonetics and Speech Sciences 2(3), 37-46.
  • Weinberger, Steven. 1994. Functional and phonetic constraints on second language phonology. In M. Yavas, eds., First and Second Language Phonology, 283-302. Singular Publishing Group, Inc.
  • Yavaş, Mehmet. 2020. Applied English Phonology, 4th ed. Malden: Wiley-Blackwell.
  • Zhao, Tongmu, Akemi Hoshino, Masayuki Suzuki, Nobuaki Minematsu and Keikichi Hirose. 2012. Automatic Chinese pronunciation error detection using SVM trained with structural features. IEEE Spoken Language Technology Workshop (SLT), 2012, 473-478. [https://doi.org/10.1109/SLT.2012.6424270]