The Korean Association for the Study of English Language and Linguistics
[ Article ]
Korea Journal of English Language and Linguistics - Vol. 25, No. 0, pp.1378-1394
ISSN: 1598-1398 (Print) 2586-7474 (Online)
Print publication date 31 Jan 2025
Received 18 Jul 2025 Revised 22 Aug 2025 Accepted 16 Oct 2025
DOI: https://doi.org/10.15738/kjell.25..202510.1378

Exploring Vocabulary Patterns in ESL and EFL Learner Corpora: Implications for Language Teaching

Seulgi Choi ; Sunyoung Park ; Jin-young Tak
(First author) Master’s Student, Department of Artificial Intelligence and Language Engineering Sejong University csg0726@sju.ac.kr
(Co-author) Visiting Professor, Daeyang Humanity College Sejong University sunpark@sejong.ac.kr
(Corresponding author) Professor, English Data Convergence Major Sejong University jytak@sejong.ac.kr


© 2025 KASELL All rights reserved
This is an open-access article distributed under the terms of the Creative Commons License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Vocabulary knowledge is a crucial determinant of writing proficiency and overall communicative competence for second language learners. While previous research examined lexical richness in academic writing, few studies have directly compared English as a Second Language (ESL) and English as a Foreign Language (EFL) learners. This study attempts to fill that gap by analyzing vocabulary use in written texts from EFL learners and ESL learners, focusing on how different learning environments influence lexical development. Using the Linguistic Feature Toolkit (LFTK), an open-source tool with over 220 linguistically defined features, and the International Corpus Network of Asian Learners of English (ICNALE), the study systematically examines lexical features across groups. The results of the current study found interesting patterns between ESL and EFL learners. ESL learners, immersed in English speaking contexts, demonstrate greater lexical flexibility and diversity in content word usage that closely aligns with native speaker patterns. In contrast, EFL learners exhibited more native-like patterns in functional word usages. This phenomenon can be interpreted as that ESL learners develop greater diversity in content word usage as a result of immersive exposure, whereas EFL learners, through form-focused instruction, developed functional word usage patterns more similar to native speakers. These results have important pedagogical implications: AI-based vocabulary learning tools can simulate immersive exposure for EFL learners, promoting lexical diversity and motivation, while ESL learners may benefit from explicit instruction targeting functional words to improve grammatical precision. Ultimately, the study emphasizes the importance of tailoring instruction to learners’ environments to support balanced language development and effective writing outcomes.

Keywords:

vocabulary development, lexical diversity, EFL learners, ESL learners, writing proficiency, pedagogical implications

Acknowledgments

Seulgi Choi and Sunyoung Park contributed equally to the conception and design of the study and share first authorship.

References

  • Alsakaker, S. M. 2025. Investigating EFL Learners’ perceptions of using AI to enhance English vocabulary acquisition based on the technology acceptance Model. Forum for Linguistic Studies 7(2), 1067-1077. [https://doi.org/10.30564/fls.v7i2.8593]
  • Azizollahi, Z., F. Tabrizi, and Z. Behineh. 2025. Utilizing AI mediated methods of EFL vocabulary learning and teaching. Modarese Bartar Quarterly 2(3), 1-18. Available online at https://teacherstribune.com/wp-content/uploads/2025/01/Utilizing-AI-Mediated-Methods-of-EFL-Vocabulary-Learning-and-Teaching-1.pdf, Article Publication
  • Azodi, N., F. Karimi and R. Vaezi. 2014. Measuring the lexical richness of productive vocabulary in Iranian EFL university students’ writing performance. Theory and Practice in Language Studies (TPLS) 4(9). [https://doi.org/10.4304/tpls.4.9.1837-1849]
  • Bestgen, Y. 2017. Beyond single-word measures: L2 writing assessment, lexical richness and formulaic competence. System 69, 65-78. [https://doi.org/10.1016/j.system.2017.08.004]
  • Bulté, B. and A. Housen. 2014. Conceptualizing and measuring short-term changes in L2 writing complexity. Journal of Second Language Writing 26, 42-65. [https://doi.org/10.1016/j.jslw.2014.09.005]
  • Buschfeld, S. and A. Kautzsch. 2017. Towards an integrated approach to postcolonial and non-postcolonial Englishes. World Englishes 36, 104-126. [https://doi.org/10.1111/weng.12203]
  • Celce-Murcia, M. and D. Larsen-Freeman. 1999. The Grammar Book: An ESL/EFL Teacher’s Course (2nd ed.). Heinle and Heinle.
  • Chang, X. and P. Wang. 2016. Influence of second language proficiency and syntactic structure similarities on the sensitivity and processing of English passive sentence in late Chinese-English bilinguists: an ERP study. Journal of Psycholinguistics Research 45, 85-101. [https://doi.org/10.1007/s10936-014-9319-1]
  • Ma, Y. and M. Chen. 2024. AI-empowered applications effects on EFL learners’ engagement in the classroom and academic procrastination. BMC Psychology 12, 739 [https://doi.org/10.1186/s40359-024-02248-w]
  • Djiwandono, P.I. 2016. Lexical richness in academic papers: A comparison between students’ and lecturers’ essays. Indonesian Journal of Applied Linguistics 5(2), 209-216. [https://doi.org/10.17509/ijal.v5i2.1345]
  • Engber, C.A. 1995. The relationship of lexical proficiency to the quality of ESL compositions. Journal of Second Language Writing 4(2), 139-155. [https://doi.org/10.1016/1060-3743(95)90004-7]
  • González‐López, S. and A. López‐López. 2015. Lexical analysis of student research drafts in computing. Computer Applications in Engineering Education 23(4), 638-644. [https://doi.org/10.1002/cae.21638]
  • Ha, D. 2019. Reinforcement learning for improving agent design. Artificial Life 25(4), 352-365. [https://doi.org/10.1162/artl_a_00301]
  • Huddleston, R. and G. Pullman 2002. The Cambridge Grammar of the English Language. Cambridge University Press. [https://doi.org/10.1017/9781316423530]
  • Ionin, T., H. Ko and K. Wexler. 2004. Article semantics in L2 acquisition: The role of specificity. Language acquisition 12(1), 3-69. [https://doi.org/10.1207/s15327817la1201_2]
  • Ishikawa, S. 2011. ICNALE: The international corpus network of Asian learners of English. Corpora 6-2, 121-133.
  • Ishikawa, S. 2011. A corpus-based study on Asian learners’ use of English linking adverbials. Themes in Science and Technology Education 3 1-2, 139-157.
  • Ishikawa, S. 2013. The ICNALE and sophisticated contrastive interlanguage analysis of Asian learners of English. In S. Ishikawa, ed., Learner corpus studies in Asia and the world 1, 91-118. Kobe University.
  • Ishikawa, S. 2024. Lexicogrammar of the L2 English essays written by Asian college students: a corpus-based Study. Journal of Asia TEFL 21(1), pp.119-138. [https://doi.org/10.18823/asiatefl.2024.21.1.7.119]
  • Ishikawa, S. 2025. Quantitative reconsideration of an L1 effect on Asian learners’ L2 English writing: a study based on the ICNALE. LEARN Journal: Language Education and Acquisition Research Network 18(1), 989-1014. [https://doi.org/10.70730/IBWT6076]
  • Johnson, M. D., L. Mercado, and A. Acevedo. 2012. The effect of planning sub-processes on L2 writing fluency, grammatical complexity, and lexical complexity. Journal of Second Language Writing 21, 264–282. [https://doi.org/10.1016/j.jslw.2012.05.011]
  • Krashen, S. D. 1985. The Input Hypothesis: Issues and Implication. Longman.
  • Kyle, K. and S.A. Crossley. 2015. Automatically assessing lexical sophistication: Indices, tools, findings, and application. TESOL Quarterly 49(4), 757-786. [https://doi.org/10.1002/tesq.194]
  • Laufer, B. and P. Nation. 1995. Vocabulary size and use: Lexical richness in L2 written production. Applied linguistics 16(3), 307-322. [https://doi.org/10.1093/applin/16.3.307]
  • Lee, B. W. and J. Lee. 2023. LFTK: Handcrafted features in computational linguistics. In Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023) (pp. 1-19). Association for Computational Linguistics. [https://doi.org/10.18653/v1/2023.bea-1.1]
  • Loewen, S. 2015. Introduction to Instructed Second Language Acquisition. Routledge. [https://doi.org/10.4324/9780203117811]
  • Long, M.H. 1981. INPUT, INTERACTION, AND SECOND-LANGUAGE ACQUISITION. Annals of the New York Academy of Sciences 379, 259-278. [https://doi.org/10.1111/j.1749-6632.1981.tb42014.x]
  • Lu, X. 2010. Automatic analysis of syntactic complexity in second language writing. International Journal of Corpus Linguistics 15, 474–496. [https://doi.org/10.1075/ijcl.15.4.02lu]
  • Lu, X. 2012. The relationship of lexical richness to the quality of ESL learners’ oral narratives. Modern Language Journal 96(2), 190-208. [https://doi.org/10.1111/j.1540-4781.2011.01232_1.x]
  • Master, P. 1987. Teaching the English article as a binary system. TESOL Quarterly 24(3), 461-478. [https://doi.org/10.2307/3587230]
  • Nation, I.S. and I.S.P. Nation. 2001. Learning Vocabulary in Another Language. Cambridge university press. [https://doi.org/10.1017/CBO9781139524759]
  • Norris, J. M. and L. Ortega. 2009. Towards an organic approach to investigating CAF in instructed SLA: the case of complexity. Applied Linguistics 30, 555-578. [https://doi.org/10.1093/applin/amp044]
  • O’Leary, J.A. and R. Steinkrauss. 2022. Syntactic and lexical complexity in L2 English academic writing: development and competition. Ampersand 9, 100096. [https://doi.org/10.1016/j.amper.2022.100096]
  • Ortega, L. 2003. Syntactic complexity measures and their relationship to L2 proficiency: A research synthesis of college‐level L2 writing. Applied linguistics 24(4). 492-518. [https://doi.org/10.1093/applin/24.4.492]
  • Ortega, L. 2005. Methodology, epistemology, and ethics in instructed SLA research: An introduction. The Modern Language Journal 89(3). 317-327. [https://doi.org/10.1111/j.1540-4781.2005.00307.x]
  • Ortega, L. 2015. Syntactic complexity in L2 writing: progress and expansion. Journal of Second Language Writing 29, 82-94. [https://doi.org/10.1016/j.jslw.2015.06.008]
  • Park, S. 2014. L2 Acquisition of Genericity in English Articles: The Case of Korean Adult Learners of L2 English. Doctoral dissertation. University of Sheffield.
  • Park, S. 2023 Corpus analysis of L2 English article usage patterns and pedagogical implications. Cogent Education 10(1). [https://doi.org/10.1080/2331186X.2023.2197662]
  • Quirk, R., Greenbaum, S., Leech and J. Svartvik. 1985. A Comprehensive Grammar of the English language. Longman.
  • Schenck, A. 2019. The impact of form-focused instruction on Korean learner production: A comprehensive study of technique and timing. English Teaching 74(2), 75-102. [https://doi.org/10.15858/engtea.74.2.201906.75]
  • Schmitt, N. 2008. Instructed second language vocabulary learning. Language Teaching Research 12(3), 329-363. [https://doi.org/10.1177/1362168808089921]
  • Shehadeh, A. and A. Coombe. 2012. Task-Based Language Teaching in Foreign Language Contexts: Research and Implementation. Task-Based Language Teaching: V. 4. John Benjamins Pub. Co. [https://doi.org/10.1075/tblt.4]
  • Tyler, A. and V. Evans. 2003. The Semantics of English Prepositions: Spatial Scenes, Embodied Meanings and Cognition. Cambridge University Press. [https://doi.org/10.1017/CBO9780511486517]
  • Vedder, I. and V. Benigno. 2016. Lexical richness and collocational competence in second-language writing. International Review of Applied Linguistics in Language Teaching 54(1), 23-42. [https://doi.org/10.1515/iral-2016-0015]
  • Vold, E. T. 2022. Learner spoken output and teacher response in second versus foreign language classrooms. Language Teaching Research 29(2). [https://doi.org/10.1177/13621688211068610]
  • Wei. L. 2023. Artificial intelligence in language instruction: impact on English learning achievement, L2 motivation, and self-regulated learning. Frontiers in Psychology. [https://doi.org/10.3389/fpsyg.2023.1261955]
  • Zhang, Y. and J. Ouyang. 2023. Linguistic complexity as the predictor of EFL independent and integrated writing quality. Assessing Writing 56, 100727. [https://doi.org/10.1016/j.asw.2023.100727]