The Korean Association for the Study of English Language and Linguistics

Korean Journal of English Language and Linguistics - Vol. 19 , No. 1


[ Article ]
Korea Journal of English Language and Linguistics - Vol. 19, No. 1, pp. 76-94
Abbreviation: KASELL
ISSN: 1598-1398 (Print)
Print publication date 31 Mar 2019
Received 14 Feb 2019 Revised 10 Mar 2019 Accepted 19 Mar 2019
DOI: https://doi.org/10.15738/kjell.19.1.201903.76
혼합효과모형(Mixed-Effects Model)을 이용한 실험언어학 데이터 분석 방법 고찰: 자기조절읽기 실험 데이터를 중심으로
신정아

교수, 동국대학교, 영어영문학부, 서울특별시 중구 필동로1길 30, Tel: 02) 2260-3167 (jashin@dongguk.edu)(jashin@gmail.com)
How to analyze experimental linguistic data using a mixed-effects model in R: Focusing on data from a self-paced reading experiment
Shin, Jeong-Ah
Professor, Dongguk Univ., Division of English, 30 Pildong-ro-1-gil, Jung-gu, Seoul, Tel: 02) 2260-3167 (jashin@dongguk.edu)(jashin@gmail.com)




Funding Information ▼ National Research Foundation of Korea NRF-2016K2A9A2A19939367 Dongguk University S-2018-G0001-00022

Abstract

Shin, Jeong-Ah. 2019. How to analyze experimental linguistic data using a mixed-effects model in R: Focusing on data from a self-paced reading experiment. Korean Journal of English Language and Linguistics 19-1, 76-94. This study examined a practical use of mixed-effects models in R, analyzing accuracy and reading time data from a self-paced reading experiment. It discussed the applications of logistic mixed-effects model for binary data (e.g., accuracy data) and the use of a mixed-effects model for reading time (RT) data, effectively removing outliers within the data set. A sample for mixed-effects model analyses was collected from a previously conducted self-paced reading experiment, involving English reduced relative clauses for 30 advanced and intermediate second language learners. Rationales and guidelines toward selecting the most appropriate mixed-effects model and checking model assumptions were also discussed.


Keywords: mixed-effects model, linear mixed model, logistic mixed model, experimental linguistics, psycholinguistics, self-paced reading, reading time, RT data, accuracy

Acknowledgments

이 논문은 2016년 한국연구재단의 국제협력사업(NRF-2016K2A9A2A19939367)과 2018년 동국대학교 우수연구자 지원사업(S-2018-G0001-00022)의 지원을 받아 연구되었음.

References


1.	이준규(Lee, J.). 2011. 반응시간을 이용한 제2 언어 어휘의 연결강도 추정(Estimating the strength of second language word association with reference to reaction times). ≪언어학≫(Journal of the Linguistic Society of Korea) 61, 243-261.
2.	이준규(Lee, J.). 2016. 제2 언어 머릿속 사전에 관한 심리언어학적 탐색(A psycholinguistic inquiry of the second language mental lexicon). ≪언어학≫(Journal of the Linguistic Society of Korea) 74, 51-70.
3.	최재웅·홍정하(Choe, J-W. and J. Hong). (역). 2013. 『언어학자를 위한 통계학-R 활용』(Statistics for Linguistics with R: A Practical Introduction). 고려대학교출판문화원(Korea University Press).
4.	Baayen, R. H., Davidson, D. J., and D. M. Bates. 2008. Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language 59(4), 390-412.
5.	Barr, D. J., R. Levy, C. Scheepers and H. J. Tily. 2013. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language 68(3), 255-278.
6.	Bates, D., M. Maechler, B. Bolker and S. Walker. 2019. lme4: Linear mixed-effects models using Eigen and S4. R package version 1.1-21. http://CRAN.R-project.org/package=lme4
7.	Clark, H. H. 1973. The language-as-fixed-effect fallacy: A critique of language statistics in psychological research. Journal of Verbal Learning and Verbal Behavior 12(4), 335-359.
8.	Drummond, A. 2013. Ibex Farm. Retrieved from http://spellout.net/ibexfarm
9.	Gelman, A. and J. Hill. 2007. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge: Cambridge University Press.
*10.*	Gries, S. 2013. Statistics for Linguistics with R: A Practical Introduction. Berlin: De Gruyter.
*11.*	Jaeger, F. 2008. Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models. Journal of Memory and Language 59, 434-446.
*12.*	Jaeger, F. 2009. Centering several variables. Retrieved from https://hlplab.wordpress.com/2009/04/27/centering-several-variables
*13.*	Jaeger, F. 2008. Modeling self-paced reading data: Effects of word length, word position, spill-over, etc. Retrieved from https://hlplab.wordpress.com/2008/01/23/modeling-self-paced-reading-data-effects-of-word-length-word-position-spill-over-etc/
*14.*	Juffs, A. 1998. Main verb versus reduced relative clause ambiguity resolution in L2 sentence processing. Language Learning 48(1), 107-147.
*15.*	Kuznetsova A., P. B. Brockhoff and R. H. B. Christensen. 2019. lmerTest: Tests in Linear Mixed Effects Models. R package version 3.1-0. http://CRAN.R-project.org/package=lmerTest
*16.*	Lee, J-H. and J-A. Shin. 2016. Syntactic reanalysis and lingering misinterpretations in L2 sentence processing. Linguistic Research 33(S). 53-79.
*17.*	Marsden, E., Thompson, S., and L. Plonsky. 2018. A methodological synthesis of self-paced reading in second language research. Applied Psycholinguistics 1-44.
*18.*	Matuschek, H., R. Kliegl, S. Vasishth, H. Baayen and D. Bates. 2017. Balancing Type I error and power in linear mixed models. Journal of Memory and Language 94, 305-315.
*19.*	R Core Team. 2014. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://www.R-project.org
*20.*	Rah, A. and D. Adone. 2010. Processing of the reduced relative clause versus main verb ambiguity in L2 learners at different proficiency levels. Studies in Second Language Acquisition 32(1), 79-109.
*21.*	Sarkar, D. 2018. lattice: Trellis Graphics for R. R package version 0.20-38. http://CRAN.R-project.org/package=lattice
*22.*	Seo, H-J. and J-A. Shin. 2016. L2 processing of English pronouns and reflexives: An eye-tracking study. Korean Journal of English Language and Linguistics 16(4), 879-901.
*23.*	Winter, B. 2013. Linear models and linear mixed effects models in R with linguistic applications. arXiv:1308.5499 [http://arxiv.org/pdf/1308.5499.pdf]
*24.*	Wurm, L. H. and S. A. Fisicaro. 2014. What residualizing predictors in regression analyses does (and what it does not do). Journal of Memory and Language 72, 37-48.