How Many Raters Can Be Enough: G Theory Applied to Assessment and Measurement of L2 Speech Perception

Kevin Hirschi, Okim Kang

doi:10.32038/ltrq.2023.37.12

European KnowledgeDevelopment Institute

Original Research

Language Teaching Research Quarterly, Volume 37, Pages 213-230, https://doi.org/10.32038/ltrq.2023.37.12

Abstract

This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets from previous studies, it reports on G studies (a method of breaking down measurement variance) and D studies (predictive analyses of how reliability changes when the number of raters, items, or other facets is modified), which can assist the field in adopting measurement designs that include comprehensibility, accentedness, and intelligibility. When data from a single audio sample per learner were subjected to D studies, both semantic differential and rubric scales for comprehensibility were reliable at the .90 level with about 15 trained raters or 50 untrained crowdsourced raters. To support generalizable and dependable evaluations, empirically informed recommendations are given, including considerations for the number of speech samples rated and the granularity of the scales for various assessment and research purposes.
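The D-study logic described in the abstract can be illustrated with a minimal sketch. It assumes the simplest case, a fully crossed persons-by-raters design with a single rater facet, in which the relative G coefficient is the person variance divided by the person variance plus the residual variance averaged over raters. The function names and the variance components below are hypothetical and chosen only for illustration; they are not taken from the paper, whose designs may include additional facets such as items or speech samples.

```python
def g_coefficient(var_person: float, var_residual: float, n_raters: int) -> float:
    """Relative G coefficient for a crossed persons x raters design.

    var_person   : variance attributable to true differences among speakers
    var_residual : person-by-rater interaction variance, confounded with error
    n_raters     : number of raters whose scores are averaged
    """
    if n_raters < 1:
        raise ValueError("n_raters must be at least 1")
    if var_person <= 0 or var_residual < 0:
        raise ValueError("variance components must be positive")
    # Averaging over more raters shrinks the error term by a factor of n_raters.
    return var_person / (var_person + var_residual / n_raters)


def raters_needed(var_person: float, var_residual: float, target: float = 0.90) -> int:
    """Smallest number of raters whose averaged scores reach the target coefficient."""
    if not 0 < target < 1:
        raise ValueError("target must lie strictly between 0 and 1")
    n = 1
    # The coefficient rises monotonically toward 1, so this loop terminates.
    while g_coefficient(var_person, var_residual, n) < target:
        n += 1
    return n


if __name__ == "__main__":
    # Hypothetical variance components, NOT estimates from the paper.
    var_p, var_res = 1.0, 1.5
    for n in (1, 5, 10, 15, 20):
        print(n, round(g_coefficient(var_p, var_res, n), 3))
    print("raters for .90:", raters_needed(var_p, var_res, 0.90))
```

In a sketch like this, a noisier rater pool shows up as a larger residual component relative to person variance, which pushes the required number of raters upward. That is one way to read the gap the abstract reports between trained and untrained crowdsourced raters.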

Keywords

L2 Speech; Generalizability; Comprehensibility; Accentedness; Intelligibility; Measurement

Author(s) Information

Kevin Hirschi

Kevin Hirschi is a PhD Candidate in Applied Linguistics at Northern Arizona University. His research interests include second language acquisition with a focus on L2 pronunciation, technology for analytic and learning purposes, corpus linguistics, and quantitative research methods. He has taught language, linguistics, and pedagogy courses at institutions in three countries and supported university students in becoming effective language learners, users, and teachers. In his research, he leverages large data sets and computational techniques for statistical and perceptual analyses of speech. He also has experience in technology development for web applications, primarily for pedagogical purposes.

Okim Kang

Okim Kang is Professor of Applied Linguistics and Director of the Applied Linguistics Speech Lab at Northern Arizona University, Flagstaff, AZ. Her research interests are speech production and perception, L2 pronunciation and intelligibility, L2 oral assessment and testing, automated scoring and speech recognition, World Englishes, and language attitude. She has published over 6 books, 1 edited volume for conference proceedings, over 70 journal articles, 17 book chapters, over 40 invited and keynote speech presentations, and more than 130 conference presentations. She is currently an associate editor for Applied Linguistics.

Acknowledgments

Not applicable.

Funding

Not applicable.

Conflict of Interests

The authors declare no conflicting interests.

Open Access

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. You may view a copy of Creative Commons Attribution 4.0 International License here: http://creativecommons.org/licenses/by/4.0/