Eurokd
European KnowledgeDevelopment Institute
Language Teaching Research Quarterly

e‐ISSN

    

2667-6753

CiteScore

  exclamation mark

1.2

ICV

  exclamation mark

124.94

SNIP

  exclamation mark

0.604

SJR

  exclamation mark

0.283

CiteScore

  exclamation mark

1.2

ICV

  exclamation mark

124.94

SNIP

  exclamation mark

0.604

SJR

  exclamation mark

0.283

SCOPUSEBSCOProQuestCrossrefIndex CopernicusMIAR

Perspective Article

Where Assessment Validation and Responsible AI Meet

Language Teaching Research Quarterly, Volume 50, Pages 120-137, https://doi.org/10.32038/ltrq.2025.50.09

The core principles of validity, reliability, and fairness have been the foundation of ethical assessment practices, as discussed in classical validation theories (e.g., Chapelle et al., 2008; Kane, 1992, 2013) and the American Educational Research Association (AERA), the American Psychological Association (APA), and the National Council on Measurement in Education Standards (NCME) (AERA & APA & NCME, 2014). The Standards have provided best practices for AI use in high-stakes testing, particularly in the automated scoring of written and spoken responses. Responsible AI (RAI) is essential across industry domains, including educational assessments. With recent advancements in generative AI, new policies and guidance on applying RAI principles in assessment have emerged. Expanding on Chapelle et al.'s (2008) work, this paper introduces a unified assessment framework that integrates traditional validation theory with both assessment-specific and domain-agnostic RAI principles. This framework supports responsible AI use, aligns with ethical principles to uphold human values and oversight, and promotes broader social responsibility in AI-driven assessments.

Loading PDF…
next

Page 1 of

next

Download Count : 28

Visit Count : 50

Publisher’s Note

The claims, arguments, and counter-arguments made in this article are exclusively those of the contributing authors. Hence, they do not necessarily represent the viewpoints of the authors’ affiliated institutions, or EUROKD as the publisher, the editors and the reviewers of the article.

 

Acknowledgements

We would like to acknowledge Ravit Dotan for her guidance about the alignment between the NIST AI RMF and the DET RAI Standards. Many thanks to our Duolingo colleagues, Ben Naismith, Yena Park, and Alina von Davier for their reviews.

 

Funding

The article was funded by Duolingo as both authors are employed by Duolingo.

 

CRediT Authorship Contribution Statement

Jill Burstein: Conceptualization, Investigation, and Drafting, Reviewing and Editing

Geoff LaFlair: Conceptualization, Investigation, and Drafting, Reviewing and Editing

 

Generative AI Use Disclosure Statement

Generative AI was used to implement copyediting.

 

Ethics Declarations

World Medical Association (WMA) Declaration of Helsinki–Ethical Principles for Medical Research Involving Human Participants

No human participants were involved.

 

Competing Interests

Both authors are employed by Duolingo.

 

Data Availability

This is a theoretical paper.