FoL 2026
AIED

Has Automated Essay Scoring Reached Sufficient Accuracy? Deriving Achievable QWK Ceilings from Classical Test Theory

Thu Jul 2, 11:35 AM–12:00 PM · R2 (Auditorium Meeting 2)
★ Notable speakers
Masaki Uto — Item response theory for automated essay scoring; Bayesian psychometrics; multidimensional IRT; neural AES

Derives achievable quadratic weighted kappa (QWK) ceilings for automated essay scoring from classical test theory, to assess whether current systems are already sufficiently accurate.

Authors

Masaki Uto