AIED
Validity and Reliability of AI-Based Educational Measurement
Mon Jun 29, 9:00 AM–10:15 AM · Room 101
Automated Assessment & Scoring · Psychometrics & Educational Measurement · Generative AI & Large Language Models · Explainable & Trustworthy AI
★ Notable speakers
Matthias von Davier ★★: Item response theory; diagnostic classification models; large-scale international assessment (TIMSS, PIRLS, PISA)
Pantelis M. Papadopoulos ★: Computer-supported collaborative learning, conversational agents, learning analytics
Gábor Kismihók ★: Learning and skills analytics, labor market matching, technology-enhanced learning
AIED08 (TS) | Technical | Short-paper session
5 talks in this session
Measuring What Matters, or What’s Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors
Cole Walsh, Rodica Ivan
Beyond the Gold Standard: Reliability Estimation of Human and GenAI Scoring
Ji Yoon Jung, Ummugul Bezirhan, Matthias von Davier
Contrastive Network-based Similarity for Zero-Shot Automatic Scoring of Very Short Handwritten Answers
Nam Tuan Ly, Hung Tuan Nguyen, Truong Thanh-Nghia, Masaki Nakagawa
Cross-Dataset Bloom Question Classification: Supervised Models and Prompted LLMs
Mohammadreza Molavi Hajiagha, Abdolali Faraji, Zohre Rasoulkhani, Mohammadreza Tavakoli, Gábor Kismihók
Agnoagentia: The Illusion of Agency in AI-Assisted Learning
Iris Delikoura, Pantelis M. Papadopoulos, Pan Hui