Mon Jun 29 · 9:15 AM · Room 101 Beyond the Gold Standard: Reliability Estimation of Human and GenAI Scoring in Validity and Reliability of AI-Based Educational Measurement