Evaluating Reading Comprehension Exercises Generated by LLMs: A Showcase of ChatGPT in Education Applications
This paper evaluates ChatGPT's ability to generate personalized reading comprehension exercises (passages and multiple-choice questions) for middle school English learners in China, comparing AI-generated materials against human-written textbook exercises through both automatic and manual evaluation by students, teachers, and native speakers.
The recent advancement of pre-trained Large Language Models (LLMs), such as OpenAI’s ChatGPT, has led to transformative changes across fields. For example, intelligent educational systems that leverage the linguistic capabilities of LLMs show clear potential. Although researchers have recently explored how ChatGPT could assist student learning, few studies have applied these techniques in real-world classroom settings involving teachers and students.