On the robustness of ChatGPT in teaching Korean Mathematics
Relevance: 7/10 | 1 citation | 2025 paper
This paper evaluates ChatGPT's accuracy in solving 586 Korean mathematics questions (achieving 66.72% accuracy) and its ability to rate question difficulty using eleven criteria, examining the model's robustness in multilingual educational settings.
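The reported accuracy can be checked against the counts given in the abstract (391 correct answers out of 586 questions). The snippet below is only a sanity check of that arithmetic; the helper name `accuracy_percent` is a hypothetical choice for illustration and does not come from the paper.

```python
def accuracy_percent(correct: int, total: int) -> float:
    """Return accuracy as a percentage of correctly answered questions."""
    if total <= 0:
        raise ValueError("total must be a positive number of questions")
    if not 0 <= correct <= total:
        raise ValueError("correct must lie between 0 and total")
    return 100.0 * correct / total


# Counts as stated in the abstract: 391 of 586 questions answered correctly.
acc = accuracy_percent(391, 586)
print(f"{acc:.2f}%")  # prints "66.72%"
```

The result agrees with the 66.72% figure quoted in the summary and abstract.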
ChatGPT, an Artificial Intelligence model, has the potential to revolutionize education. However, its effectiveness in solving non-English questions remains uncertain. This study evaluates ChatGPT's robustness using 586 Korean mathematics questions. ChatGPT achieves 66.72% accuracy, correctly answering 391 out of 586 questions. We also assess its ability to rate mathematics questions based on eleven criteria and perform a topic analysis. Our findings show that ChatGPT's ratings align with educat
Framework Categories
Tool Types
AI Tutors: 1-to-1 conversational tutoring systems.
Tags
multilingual evaluation, education, computer-science, mathematics