On the robustness of ChatGPT in teaching Korean Mathematics

Relevance: 7/10 1 cited 2025 paper

This paper evaluates ChatGPT's accuracy in solving 586 Korean mathematics questions (achieving 66.72% accuracy) and its ability to rate question difficulty using eleven criteria, examining the model's robustness in multilingual educational settings.

ChatGPT, an Artificial Intelligence model, has the potential to revolutionize education. However, its effectiveness in solving non-English questions remains uncertain. This study evaluates ChatGPT's robustness using 586 Korean mathematics questions. ChatGPT achieves 66.72% accuracy, correctly answering 391 out of 586 questions. We also assess its ability to rate mathematics questions based on eleven criteria and perform a topic analysis. Our findings show that ChatGPT's ratings align with educat

Tool Types

AI Tutors 1-to-1 conversational tutoring systems.

Tags

multilingual evaluation educationcomputer-sciencemathematics