A Report on the llms evaluating the high school questions
This report evaluates the performance of 8+ large language models on Chinese high school mathematics college entrance examination questions (2019-2023), assessing accuracy, response time, logical reasoning, and creativity. The study aims to understand LLMs' potential as educational tools for K-12 mathematics instruction.
This report aims to evaluate the performance of large language models (LLMs) in solving high school science questions and to explore their potential applications in the educational field. With the rapid development of LLMs in the field of natural language processing, their application in education has attracted widespread attention. This study selected mathematics exam questions from the college entrance examinations (2019-2023) as evaluation data and utilized at least eight LLM APIs to provide