A Report on the llms evaluating the high school questions

Research / Other | Relevance: 7/10 | 2025 paper

This report evaluates the performance of at least eight large language models (LLMs) on high school mathematics questions from the Chinese college entrance examination (2019-2023), assessing accuracy, response time, logical reasoning, and creativity. The study aims to understand the potential of LLMs as educational tools for K-12 mathematics instruction.

This report aims to evaluate the performance of large language models (LLMs) in solving high school science questions and to explore their potential applications in education. As LLMs have developed rapidly within natural language processing, their use in education has attracted widespread attention. This study selected mathematics questions from the college entrance examinations (2019-2023) as evaluation data and utilized at least eight LLM APIs to provide

Study Type

Research / Other

Framework Categories

1 General reasoning

3.1 Content knowledge
