Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models
Edu-Values is a Chinese education values benchmark with 1,418 questions evaluating LLMs across seven core educational dimensions including professional philosophy, teachers' ethics, education laws, cultural literacy, educational knowledge/skills, basic competencies, and subject knowledge. The benchmark evaluates 21 LLMs using human feedback-based automatic evaluation and finds Chinese LLMs outperform English ones, with particular weaknesses in professional ethics and philosophy.
In this paper, we present Edu-Values, the first Chinese education values evaluation benchmark that includes seven core values: professional philosophy, teachers' professional ethics, education laws and regulations, cultural literacy, educational knowledge and skills, basic competencies and subject knowledge. We meticulously design 1,418 questions, covering multiple-choice, multi-modal question answering, subjective analysis, adversarial prompts, and Chinese traditional culture (short answer) que