VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

Benchmark (Published & Automated) | Relevance: 7/10 | Cited by 22 | 2023 paper

This paper introduces VNHSGE, a Vietnamese high school graduation examination dataset with 19,000+ multiple-choice questions and 300 essays across 9 subjects (math, literature, English, sciences, social studies) designed to evaluate large language models like ChatGPT and BingChat against human student performance. The benchmark includes both textual and visual questions, assessing capabilities in question answering, reading comprehension, and visual question answering.

This article introduces the VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs). The dataset covers nine subjects and was generated from the Vietnamese National High School Graduation Examination and comparable tests. It includes 300 literary essays and over 19,000 multiple-choice questions on a range of topics. The dataset assesses LLMs in multitasking situations such as question answering

Study Type

Benchmark (Published & Automated)

Tags

large language model evaluation, education, computer-science