VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

Benchmark (Published & Automated) Relevance: 7/10 22 cited 2023 paper

This paper introduces VNHSGE, a Vietnamese high school graduation examination dataset with 19,000+ multiple-choice questions and 300 essays across 9 subjects (math, literature, English, sciences, social studies) designed to evaluate large language models like ChatGPT and BingChat against human student performance. The benchmark includes both textual and visual questions, assessing capabilities in question answering, reading comprehension, and visual question answering.

The VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs), is introduced in this article. The dataset, which covers nine subjects, was generated from the Vietnamese National High School Graduation Examination and comparable tests. 300 literary essays have been included, and there are over 19,000 multiple-choice questions on a range of topics. The dataset assesses LLMs in multitasking situations such as question answering

Study Type

Benchmark (Published & Automated)

Source

View source Open PDF

Framework Categories

1 General reasoning 3.1 Content knowledge 3.2 Content alignment 6.1 Multimodal capabilities

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

Study Type

Source

Framework Categories

Tool Types

Tags