VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
This paper introduces the VNHSGE dataset, comprising over 19,000 multiple-choice questions and 300 essays across nine subjects from Vietnamese high school graduation examinations, designed to benchmark large language models' performance against student-level competencies. The authors evaluate ChatGPT and BingChat on this dataset, comparing their performance to Vietnamese students across subjects including mathematics, literature, sciences, and humanities.
This article introduces the VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs). The dataset covers nine subjects and was drawn from the Vietnamese National High School Graduation Examination and comparable tests. It includes 300 literary essays and over 19,000 multiple-choice questions on a range of topics. The dataset assesses LLMs in multitasking situations such as question answering