Alloprof: a new French question-answer education dataset and its use in an information retrieval case study
This paper introduces a French question-answering dataset from Alloprof (a Quebec K-12 help website) containing 29,349 student questions across subjects, and presents a case study using BERT models for information retrieval to match student questions with relevant educational resources. The work focuses on creating and evaluating retrieval systems to help students find answers quickly on the platform.
Teachers and students are increasingly relying on online learning resources to supplement the ones provided in school. This increase in the breadth and depth of available resources is a great thing for students, but only provided they are able to find answers to their queries. Question-answering and information retrieval systems have benefited from public datasets to train and evaluate their algorithms, but most of these datasets have been in English text written by and for adults. We introduce