Computer Science > Computation and Language

arXiv:2210.03992 (cs)

[Submitted on 8 Oct 2022 (v1), last revised 2 Jan 2023 (this version, v3)]

Title:Generative Language Models for Paragraph-Level Question Generation

Authors:Asahi Ushio, Fernando Alva-Manchego, Jose Camacho-Collados

View PDF

Abstract:Powerful generative models have led to recent progress in question generation (QG). However, it is difficult to measure advances in QG research since there are no standardized resources that allow a uniform comparison among approaches. In this paper, we introduce QG-Bench, a multilingual and multidomain benchmark for QG that unifies existing question answering datasets by converting them to a standard QG setting. It includes general-purpose datasets such as SQuAD for English, datasets from ten domains and two styles, as well as datasets in eight different languages. Using QG-Bench as a reference, we perform an extensive analysis of the capabilities of language models for the task. First, we propose robust QG baselines based on fine-tuning generative language models. Then, we complement automatic evaluation based on standard metrics with an extensive manual evaluation, which in turn sheds light on the difficulty of evaluating QG models. Finally, we analyse both the domain adaptability of these models as well as the effectiveness of multilingual models in languages other than English. QG-Bench is released along with the fine-tuned models presented in the paper this https URL, which are also available as a demo this https URL.

Comments:	EMNLP 2022 main conference
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.03992 [cs.CL]
	(or arXiv:2210.03992v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.03992

Submission history

From: Asahi Ushio [view email]
[v1] Sat, 8 Oct 2022 10:24:39 UTC (7,838 KB)
[v2] Fri, 21 Oct 2022 08:42:44 UTC (7,845 KB)
[v3] Mon, 2 Jan 2023 08:17:47 UTC (7,852 KB)

Computer Science > Computation and Language

Title:Generative Language Models for Paragraph-Level Question Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Generative Language Models for Paragraph-Level Question Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators