Computer Science > Computation and Language

arXiv:2210.16865 (cs)

[Submitted on 30 Oct 2022]

Title:Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

Authors:Ben Zhou, Kyle Richardson, Xiaodong Yu, Dan Roth

View PDF

Abstract:Explicit decomposition modeling, which involves breaking down complex tasks into more straightforward and often more interpretable sub-tasks, has long been a central theme in developing robust and interpretable NLU systems. However, despite the many datasets and resources built as part of this effort, the majority have small-scale annotations and limited scope, which is insufficient to solve general decomposition tasks. In this paper, we look at large-scale intermediate pre-training of decomposition-based transformers using distant supervision from comparable texts, particularly large-scale parallel news. We show that with such intermediate pre-training, developing robust decomposition-based models for a diverse range of tasks becomes more feasible. For example, on semantic parsing, our model, DecompT5, improves 20% to 30% on two datasets, Overnight and TORQUE, over the baseline language model. We further use DecompT5 to build a novel decomposition-based QA system named DecompEntail, improving over state-of-the-art models, including GPT-3, on both HotpotQA and StrategyQA by 8% and 4%, respectively.

Comments:	Accepted at EMNLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.16865 [cs.CL]
	(or arXiv:2210.16865v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.16865

Submission history

From: Ben Zhou [view email]
[v1] Sun, 30 Oct 2022 15:38:03 UTC (168 KB)

Computer Science > Computation and Language

Title:Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators