Computer Science > Computation and Language

arXiv:2309.04087 (cs)

[Submitted on 8 Sep 2023]

Title: Unsupervised Multi-document Summarization with Holistic Inference

Authors: Haopeng Zhang, Sangwoo Cho, Kaiqiang Song, Xiaoyang Wang, Hongwei Wang, Jiawei Zhang, Dong Yu


Abstract: Multi-document summarization aims to obtain core information from a collection of documents written on the same topic. This paper proposes a new holistic framework for unsupervised multi-document extractive summarization. Our method incorporates the holistic beam search inference method associated with the holistic measurements, named Subset Representative Index (SRI). SRI balances the importance and diversity of a subset of sentences from the source documents and can be calculated in unsupervised and adaptive manners. To demonstrate the effectiveness of our method, we conduct extensive experiments on both small and large-scale multi-document summarization datasets under both unsupervised and adaptive settings. The proposed method outperforms strong baselines by a significant margin, as indicated by the resulting ROUGE scores and diversity measures. Our findings also suggest that diversity is essential for improving multi-document summary performance.
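The abstract describes scoring a whole subset of sentences by importance and diversity, then searching over subsets with beam search, but it does not give the SRI formula. The sketch below is only an illustration of that general idea under loud assumptions: it is not the authors' SRI or their inference procedure. The scoring function (mean cosine similarity to the centroid minus mean pairwise similarity), the `weight` parameter, and the names `subset_score` and `beam_search_subset` are all hypothetical, and sentences are assumed to be already embedded as plain lists of floats.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def subset_score(subset, vectors, centroid, weight=1.0):
    """Hypothetical subset-level score, NOT the paper's SRI.

    Importance: mean similarity of the chosen sentences to the centroid.
    Redundancy: mean pairwise similarity inside the subset.
    """
    importance = sum(cosine(vectors[i], centroid) for i in subset) / len(subset)
    pairs = list(combinations(subset, 2))
    redundancy = (
        sum(cosine(vectors[i], vectors[j]) for i, j in pairs) / len(pairs)
        if pairs
        else 0.0
    )
    return importance - weight * redundancy


def beam_search_subset(vectors, size, beam_width=3, weight=1.0):
    """Grow subsets one sentence at a time, keeping the best `beam_width` subsets."""
    dim = len(vectors[0])
    centroid = [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]
    beams = [()]
    for _ in range(size):
        candidates = set()
        for subset in beams:
            for i in range(len(vectors)):
                if i not in subset:
                    candidates.add(tuple(sorted(subset + (i,))))
        # Score each candidate as a whole subset; ties break on the index tuple.
        ranked = sorted(
            candidates,
            key=lambda s: (-subset_score(s, vectors, centroid, weight), s),
        )
        beams = ranked[:beam_width]
    return list(beams[0])


if __name__ == "__main__":
    # Sentences 0 and 1 are near-duplicates; sentence 2 covers something else.
    toy_vectors = [[1.0, 0.0], [1.0, 0.01], [0.0, 1.0]]
    print(beam_search_subset(toy_vectors, size=2))
```

Because each candidate is scored as a complete subset rather than sentence by sentence, a pair of near-duplicate sentences is penalized even when both are individually close to the centroid, which is the kind of importance and diversity trade-off the abstract attributes to a holistic measure.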

Comments:	Findings of IJCNLP-AACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.04087 [cs.CL]
	(or arXiv:2309.04087v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.04087

Submission history

From: Haopeng Zhang
[v1] Fri, 8 Sep 2023 02:56:30 UTC (6,872 KB)
