Nothing Special   »   [go: up one dir, main page]

On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles

Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata, Haimin Zhang


Abstract
In this paper, we study the importance of context in predicting the citation worthiness of sentences in scholarly articles. We formulate this problem as a sequence labeling task solved using a hierarchical BiLSTM model. We contribute a new benchmark dataset containing over two million sentences and their corresponding labels. We preserve the sentence order in this dataset and perform document-level train/test splits, which importantly allows incorporating contextual information in the modeling process. We evaluate the proposed approach on three benchmark datasets. Our results quantify the benefits of using context and contextual embeddings for citation worthiness. Lastly, through error analysis, we provide insights into cases where context plays an essential role in predicting citation worthiness.
Anthology ID:
2021.naacl-main.359
Volume:
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
June
Year:
2021
Address:
Online
Editors:
Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tur, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, Yichao Zhou
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4539–4545
Language:
URL:
https://aclanthology.org/2021.naacl-main.359
DOI:
10.18653/v1/2021.naacl-main.359
Bibkey:
Cite (ACL):
Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata, and Haimin Zhang. 2021. On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4539–4545, Online. Association for Computational Linguistics.
Cite (Informal):
On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles (Gosangi et al., NAACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.naacl-main.359.pdf
Optional supplementary data:
 2021.naacl-main.359.OptionalSupplementaryData.pdf
Video:
 https://aclanthology.org/2021.naacl-main.359.mp4
Data
PMOA-CITE