Computer Science > Computation and Language

arXiv:2409.09249 (cs)

[Submitted on 14 Sep 2024 (v1), last revised 18 Sep 2024 (this version, v2)]

Title:NovAScore: A New Automated Metric for Evaluating Document Level Novelty

Authors:Lin Ai, Ziwei Gong, Harshsaiprasad Deshpande, Alexander Johnson, Emmy Phung, Ahmad Emami, Julia Hirschberg

Abstract:The rapid expansion of online content has intensified the issue of information redundancy, underscoring the need for solutions that can identify genuinely new information. Despite this challenge, the research community has seen a decline in focus on novelty detection, particularly with the rise of large language models (LLMs). Additionally, previous approaches have relied heavily on human annotation, which is time-consuming, costly, and particularly challenging when annotators must compare a target document against a vast number of historical documents. In this work, we introduce NovAScore (Novelty Evaluation in Atomicity Score), an automated metric for evaluating document-level novelty. NovAScore aggregates the novelty and salience scores of atomic information, providing high interpretability and a detailed analysis of a document's novelty. With its dynamic weight adjustment scheme, NovAScore offers enhanced flexibility and an additional dimension to assess both the novelty level and the importance of information within a document. Our experiments show that NovAScore strongly correlates with human judgments of novelty, achieving a 0.626 Point-Biserial correlation on the TAP-DLND 1.0 dataset and a 0.920 Pearson correlation on an internal human-annotated dataset.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2409.09249 [cs.CL]
	(or arXiv:2409.09249v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2409.09249

Submission history

From: Lin Ai [view email]
[v1] Sat, 14 Sep 2024 01:21:56 UTC (10,116 KB)
[v2] Wed, 18 Sep 2024 17:44:08 UTC (10,116 KB)

Computer Science > Computation and Language

Title:NovAScore: A New Automated Metric for Evaluating Document Level Novelty

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:NovAScore: A New Automated Metric for Evaluating Document Level Novelty

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators