STORYSUMM: Evaluating Faithfulness in Story Summarization.

AllImages Videos Books Maps News Shopping

STORYSUMM: Evaluating Faithfulness in Story Summarization

Jul 9, 2024 · We therefore introduce a new dataset, STORYSUMM, comprising LLM summaries of short stories with localized faithfulness labels and error ...

StorySumm: Evaluating Faithfulness in Story Summarization - arXiv

arxiv.org › html

Jul 9, 2024 · StorySumm consists of 96 short stories and LLM-generated summaries with localized faithfulness errors and explanations. Each unfaithful summary is labeled as ...

STORYSUMM: Evaluating Faithfulness in Story Summarization

github.com › melaniesubbiah › storysumm

The StorySumm dataset is in the file storysumm.json. Description of data fields: label - final label for the summary (0 is unfaithful, 1 is faithful)

(PDF) STORYSUMM: Evaluating Faithfulness in Story Summarization

www.researchgate.net › ... › Faith

Jul 9, 2024 · We therefore introduce a new dataset, STORYSUMM, comprising LLM summaries of short stories with localized faithfulness labels and error ...

STORYSUMM: Evaluating Faithfulness in Story Summarization

www.aimodels.fyi › papers › arxiv › stor...

Jul 9, 2024 · This paper introduces StorySumm, a new dataset for evaluating the faithfulness of story summarization models.

AI Papers on X: "STORYSUMM: Evaluating Faithfulness in Story ...

twitter.com › SciFi › status

Jul 10, 2024 · Human evaluation has been the gold standard for checking faithfulness in abstractive summarization. However, with a challenging source ...

STORYSUMM: Evaluating Faithfulness in Story Summarization

www.zhuanzhi.ai › paper

Human evaluation has been the gold standard for checking faithfulness in abstractive summarization. However, with a challenging source domain like narrative ...

Reading Subtext: Evaluating Large Language Models on Short Story ...

www.semanticscholar.org › paper › Read...

STORYSUMM: Evaluating Faithfulness in Story Summarization · Melanie Subbiah ... A new dataset, STORYSUMM, comprising LLM summaries of short stories with localized ...

Faisal Ladhak - CatalyzeX

www.catalyzex.com › author

We therefore introduce a new dataset, STORYSUMM, comprising LLM summaries of short stories with localized faithfulness labels and error explanations. This ...

A STORYSUMM example illustrating an incorrect interpretation of double...

www.researchgate.net › figure › A-STOR...

By focusing on faithfulness in narrative summarization and using real-world data from LLMs and Reddit, STORYSUMM poses a realistic but hard benchmark to push ...