Computer Science > Computation and Language

arXiv:2311.09564 (cs)

[Submitted on 16 Nov 2023]

Title: LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

Authors: Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral

Abstract: Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been systematically explored. Assessing these models on long sequences is crucial since prior work in the general domain has demonstrated performance degradation of LLMs on longer texts. Motivated by this, we introduce LongBoX, a collection of seven medical datasets in text-to-text format, designed to investigate model performance on long sequences. Preliminary experiments reveal that both medical LLMs (e.g., BioGPT) and strong general domain LLMs (e.g., FLAN-T5) struggle on this benchmark. We further evaluate two techniques designed for long-sequence handling: (i) local-global attention, and (ii) Fusion-in-Decoder (FiD). The results with these techniques are mixed: while scores on some datasets increase, there is substantial room for improvement. We hope that LongBoX facilitates the development of more effective long-sequence techniques for the medical domain. Data and source code are available at this https URL.

Comments:	8 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.09564 [cs.CL]
	(or arXiv:2311.09564v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09564

Submission history

From: Mihir Parmar
[v1] Thu, 16 Nov 2023 04:57:49 UTC (6,735 KB)