Computer Science > Computation and Language

arXiv:2403.11904 (cs)

[Submitted on 18 Mar 2024 (v1), last revised 30 May 2024 (this version, v3)]

Title:CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification

Authors:Korbinian Randl, John Pavlopoulos, Aron Henriksson, Tony Lindgren

Abstract:Contaminated or adulterated food poses a substantial risk to human health. Given sets of labeled web texts for training, Machine Learning and Natural Language Processing can be applied to automatically detect such risks. We publish a dataset of 7,546 short texts describing public food recall announcements. Each text is manually labeled, on two granularity levels (coarse and fine), for food products and hazards that the recall corresponds to. We describe the dataset and benchmark naive, traditional, and Transformer models. Based on our analysis, Logistic Regression based on a tf-idf representation outperforms RoBERTa and XLM-R on classes with low support. Finally, we discuss different prompting strategies and present an LLM-in-the-loop framework, based on Conformal Prediction, which boosts the performance of the base classifier while reducing energy consumption compared to normal prompting.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2403.11904 [cs.CL]
	(or arXiv:2403.11904v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.11904

Submission history

From: Korbinian Randl [view email]
[v1] Mon, 18 Mar 2024 16:04:55 UTC (8,048 KB)
[v2] Tue, 2 Apr 2024 10:25:34 UTC (8,048 KB)
[v3] Thu, 30 May 2024 08:37:45 UTC (8,050 KB)

Computer Science > Computation and Language

Title:CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators