Computer Science > Computation and Language

arXiv:2405.14039 (cs)

[Submitted on 22 May 2024 (v1), last revised 30 Oct 2024 (this version, v2)]

Title:Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning

Authors:Yiming Wang, Pei Zhang, Baosong Yang, Derek F. Wong, Zhuosheng Zhang, Rui Wang

Abstract:Real-world data deviating from the independent and identically distributed (i.i.d.) assumption of in-distribution training data poses security threats to deep networks, thus advancing out-of-distribution (OOD) detection algorithms. Detection methods in generative language models (GLMs) mainly focus on uncertainty estimation and embedding distance measurement, with the latter proven to be most effective in traditional linguistic tasks like summarization and translation. However, another complex generative scenario mathematical reasoning poses significant challenges to embedding-based methods due to its high-density feature of output spaces, but this feature causes larger discrepancies in the embedding shift trajectory between different samples in latent spaces. Hence, we propose a trajectory-based method TV score, which uses trajectory volatility for OOD detection in mathematical reasoning. Experiments show that our method outperforms all traditional algorithms on GLMs under mathematical reasoning scenarios and can be extended to more applications with high-density features in output spaces, such as multiple-choice questions.

Comments:	Accepted by NeurIPS 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.14039 [cs.CL]
	(or arXiv:2405.14039v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.14039

Submission history

From: Yiming Wang [view email]
[v1] Wed, 22 May 2024 22:22:25 UTC (1,958 KB)
[v2] Wed, 30 Oct 2024 12:10:42 UTC (2,487 KB)

Computer Science > Computation and Language

Title:Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators