Computer Science > Computation and Language

arXiv:2302.06690 (cs)

[Submitted on 13 Feb 2023]

Title:Bag of Tricks for In-Distribution Calibration of Pretrained Transformers

Authors:Jaeyoung Kim, Dongbin Na, Sungchul Choi, Sungbin Lim

View PDF

Abstract:While pre-trained language models (PLMs) have become a de-facto standard promoting the accuracy of text classification tasks, recent studies find that PLMs often predict over-confidently. Although various calibration methods have been proposed, such as ensemble learning and data augmentation, most of the methods have been verified in computer vision benchmarks rather than in PLM-based text classification tasks. In this paper, we present an empirical study on confidence calibration for PLMs, addressing three categories, including confidence penalty losses, data augmentations, and ensemble methods. We find that the ensemble model overfitted to the training set shows sub-par calibration performance and also observe that PLMs trained with confidence penalty loss have a trade-off between calibration and accuracy. Building on these observations, we propose the Calibrated PLM (CALL), a combination of calibration techniques. The CALL complements the drawbacks that may occur when utilizing a calibration method individually and boosts both classification and calibration accuracy. Design choices in CALL's training procedures are extensively studied, and we provide a detailed analysis of how calibration techniques affect the calibration performance of PLMs.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2302.06690 [cs.CL]
	(or arXiv:2302.06690v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.06690

Submission history

From: Jaeyoung Kim [view email]
[v1] Mon, 13 Feb 2023 21:11:52 UTC (7,155 KB)

Computer Science > Computation and Language

Title:Bag of Tricks for In-Distribution Calibration of Pretrained Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Bag of Tricks for In-Distribution Calibration of Pretrained Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators