Computer Science > Computation and Language

arXiv:2407.18119 (cs)

[Submitted on 25 Jul 2024]

Title:Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification

Abstract:Analyses of transformer-based models have shown that they encode a variety of linguistic information from their textual input. While these analyses have shed a light on the relation between linguistic information on one side, and internal architecture and parameters on the other, a question remains unanswered: how is this linguistic information reflected in sentence embeddings? Using datasets consisting of sentences with known structure, we test to what degree information about chunks (in particular noun, verb or prepositional phrases), such as grammatical number, or semantic role, can be localized in sentence embeddings. Our results show that such information is not distributed over the entire sentence embedding, but rather it is encoded in specific regions. Understanding how the information from an input text is compressed into sentence embeddings helps understand current transformer models and help build future explainable neural models.

Comments:	12 pages, 9 figures, 1 table, published in RepL4NLP 2024
Subjects:	Computation and Language (cs.CL)
MSC classes:	68T50
ACM classes:	I.2.7
Cite as:	arXiv:2407.18119 [cs.CL]
	(or arXiv:2407.18119v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.18119

Submission history

From: Vivi Nastase [view email]
[v1] Thu, 25 Jul 2024 15:27:08 UTC (2,565 KB)

Computer Science > Computation and Language

Title:Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators