ML Mob at SemEval-2023 Task 1: Probing CLIP on Visual Word-Sense Disambiguation

Clifton Poth, Martin Hentschel, Tobias Werner, Hannah Sterz, Leonard Bongard

Abstract

Successful word sense disambiguation (WSD)is a fundamental element of natural languageunderstanding. As part of SemEval-2023 Task1, we investigate WSD in a multimodal setting,where ambiguous words are to be matched withcandidate images representing word senses. Wecompare multiple systems based on pre-trainedCLIP models. In our experiments, we findCLIP to have solid zero-shot performance onmonolingual and multilingual data. By em-ploying different fine-tuning techniques, we areable to further enhance performance. However,transferring knowledge between data distribu-tions proves to be more challenging.

Anthology ID:: 2023.semeval-1.201
Volume:: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1463–1469
Language:
URL:: https://aclanthology.org/2023.semeval-1.201
DOI:: 10.18653/v1/2023.semeval-1.201
Bibkey:
Cite (ACL):: Clifton Poth, Martin Hentschel, Tobias Werner, Hannah Sterz, and Leonard Bongard. 2023. ML Mob at SemEval-2023 Task 1: Probing CLIP on Visual Word-Sense Disambiguation. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 1463–1469, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: ML Mob at SemEval-2023 Task 1: Probing CLIP on Visual Word-Sense Disambiguation (Poth et al., SemEval 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.semeval-1.201.pdf

PDF Cite Search