Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2387636.2387697dlproceedingsArticle/Chapter ViewAbstractPublication PagessemevalConference Proceedingsconference-collections
research-article
Free access

SemEval-2012 task 6: a pilot on semantic textual similarity

Published: 07 June 2012 Publication History

Abstract

Semantic Textual Similarity (STS) measures the degree of semantic equivalence between two texts. This paper presents the results of the STS pilot task in Semeval. The training data contained 2000 sentence pairs from previously existing paraphrase datasets and machine translation evaluation resources. The test data also comprised 2000 sentences pairs for those datasets, plus two surprise datasets with 400 pairs from a different machine translation evaluation corpus and 750 pairs from a lexical resource mapping exercise. The similarity of pairs of sentences was rated on a 0-5 scale (low to high similarity) by human judges using Amazon Mechanical Turk, with high Pearson correlation scores, around 90%. 35 teams participated in the task, submitting 88 runs. The best results scored a Pearson correlation >80%, well above a simple lexical baseline that only scored a 31% correlation. This pilot task opens an exciting way ahead, although there are still open issues, specially the evaluation metric.

References

[1]
Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, and Josh Schroeder. 2007. (meta-) evaluation of machine translation. In Proceedings of the Second Workshop on Statistical Machine Translation, StatMT '07, pages 136--158.
[2]
Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, and Josh Schroeder. 2008. Further meta-evaluation of machine translation. In Proceedings of the Third Workshop on Statistical Machine Translation, StatMT '08, pages 70--106.
[3]
David L. Chen and William B. Dolan. 2011. Collecting highly parallel data for paraphrase evaluation. In Proceedings of the 49th Annual Meetings of the Association for Computational Linguistics (ACL).
[4]
B. Dolan, C. Quirk, and C. Brockett. 2004. Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. In COLING 04: Proceedings of the 20th international conference on Computational Linguistics, page 350.
[5]
Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press.
[6]
Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel. 2006. Ontonotes: The 90% solution. In Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL.
[7]
Michael D. Lee, Brandon Pincombe, and Matthew Welsh. 2005. An empirical evaluation of models of text document similarity. In Proceedings of the 27th Annual Conference of the Cognitive Science Society, pages 1254--1259, Mahwah, NJ.
[8]
Y. Li, D. McLean, Z. A. Bandar, J. D. O'Shea, and K. Crockett. 2006. Sentence similarity based on semantic nets and corpus statistics. IEEE Transactions on Knowledge and Data Engineering, 18(8): 1138--1150, August.
[9]
Herbert Rubenstein and John B. Goodenough. 1965. Contextual correlates of synonymy. Commun. ACM, 8(10):627--633, October.
[10]
E. Ukkonen. 1985. Algorithms for approximate string matching. Information and Contro, 64:110--118.

Cited By

View all
  • (2024)Towards Flexible Evaluation for Generative Visual Question AnsweringProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681400(38-47)Online publication date: 28-Oct-2024
  • (2024)C-Pack: Packed Resources For General Chinese EmbeddingsProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657878(641-649)Online publication date: 10-Jul-2024
  • (2023)DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing PerspectiveProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614833(1847-1856)Online publication date: 21-Oct-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
SemEval '12: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
June 2012
758 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 07 June 2012

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 8 of 31 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)89
  • Downloads (Last 6 weeks)28
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Towards Flexible Evaluation for Generative Visual Question AnsweringProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681400(38-47)Online publication date: 28-Oct-2024
  • (2024)C-Pack: Packed Resources For General Chinese EmbeddingsProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657878(641-649)Online publication date: 10-Jul-2024
  • (2023)DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing PerspectiveProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614833(1847-1856)Online publication date: 21-Oct-2023
  • (2022)Journey to the center of the words: Word weighting scheme based on the geometry of word embeddingsProceedings of the 34th International Conference on Scientific and Statistical Database Management10.1145/3538712.3538720(1-12)Online publication date: 6-Jul-2022
  • (2022)Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved NegativesProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531823(2159-2165)Online publication date: 6-Jul-2022
  • (2021)A Combination of Enhanced WordNet and BERT for Semantic Textual SimilarityProceedings of the 2021 2nd International Conference on Control, Robotics and Intelligent System10.1145/3483845.3483898(191-198)Online publication date: 20-Aug-2021
  • (2021)Building Arabic Paraphrasing Benchmark based on Transformation RulesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/344677020:4(1-17)Online publication date: 9-Jun-2021
  • (2020)A Systematic Study of Inner-Attention-Based Sentence Representations in Multilingual Neural Machine TranslationComputational Linguistics10.1162/coli_a_0037746:2(387-424)Online publication date: 1-Jun-2020
  • (2020)Siamese Multiplicative LSTM for Semantic Text SimilarityProceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence10.1145/3446132.3446160(1-5)Online publication date: 24-Dec-2020
  • (2019)WikitheoriaProceedings of the 7th ACIS International Conference on Applied Computing and Information Technology10.1145/3325291.3325355(1-5)Online publication date: 29-May-2019
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media