Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3331184.3331340acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models

Published: 18 July 2019 Publication History

Abstract

Is neural IR mostly hype? In a recent SIGIR Forum article, Lin expressed skepticism that neural ranking models were actually improving ad hoc retrieval effectiveness in limited data scenarios. He provided anecdotal evidence that authors of neural IR papers demonstrate "wins" by comparing against weak baselines. This paper provides a rigorous evaluation of those claims in two ways: First, we conducted a meta-analysis of papers that have reported experimental results on the TREC Robust04 test collection. We do not find evidence of an upward trend in effectiveness over time. In fact, the best reported results are from a decade ago and no recent neural approach comes close. Second, we applied five recent neural models to rerank the strong baselines that Lin used to make his arguments. A significant improvement was observed for one of the models, demonstrating additivity in gains. While there appears to be merit to neural IR approaches, at least some of the gains reported in the literature appear illusory.

References

[1]
Abdul-Jaleel et al. 2004. UMass at TREC 2004: Novelty and HARD. TREC.
[2]
Akcay et al. 2017. On the Additivity and Weak Baselines for Search Result Diversification Research. ICTIR.
[3]
Armstrong et al. 2009. Improvements That Don't Add Up: Ad-hoc Retrieval Results Since 1998. CIKM.
[4]
Cormack et al. 2009. Reciprocal Rank Fusion Outperforms Condorcet and Individual Rank Learning Methods. SIGIR.
[5]
Fan et al. 2017. MatchZoo: A Toolkit for Deep Text Matching. arXiv:1707.07270.
[6]
Ganguly et al. 2015. Word Embedding Based Generalized Language Model for Information Retrieval. SIGIR.
[7]
Guo et al. 2016. A Deep Relevance Matching Model for Ad-hoc Retrieval. CIKM.
[8]
Hu et al. 2014. Convolutional Neural Network Architectures for Matching Natural Language Sentences. NIPS.
[9]
Huang et al. 2013. Learning Deep Structured Semantic Models for Web Search using Clickthrough Data. CIKM.
[10]
Kharazmi et al. 2016. Examining Additivity and Weak Baselines. TOSI, Vol. 34, 4 (2016), Article 23.
[11]
Lin. 2018. The Neural Hype and Comparisons Against Weak Baselines. SIGIR Forum, Vol. 52, 2 (2018), 40--51.
[12]
Lipton and Steinhardt. 2018. Troubling Trends in Machine Learning Scholarship. arXiv:1807.03341.
[13]
MacAvaney et al. 2019. CEDR: Contextualized Embeddings for Document Ranking. arXiv:1904.07094.
[14]
Mitra and Craswell. 2017. Neural Models for Information Retrieval. arXiv:1705.01509.
[15]
Mitra et al. 2017. Learning to Match using Local and Distributed Representations of Text for Web Search. WWW.
[16]
Rao et al. 2019. Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search. AAAI.
[17]
Sculley et al. 2018. Winner's Curse? On Pace, Progress, and Empirical Rigor. ICLR Workshops.
[18]
Shen et al. 2014. Learning Semantic Representations using Convolutional Neural Networks for Web Search. WWW.
[19]
Van Gysel et al. 2018. Neural Vector Spaces for Unsupervised Information Retrieval. TOIS, Vol. 36, 4 (2018), Article 38.
[20]
Wan et al. 2016. A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations. AAAI.
[21]
Xiong et al. 2017. End-to-End Neural Ad-hoc Ranking with Kernel Pooling. SIGIR.
[22]
Yang et al. 2016. aNMM: Ranking Short Answer Texts with Attention-Based Neural Matching Model. CIKM.
[23]
Yang et al. 2018. Anserini: Reproducible Ranking Baselines Using Lucene. JDIQ, Vol. 10, 4 (2018), Article 16.
[24]
Yang et al. 2019. End-to-End Open-Domain Question Answering with BERTserini. NAACL.
[25]
Yang et al. 2019. Simple Applications of BERT for Ad Hoc Document Retrieval. arXiv:1903.10972.
[26]
Zamani and Croft. 2016. Embedding-based query language models. ICTIR.

Cited By

View all
  • (2024)Towards Effective and Efficient Sparse Neural Information RetrievalACM Transactions on Information Systems10.1145/363491242:5(1-46)Online publication date: 29-Apr-2024
  • (2024)"Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real TimeProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3661345(2827-2831)Online publication date: 10-Jul-2024
  • (2024)A Novel Centrality-Driven Clustering Approach for Information Retrieval and Question Answering2024 8th International Artificial Intelligence and Data Processing Symposium (IDAP)10.1109/IDAP64064.2024.10711092(1-11)Online publication date: 21-Sep-2024
  • Show More Cited By

Index Terms

  1. Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
        July 2019
        1512 pages
        ISBN:9781450361729
        DOI:10.1145/3331184
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 18 July 2019

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. document ranking
        2. meta-analysis
        3. neural IR

        Qualifiers

        • Short-paper

        Funding Sources

        Conference

        SIGIR '19
        Sponsor:

        Acceptance Rates

        SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;
        Overall Acceptance Rate 792 of 3,983 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)43
        • Downloads (Last 6 weeks)8
        Reflects downloads up to 05 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Towards Effective and Efficient Sparse Neural Information RetrievalACM Transactions on Information Systems10.1145/363491242:5(1-46)Online publication date: 29-Apr-2024
        • (2024)"Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real TimeProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3661345(2827-2831)Online publication date: 10-Jul-2024
        • (2024)A Novel Centrality-Driven Clustering Approach for Information Retrieval and Question Answering2024 8th International Artificial Intelligence and Data Processing Symposium (IDAP)10.1109/IDAP64064.2024.10711092(1-11)Online publication date: 21-Sep-2024
        • (2024)Improving Performance of Neural IR Models by Using a Keyword-Extraction-Based Weak-Supervision MethodIEEE Access10.1109/ACCESS.2024.338219012(46851-46863)Online publication date: 2024
        • (2024)Conditional variational autoencoder for query expansion in ad-hoc information retrievalInformation Sciences: an International Journal10.1016/j.ins.2023.119764652:COnline publication date: 1-Jan-2024
        • (2024)Pipeline and dataset generation for automated fact-checking in almost any languageNeural Computing and Applications10.1007/s00521-024-10113-536:30(19023-19054)Online publication date: 1-Oct-2024
        • (2024)Designing for the Future of Information Access with Generative Information RetrievalInformation Access in the Era of Generative AI10.1007/978-3-031-73147-1_9(223-248)Online publication date: 12-Sep-2024
        • (2023)DeBEIR: A Python Package for Dense Bi-Encoder Information RetrievalJournal of Open Source Software10.21105/joss.050178:87(5017)Online publication date: Jul-2023
        • (2023)Report on the Dagstuhl Seminar on Frontiers of Information Access Experimentation for Research and EducationACM SIGIR Forum10.1145/3636341.363635157:1(1-28)Online publication date: 1-Jun-2023
        • (2023)A Next Basket Recommendation Reality CheckACM Transactions on Information Systems10.1145/358715341:4(1-29)Online publication date: 21-Apr-2023
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media