Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3437120.3437313acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
short-paper

Results Merging in the Patent Domain

Published: 04 March 2021 Publication History

Abstract

In this paper, we test machine learning methods for results merging in patent document retrieval. Specifically, we examine random forest, decision tree, support vector machine (SVR), linear regression, polynomial regression and, deep neural networks (DNNs). The models are tested in cooperative and uncooperative environments where text statistics and document scores from remote patent collections may be available or not respectively. We use two different methods for results merging, the multiple models (MMs) method, and the global models (GMs) method. Furthermore, we examine whether the ranking of the document's scores is linearly explainable. The CLEF-IP 2011 standard test collection was used in our experiments. The random forest produces the best results in comparison to all other models and it fits the data better than linear and polynomial approaches.

Supplementary Material

p229-stamatis-supplement (p229-stamatis-supplement.pptx)
Presentation slides

References

[1]
Michail Salampasis and Allan Hanbury. 2014. PerFedPat: An integrated federated system for patent search. World Patent Information, 38, pp. 4- 11, 2014.
[2]
Jamie Callan. 2002. Distributed Information Retrieval. Croft W.B. (eds) Advances in Information Retrieval. The Information Retrieval Series, vol 7. Springer, Boston, MA, pp. 127-150, 2002.
[3]
James P. Callan, Zhihong Lu, and Bruce W. Croft. 1995. Searching distributed collections with inference networks. In: Proceedings of the 18th annual internationalACM SIGIR conference on research and development in information retrieval – SIGIR’95, pp. 21-28, 1995.
[4]
Nick Craswell, David Hawking, and Paul Thistlewaite. 1999. Merging Results from Isolated Search Engines. In: Australasian Database Conference, pp. 189-200, 1999.
[5]
Luo Si and Jamie Callan. 2003. A semisupervised learning method to merge search engine results. ACM Transactions on Information Systems, 21(4), pp. 457-491, 2003.
[6]
Walid Shalaby and Wlodek Zadrozny. 2018. Patent retrieval: a literature review. Knowledge and Information Systems, 2018.
[7]
Michael Taylor, Filip Radlinski, and Milad Shokouhi. 2016. MERGING SEARCH RESULTS. Patent No. US 9,495.460 B2, Filed May 27th., 2009, Issued Nov 15th., 2016.
[8]
Jianchang Mao, Rajat Mukherjee, Prabhakar Raghavan and Panayiotis Tsaparas. 2004. METHOD AND APPARATUS FOR MERGING. Patent No. US 6,728,704 B2, Filed Aug 27th., 2001, Issued April 27th., 2004.
[9]
Milad Shokouhi and Justin Zobel. 2009. Robust result merging using sample-based score estimates. ACM Trans. Inform. Syst. 27, 3, Article 14, 2009.
[10]
Chia-Jung Lee, Qingyao Ai, Bruce W. Croft and Daniel Sheldon. 2015. An Optimization Framework for Merging Multiple Result Lists. In: CIKM’15, Melbourne, VIC, Australia, 2015.
[11]
Daniel Sheldon, Milad Shokouhi, Martin Szummer and Nick Craswell. 2011. Lambdamerge: Merging the results of query reformulations. In: WSDM, 2011.
[12]
Michail Salampasis, Giorgos Paltoglou, and Anastasia Giahanou. 2012. Report on the CLEF-IP 2012 Experiments: Search of Topically Organized Patents. In: CLEF (Online Working Notes/Labs/Workshop), 2012.
[13]
Anastasia Giachanou, Michail Salampasis, and Georgios Paltoglou. 2015 Multilayer source selection as a tool for supporting patent search and classification. Information Retrieval Journal vol. 18, pp. 559-585, 2015.
[14]
Peilin Yang, Hui Fang, and Jimmy Lin. 2018. Anserini: Reproducible Ranking Baselines Using Lucene. J. Data and Information Quality 10, 4, Article 16, 2018.
[15]
Thi Truong Avrahami, Lawrence Yau, Luo Si, and Jamie Callan. 2005. The FedLemur Project: Federated Search in the Real World. Journal of the American society for information science and technology 57(3), pp. 347-358, 2005.
[16]
Jamie Callan and Margaret Connell. 2001. Query-based sampling of text databases. ACM Trans Inf Syst 19(2), pp. 97-130, 2001.
[17]
Walid Magdy and Gareth J.F. Jones. 2010. PRES: A Score Metric for Evaluating Recall-Oriented. In: SIGIR’10, July 19–23, 2010, Geneva,Switzerland.

Cited By

View all
  • (2022)End to End Neural Retrieval for Patent Prior Art SearchAdvances in Information Retrieval10.1007/978-3-030-99739-7_66(537-544)Online publication date: 5-Apr-2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
PCI '20: Proceedings of the 24th Pan-Hellenic Conference on Informatics
November 2020
433 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 March 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Federated Search
  2. Patent Retrieval
  3. Results Merging

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Funding Sources

Conference

PCI 2020
PCI 2020: 24th Pan-Hellenic Conference on Informatics
November 20 - 22, 2020
Athens, Greece

Acceptance Rates

Overall Acceptance Rate 190 of 390 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)End to End Neural Retrieval for Patent Prior Art SearchAdvances in Information Retrieval10.1007/978-3-030-99739-7_66(537-544)Online publication date: 5-Apr-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media