DOI: 10.1145/2108616.2108646
research-article

Discovering intermediate entities from two examples by using web search engine indices

Published: 14 January 2010

Abstract

We propose a system that finds intermediate entities between two examples by using Web search engine indices. Suppose a user wants to find the recipients of the Nobel Peace Prize in the thirty years between Mother Teresa (1979) and Barack Obama (2009); one answer is Kofi Atta Annan. In such a situation, the user is looking for something intermediate between two entities. We first describe the problem of finding entities between two examples. We then propose a system that extracts intermediate entities between two inputs by using Web search engine indices. The system focuses on the positions of terms in Web pages and extracts candidate terms that are likely to appear between the two inputs. It then ranks the candidate terms based on their frequencies and positions. Finally, we report experiments conducted to show the usefulness of the system.
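The extract-then-rank idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `rank_intermediate_terms`, the assumption that search-result snippets arrive as plain strings, the comma/semicolon splitting of candidates, and the simple occurrence count used as a score are all hypothetical choices made for illustration. Position is used only as a filter here (a candidate must lie after the first input and before the second).

```python
import re
from collections import Counter


def rank_intermediate_terms(snippets, first, second):
    """Count terms that appear between `first` and `second` in each snippet.

    Hypothetical sketch: snippets are assumed to be list-like text in which
    candidate entities are separated by commas or semicolons.
    """
    first_re = re.compile(re.escape(first), re.IGNORECASE)
    second_re = re.compile(re.escape(second), re.IGNORECASE)
    counts = Counter()
    for text in snippets:
        m_first = first_re.search(text)
        if m_first is None:
            continue
        # Only look for the second input after the end of the first one.
        m_second = second_re.search(text, m_first.end())
        if m_second is None:
            continue
        gap = text[m_first.end():m_second.start()]
        for term in re.split(r"[,;]", gap):
            term = term.strip()
            if term:
                counts[term] += 1
    # Most frequent candidates first; ties keep first-seen order.
    return counts.most_common()


# Toy input (hand-written strings, not real search results).
snippets = [
    "Mother Teresa, Desmond Tutu, Kofi Annan, Barack Obama",
    "Laureates: Mother Teresa; Kofi Annan; Al Gore; Barack Obama",
    "Barack Obama spoke about Mother Teresa",
]
ranked = rank_intermediate_terms(snippets, "Mother Teresa", "Barack Obama")
```

The third snippet contributes nothing because the second input does not occur after the first one, which is how the sketch approximates the abstract's focus on term positions. A fuller system would presumably weight candidates by position as well as frequency, as the abstract states.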


    Published In

    ICUIMC '10: Proceedings of the 4th International Conference on Ubiquitous Information Management and Communication
    January 2010
    550 pages
    ISBN: 9781605588933
    DOI: 10.1145/2108616

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. intermediate entity
    2. interpolation search
    3. knowledge extraction
    4. web search

    Acceptance Rates

    Overall Acceptance Rate 251 of 941 submissions, 27%
