Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/775152.775154acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Query-free news search

Published: 20 May 2003 Publication History

Abstract

Many daily activities present information in the form of a stream of text, and often people can benefit from additional information on the topic discussed. TV broadcast news can be treated as one such stream of text; in this paper we discuss finding news articles on the web that are relevant to news currently being broadcast.We evaluated a variety of algorithms for this problem, looking at the impact of inverse document frequency, stemming, compounds, history, and query length on the relevance and coverage of news articles returned in real time during a broadcast. We also evaluated several postprocessing techniques for improving the precision, including reranking using additional terms, reranking by document similarity, and filtering on document similarity. For the best algorithm, 84%-91% of the articles found were relevant, with at least 64% of the articles being on the exact topic of the broadcast. In addition, a relevant article was found for at least 70% of the topics.

References

[1]
J. Allan, R. Gupta, and V. Khandelwal. Temporal summaries of news topics. In Research and Development in Information Retrieval, pages 10--18, 2001.
[2]
Electronic Industries Alliance. EIA-746-A: Transport of internet uniform resource locator (url) information using text-2 (t-2) service. Technical report, 1998.
[3]
E. Brill. Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computation Linguistics, 21(4):543--565, 1995.
[4]
S. Brin, R. Motwani, L. Page, and T. Winograd. What can you do with a web in your pocket? Data Engineering Bulletin, 21(2):37--47, 1998.
[5]
J. Budzik, K. Hammond, and L. Birnbaum. Information access in context. Knowledge based systems, 14(1-2):37--53, 2001.
[6]
J. Davis. Intercast dying of neglect. CNET News, January 29, 1997.
[7]
E. Frank, G.W. Paynter, I.H. Witten, C. Gutwin, and C.G. Nevill-Manning. Domain-specific keyphrase extraction. In IJCAI, pages 668--673, 1999.
[8]
P. Hart and J. Graham. Query-free information retrieval. IEEE Expert, 12(5):32--37, 1997.
[9]
B. Krulwich and C. Burkey. Learning user information interests through the extraction of semantically significant phrases. In AAAI 1996 Spring Symposium on Machine Learning in Information Access, 1996.
[10]
H. Lieberman. Letizia: An agent that assists web browsing. In C. S. Mellish, editor, Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95), pages 924-929, 1995.
[11]
P. Maglio, R. Barrett, C. Campbell, T. Selker. Suitor: An attentive information system. In International Conference on Intelligent User Interfaces, 2000.
[12]
A. Munoz. Compound key word generation from document databases using a hierarchical clustering art model. Intelligent Data Analysis, 1(1), 1997.
[13]
M.N. Price, G. Golovchinsky, and B.N. Schilit. Linking by inking: Trailblazing in a paper-like hypertext. In Hypertext '98, pages 30--39, 1998.
[14]
B. Rhodes and P. Maes. Just-in-time information retrieval agents. IBM Systems Journal, 39(3--4), 2000.
[15]
B.J. Rhodes. Just-In-Time Information Retrieval. PhD thesis, MIT Media Laboratory, Cambridge, MA, May 2000.
[16]
S. Robertson, S. Walker, and M. Beaulieu. Okapi at TREC-7: automatic ad hoc, filtering, VLC and interactive track. In Proceedings of the 7th International Text Retrieval Conference (TREC), pages 253--264, 1999.
[17]
G.D. Robson. Closed captions, V-chip, and other VBI data. Nuts and Volts, 2000.
[18]
G. Salton. The SMART System - Experiments in Automatic Document Processing. Prentice Hall, 1971.
[19]
A.M. Steier and R.K. Belew. Exporting phrases: A statistical analysis of topical language. In Second Symposium on Document Analysis and Information Retrieval, pages 179--190, 1993.
[20]
P.D. Turney. Learning algorithms for keyphrase extraction. Information Retrieval, 2(4):303--336, 2000.

Cited By

View all
  • (2024)Entity Footprinting: Modeling Contextual User States via Digital Activity MonitoringACM Transactions on Interactive Intelligent Systems10.1145/3643893Online publication date: 5-Feb-2024
  • (2023)DotHash: Estimating Set Similarity Metrics for Link Prediction and Document DeduplicationProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599314(1758-1769)Online publication date: 6-Aug-2023
  • (2021)Entity Recommendation for Everyday Digital TasksACM Transactions on Computer-Human Interaction10.1145/345891928:5(1-41)Online publication date: 20-Aug-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '03: Proceedings of the 12th international conference on World Wide Web
May 2003
772 pages
ISBN:1581136803
DOI:10.1145/775152
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. query-free search
  2. web information retrieval

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)4
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Entity Footprinting: Modeling Contextual User States via Digital Activity MonitoringACM Transactions on Interactive Intelligent Systems10.1145/3643893Online publication date: 5-Feb-2024
  • (2023)DotHash: Estimating Set Similarity Metrics for Link Prediction and Document DeduplicationProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599314(1758-1769)Online publication date: 6-Aug-2023
  • (2021)Entity Recommendation for Everyday Digital TasksACM Transactions on Computer-Human Interaction10.1145/345891928:5(1-41)Online publication date: 20-Aug-2021
  • (2016)Building the search pattern of web users using conceptual semantic space modelInternational Journal of Web and Grid Services10.1504/IJWGS.2016.07915812:3(328-347)Online publication date: 1-Jan-2016
  • (2016)Finding News Citations for WikipediaProceedings of the 25th ACM International on Conference on Information and Knowledge Management10.1145/2983323.2983808(337-346)Online publication date: 24-Oct-2016
  • (2015)Generating Personalized Web Search Using Semantic ContextThe Scientific World Journal10.1155/2015/4627822015(1-10)Online publication date: 2015
  • (2015)Back to the PastProceedings of the Eighth ACM International Conference on Web Search and Data Mining10.1145/2684822.2685315(339-348)Online publication date: 2-Feb-2015
  • (2015)IntoNews: Online news retrieval using closed captionsInformation Processing & Management10.1016/j.ipm.2014.07.01051:1(148-162)Online publication date: Jan-2015
  • (2015)Generating Semantic Snapshots of Newscasts Using Entity ExpansionProceedings of the 15th International Conference on Engineering the Web in the Big Data Era - Volume 911410.1007/978-3-319-19890-3_26(410-419)Online publication date: 23-Jun-2015
  • (2014)Efficient automatic search query formulation using phrase-level analysisJournal of the Association for Information Science and Technology10.1002/asi.2302265:5(1058-1075)Online publication date: 1-May-2014
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media