Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1183614.1183658acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Summarizing local context to personalize global web search

Published: 06 November 2006 Publication History

Abstract

The PC Desktop is a very rich repository of personal information, efficiently capturing user's interests. In this paper we propose a new approach towards an automatic personalization of web search in which the user specific information is extracted from such local desktops, thus allowing for an increased quality of user profiling, while sharing less private information with the search engine. More specifically, we investigate the opportunities to select personalized query expansion terms for web search using three different desktop oriented approaches: summarizing the entire desktop data, summarizing only the desktop documents relevant to each user query, and applying natural language processing techniques to extract dispersive lexical compounds from relevant desktop resources. Our experiments with the Google API showed at least the latter two techniques to produce a very strong improvement over current web search.

References

[1]
P. G. Anick and S. Tipirneni. The paraphrase search assistant: Terminological feedback for iterative information seeking. In Proc. of the 22nd Annual Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999.
[2]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. ACM Press / Addison-Wesley, 1999.
[3]
J. Budzik and K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, 1999.
[4]
J. Carbonell and J. Goldstein. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1998.
[5]
P. A. Chirita, C. S. Firan, and W. Nejdl. Pushing task relevant web links down to the desktop. In Proc. of the 8th ACM Intl. Workshop on Web Information and Data Management held at the 15th Intl. ACM CIKM Conference on Information and Knowledge Management, 2006.
[6]
P.-A. Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter. Using odp metadata to personalize search. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005.
[7]
P.-A. Chirita, D. Olmedilla, and W. Nejdl. Pros: A personalized ranking platform for web search. In Proc. of the 3rd Intl. Conf. on Adaptive Hypermedia and Adaptive Web-Based Systems, 2004.
[8]
D. R. Cutting, D. R. Karger, and J. O. Pedersen. Constant interaction-time scatter/gather browsing of very large document collections. In SIGIR, 1993.
[9]
D. R. Cutting, J. O. Pedersen, D. R. Karger, and J. W. Tukey. Scatter/gather: A cluster-based approach to browsing large document collections. In SIGIR, 1992.
[10]
S. Dumais, E. Cutrell, R. Sarin, and E. Horvitz. Implicit queries (iq) for contextualized search. In Proc. of the 27th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2004.
[11]
G. Erkan and D. R. Radev. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. (JAIR), 22:457--479, 2004.
[12]
S. Gauch, J. Chaffee, and A. Pretschner. Ontology-based personalized search and browsing. Web Intelli. and Agent Sys., 1(3-4):219--234, 2003.
[13]
J. Goldstein, M. Kantrowitz, V. Mittal, and J. Carbonell. Summarizing text documents: Sentence selection and evaluation metrics. In Proc. of the 22nd Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999.
[14]
T. Haveliwala. Topic-sensitive pagerank. In In Proceedings of the Eleventh International World Wide Web Conference, Honolulu, Hawaii, May 2002.
[15]
G. Jeh and J. Widom. Scaling personalized web search. In Proc. of the 12th Intl. World Wide Web Conference, 2003.
[16]
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005.
[17]
K. S. Jones, S. Walker, and S. Robertson. Probabilistic model of information retrieval: Development and status. Technical report, Cambridge University, 1998.
[18]
S. Katz. Distribution of content words and phrases in text and language modelling. Natural Language Engineering, 2(1):15--59, 1996.
[19]
A. M. Lam-Adesina and G. J. F. Jones. Applying summarization techniques for term selection in relevance feedback. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001.
[20]
D. Lawrie and W. Croft. Generating hierarchical summaries for web searches. In Proc. of the 26th Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 2003.
[21]
D. Lawrie, W. B. Croft, and A. L. Rosenberg. Finding topic words for hierarchical summarization. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001.
[22]
F. Liu, C. Yu, and W. Meng. Personalized web search for improving retrieval effectiveness. IEEE Trans. on Knowledge and Data Eng., 16(1):28--40, 2004.
[23]
H. Luhn. Automatic creation of literature abstracts. IBM Journ. of Research and Development, 2(2):159--165, 1958.
[24]
G. Miller. Wordnet: An electronic lexical database. Communications of the ACM, 38(11):39--41, 1995.
[25]
M. Mitra, A. Singhal, and C. Buckley. Improving automatic query expansion. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 1998.
[26]
T. Nomoto and Y. Matsumoto. A new approach to unsupervised text summarization. In Proc. of the 24th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001.
[27]
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
[28]
D. R. Radev, H. Jing, M. Stys, and D. Tam. Centroid-based summarization of multiple documents. Inf. Process. and Management, 40(6):919--938, 2004.
[29]
B. J. Rhodes and P. Maes. Just-in-time information retrieval agents. IBM Syst. J., 39(3-4):685--704, 2000.
[30]
S. E. Robertson and S. Walker. Okapi/keenbow at trec-8. In TREC, 1999.
[31]
J. Rocchio. Relevance feedback in information retrieval. The Smart Retrieval System: Experiments in Automatic Document Processing, pages 313--323, 1971.
[32]
D. Rose, R. Mander, T. Oren, D. Ponceleon, G. Salomon, and Y. Wong. Content awareness in a file system interface: Implementing the 'pile' metaphor for organizing information. In Proc. of the 16th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1993.
[33]
M. Sanderson and W. B. Croft. Deriving concept hierarchies from text. In Proc. of the 22nd Intl. ACM SIGIR Conf. on Research and Development in Information Retr., 1999.
[34]
K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive web search based on user profile constructed without any effort from users. In Proc. of the 13th Intl. WWW Conf., 2004.
[35]
D. Sullivan. The older you are, the more you want personalized search, 2004. http://searchenginewatch.com/searchday/article.php/3385131.
[36]
J. Teevan, S. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. In Proc. of the 28th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2005.
[37]
A. Tombros and M. Sanderson. Advantages of query biased summaries in information retrieval. In Proc. of the 21st Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1998.
[38]
E. Volokh. Personalization and privacy. Commun. ACM, 43(8), 2000.
[39]
P. Willett. Recent trends in hierarchic document clustering: a critical review. Inf. Process. and Management, 24(5), 1988.
[40]
O. Zamir and O. Etzioni. Grouper: a dynamic clustering interface to web search results. Comput. Networks, 31(11-16), 1999.
[41]
H.-J. Zeng, Q.-C. He, Z. Chen, W.-Y. Ma, and J. Ma. Learning to cluster web search results. In Proc. of the 27th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2004.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
November 2006
916 pages
ISBN:1595934332
DOI:10.1145/1183614
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. desktop summarization
  2. personalized web search
  3. relevance feedback
  4. user profile

Qualifiers

  • Article

Conference

CIKM06
CIKM06: Conference on Information and Knowledge Management
November 6 - 11, 2006
Virginia, Arlington, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)2
Reflects downloads up to 27 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Idea-Centric Search: Four Patterns of Information Seeking During Creative IdeationProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656193(280-291)Online publication date: 23-Jun-2024
  • (2020)SoTaRePoExpert Systems with Applications: An International Journal10.1016/j.eswa.2019.112955141:COnline publication date: 1-Mar-2020
  • (2020)Personalization in text information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2423471:3(349-369)Online publication date: 28-Jan-2020
  • (2018)Personalized Web SearchEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_267(2748-2753)Online publication date: 7-Dec-2018
  • (2017)Time Sensitivity for Personalized Search2017 IEEE/ACS 14th International Conference on Computer Systems and Applications (AICCSA)10.1109/AICCSA.2017.77(585-592)Online publication date: Oct-2017
  • (2017)Personalized Web SearchEncyclopedia of Database Systems10.1007/978-1-4899-7993-3_267-2(1-6)Online publication date: 28-Aug-2017
  • (2015)A Preferences Based Approach for Better Comprehension of User Information NeedsTransactions on Computational Collective Intelligence XVIII10.1007/978-3-662-48145-5_4(67-85)Online publication date: 31-Jul-2015
  • (2014)Using Personalization to Improve XML RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2013.7526:5(1280-1292)Online publication date: 1-May-2014
  • (2014)Web Search Personalization Using Ontological User ProfilesProceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 201210.1007/978-81-322-1602-5_90(849-855)Online publication date: 26-Feb-2014
  • (2014)A Preferences Based Approach for Better Comprehension of User Information NeedsComputational Collective Intelligence. Technologies and Applications10.1007/978-3-319-11289-3_10(94-103)Online publication date: 2014
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media