Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1401890.1401905acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Generating succinct titles for web URLs

Published: 24 August 2008 Publication History

Abstract

How can a search engine automatically provide the best and most appropriate title for a result URL (link-title) so that users will be persuaded to click on the URL? We consider the problem of automatically generating link-titles for URLs and propose a general statistical framework for solving this problem. The framework is based on using information from a diverse collection of sources, each of which can be thought of as contributing one or more candidate link-titles for the URL. It can also incorporate the context in which the link-title will be used, along with constraints on its length. Our framework is applicable to several scenarios: obtaining succinct titles for displaying quicklinks, obtaining titles for URLs that lack a good title, constructing succinct sitemaps, etc. Extensive experiments show that our method is very effective, producing results that are at least 20% better than non-trivial baselines.

References

[1]
P. Anick. Using terminological feedback for web search refinement: A log-based study. In 26th SIGIR, pages 88--95, 2003.
[2]
P. Anick and S. Tipirneni. The paraphrase search assistant: Terminological feedback for iterative information seeking. In 22nd SIGIR, pages 153--159, 1999.
[3]
M. Banko, V. O. Mittal, and M. J. Witbrock. Headline generation based on statistical translation. In 38th ACL, pages 318--325, 2000.
[4]
O. Buyukkokten, H. Garcia-Molina, and A. Paepcke. Seeing the whole in parts: Text summarization for web browsing on handheld devices. In 10th WWW, pages 652--662, 2001.
[5]
B. Dorr, D. Zajic, and R. Schwartz. Hedge trimmer: A parse-and-trim approach to headline generation. In HLT-NAACL Text Summarization Workshop, pages 1--8, 2003.
[6]
J. Goldstein, V. Mittal, J. Carbonell, and M. Kantrowitz. Creating and evaluating multi-document sentence extract summaries. In 9th CIKM, pages 165--172, 2000.
[7]
Y. Gong and X. Liu. Generic text summarization using relevance measure and latent semantic analysis. In 24th SIGIR, pages 19--25, 2001.
[8]
R. Jin. Statistical Approaches Toward Title Generation. PhD thesis, Carnegie Mellon University, 2003.
[9]
T. Joachims. Optimizing search engines using clickthrough data. In 8th KDD, pages 133--142, 2002.
[10]
D. R. Radev and K. R. McKeown. Generating natural language summaries from multiple on-line sources. Computational Linguistics, 24(3):470--500, 1998.
[11]
D. Shen, Z. Chen, Q. Yang, H.-J. Zeng, B. Zhang, Y. Lu, and W.-Y. Ma. Web-page classification through summarization. In 27th SIGIR, pages 242--249, 2004.
[12]
J.-T. Sun, D. Shen, H.-J. Zeng, Q. Yang, Y. Lu, and Z. Chen. Web-page summarization using clickthrough data. In 27th SIGIR, pages 194--201, 2005.
[13]
X. Wang, J.-T. Sun, Z. Chen, and C. Zhai. Latent semantic analysis for multiple-type interrelated data objects. In 28th SIGIR, pages 236--243, 2006.
[14]
H. Zha. Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In 25th SIGIR, pages 113--120, 2002.

Cited By

View all

Index Terms

  1. Generating succinct titles for web URLs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2008
    1116 pages
    ISBN:9781605581934
    DOI:10.1145/1401890
    • General Chair:
    • Ying Li,
    • Program Chairs:
    • Bing Liu,
    • Sunita Sarawagi
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 August 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. quicklinks
    2. sitemaps
    3. web page title generation

    Qualifiers

    • Research-article

    Conference

    KDD08

    Acceptance Rates

    KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 24 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)How can catchy titles be generated without loss of informativeness?Expert Systems with Applications: An International Journal10.1016/j.eswa.2013.07.10241:4(1051-1062)Online publication date: 1-Mar-2014
    • (2014)Moved but not goneInternational Journal on Digital Libraries10.1007/s00799-014-0108-014:1-2(17-38)Online publication date: 1-Apr-2014
    • (2012)A section title authoring tool for clinical guidelinesProceedings of the 2012 ACM symposium on Document engineering10.1145/2361354.2361364(41-44)Online publication date: 4-Sep-2012
    • (2011)Pseudo test collections for learning web search ranking functionsProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010058(1073-1082)Online publication date: 24-Jul-2011
    • (2011)Personalized News Filtering and Summarization on the WebProceedings of the 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence10.1109/ICTAI.2011.68(414-421)Online publication date: 7-Nov-2011
    • (2009)Learning document aboutness from implicit user feedback and document structureProceedings of the 18th ACM conference on Information and knowledge management10.1145/1645953.1646002(365-374)Online publication date: 2-Nov-2009
    • (2009)Quicklink selection for navigational query resultsProceedings of the 18th international conference on World wide web10.1145/1526709.1526762(391-400)Online publication date: 20-Apr-2009

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media