Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2063576.2063639acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Personalizing web search results by reading level

Published: 24 October 2011 Publication History

Abstract

Traditionally, search engines have ignored the reading difficulty of documents and the reading proficiency of users in computing a document ranking. This is one reason why Web search engines do a poor job of serving an important segment of the population: children. While there are many important problems in interface design, content filtering, and results presentation related to addressing children's search needs, perhaps the most fundamental challenge is simply that of providing relevant results at the right level of reading difficulty. At the opposite end of the proficiency spectrum, it may also be valuable for technical users to find more advanced material or to filter out material at lower levels of difficulty, such as tutorials and introductory texts. We show how reading level can provide a valuable new relevance signal for both general and personalized Web search. We describe models and algorithms to address the three key problems in improving relevance for search using reading difficulty: estimating user proficiency, estimating result difficulty, and re-ranking based on the difference between user and result reading level profiles. We evaluate our methods on a large volume of Web query traffic and provide a large-scale log analysis that highlights the importance of finding results at an appropriate reading level for the user.

References

[1]
P. N. Bennett, K. Svore, and S. T. Dumais. Classification-enhanced ranking. In Proc. of WWW 2010, 111--120.
[2]
D. Bilal. Children's use of the yahooligans! web search engine: Cognitive, physical, and affective behaviors on fact-based search tasks. J. Am. Soc. Inf. Sci., 51(7):646--665, 2000.
[3]
J. Chall, E. Dale. Readability Revisited: The New Dale Chall Readability Formula. Brookline Books, 1995.
[4]
C. L. Clarke, E. Agichtein, S. Dumais, and R. W. White. The influence of caption features on clickthrough patterns in web search. In Proc. of SIGIR 2007, 135--142.
[5]
K. Collins-Thompson and J. P. Callan. A language modeling approach to predicting reading difficulty. In HLT-NAACL, 193--200.
[6]
A. Druin, E. Foss, H. Hutchinson, E. Golub, and L. Hatley. Children's roles using keyword search interfaces at home. In Proc. of CHI 2010, 413--422.
[7]
C. Eickhoff, P. Serdyukov, and A. de Vries. A combined topical/non-topical approach to identifying web sites for children. In Proc. of WSDM 2011, 505--514.
[8]
B. Efron, R. J. Tibshirani. An Introduction to the Bootstrap. Chapman & Hall/CRC, New York, 1994.
[9]
J. Gao, W. Yuan, X. Li, K. Deng, and J.-Y. Nie. Smoothing clickthrough data for web search ranking. In Proc. of SIGIR 2009.
[10]
K. Gyllstrom and M.-F. Moens. Wisdom of the ages: toward delivering the children's web with the link-based agerank algorithm. In Proc. of CIKM 2008, 159--168.
[11]
S. Hirsh. Children's relevance criteria and information seeking on electronic resources. JASIST, 50(14):1265--1283.
[12]
P. Kidwell, G. Lebanon, and K. Collins-Thompson. Statistical estimation of word acquisition with application to readability prediction. In Proc. of EMNLP 2009.
[13]
G. Kumaran, R. Jones, and O. Madani. Biasing web search results for topic familiarity. In Proc. of CIKM 2005, 271--272.
[14]
X. Liu, W. B. Croft, P. Oh, and D. Hart. Automatic recognition of reading levels from user queries. In Proc. of SIGIR 2004, 548--549.
[15]
PuppyIR. PuppyIR: An open source environment to construct information services for children. 2011. http://www.puppyir.eu/.
[16]
J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. In Proc. of SIGIR 2005, 449--456.
[17]
S. D. Torres, D. Hiemstra, and P. Serdyukov. An analysis of queries intended to search information for children. In IIiX 2010, 235--244.
[18]
M. van Kalsbeek, J. de Wit, D. Trieschnigg, P. van der Vet, T. Huibers, and D. Hiemstra. Automatic reformulation of children's search queries. Technical Report TR-CTIT-10--23, June 2010.
[19]
K. Wang, T. Walker, and Z. Zheng. pSkip: Estimating relevance ranking quality from web search clickthrough data. In Proc. of SIGKDD 2009, 1355--1364.
[20]
R. W. White, P. N. Bennett, and S. T. Dumais. Predicting short-term interests using activity-based search context. In Proc. of CIKM 2010, 1009--1018.
[21]
R. W. White and S. T. Dumais. Characterizing and predicting search engine switching behavior. In Proc. of CIKM 2009, 87--96.
[22]
R. W. White, S. T. Dumais, and J. Teevan. Characterizing the influence of domain expertise on web search behavior. In Proc. of WSDM 2009, 132--141.
[23]
Q. Wu, C. J. C. Burges, K. M. Svore, and J. Gao. Adapting boosting for information retrieval measures. Information Retrieval, 3(13):254--270, 2010.
[24]
C. X. Zhai, W. W. Cohen, and J. Lafferty. Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. In Proc. of SIGIR 2003, 10--17.

Cited By

View all
  • (2024)Search Result Presentation for Non-Native Language DocumentsCompanion Proceedings of the 29th International Conference on Intelligent User Interfaces10.1145/3640544.3645224(89-94)Online publication date: 18-Mar-2024
  • (2024)How Readability Cues Affect Children's Navigation of Search Engine Result PagesProceedings of the 23rd Annual ACM Interaction Design and Children Conference10.1145/3628516.3655818(62-69)Online publication date: 17-Jun-2024
  • (2024)Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational SearchProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638300(209-218)Online publication date: 10-Mar-2024
  • Show More Cited By

Index Terms

  1. Personalizing web search results by reading level

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
    October 2011
    2712 pages
    ISBN:9781450307178
    DOI:10.1145/2063576
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. personalization
    2. re-ranking
    3. reading difficulty

    Qualifiers

    • Research-article

    Conference

    CIKM '11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)30
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Search Result Presentation for Non-Native Language DocumentsCompanion Proceedings of the 29th International Conference on Intelligent User Interfaces10.1145/3640544.3645224(89-94)Online publication date: 18-Mar-2024
    • (2024)How Readability Cues Affect Children's Navigation of Search Engine Result PagesProceedings of the 23rd Annual ACM Interaction Design and Children Conference10.1145/3628516.3655818(62-69)Online publication date: 17-Jun-2024
    • (2024)Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational SearchProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638300(209-218)Online publication date: 10-Mar-2024
    • (2024)Encoding Group Interests With Persistent Homology for Personalized SearchIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2024.341002954:9(5606-5616)Online publication date: Sep-2024
    • (2024)Good for Children, Good for All?Advances in Information Retrieval10.1007/978-3-031-56066-8_24(302-313)Online publication date: 15-Mar-2024
    • (2024)A systematic review of multidimensional relevance estimation in information retrievalWIREs Data Mining and Knowledge Discovery10.1002/widm.154114:5Online publication date: 7-May-2024
    • (2023)Incorporating Explicit Subtopics in Personalized SearchProceedings of the ACM Web Conference 202310.1145/3543507.3583488(3364-3374)Online publication date: 30-Apr-2023
    • (2023)Multi-Perspective Learning to Rank to Support Children's Information Seeking in the Classroom2023 IEEE International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)10.1109/WI-IAT59888.2023.00050(311-317)Online publication date: 26-Oct-2023
    • (2023)Back to the Fundamentals: Extend the Rational AssumptionsA Behavioral Economics Approach to Interactive Information Retrieval10.1007/978-3-031-23229-9_5(131-152)Online publication date: 18-Feb-2023
    • (2023)Developing Executive Functions of Cognitive Abilities in the Information Culture of Preschool ChildrenAdvances in Natural, Human-Made, and Coupled Human-Natural Systems Research10.1007/978-3-030-75483-9_69(717-726)Online publication date: 17-Mar-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media