Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Determining the informational, navigational, and transactional intent of Web queries

Published: 01 May 2008 Publication History

Abstract

In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.

References

[1]
The long tail: Why the future of business is selling more of less. Hyperion, New York.
[2]
Baeza-Yates, R., Calde¿on-Benavides, L., & Gon¿alez, C. (2006). In The intention behind Web queries (pp. 98-109). Paper presented at the string processing and information retrieval (SPIRE 2006), 11-13 October, Glasgow, Scotland.
[3]
Automatic classification of Web queries using very large unlabeled query logs. ACM Transactions on Information Systems. v25 i2.
[4]
Anomalous states of knowledge as a basis for information retrieval. Canadian Journal of Information Science. v5. 133-143.
[5]
Belkin, N. J. (1993). Interaction with texts: Information retrieval as information-seeking behavior. In Information retrieval '93, Von der Modellierung zur Anwendung (pp. 55-66). Konstanz, Germany: Universitaetsverlag Konstanz.
[6]
Belkin, N., Cool, C., Croft, W. B., & Callan, J. (1993). In The effect of multiple query representations on information retrieval systems (pp. 339-346). Paper presented at the 16th annual international ACM SIGIR conference on research and development in information retrieval.
[7]
Belkin, N., Cool, C., Kelly, D., Lee, H.-J., Muresan, G., Tang, M.-C., et al. (2003). In Query length in interactive information retrieval (pp. 205-212). Paper presented at the 26th annual international ACM conference on research and development in information retrieval, 28 July-1 August, Toronto, Canada.
[8]
Relevance for browsing, relevance for searching. Journal of the American Society of Information Science and Technology. v57 i1. 69-86.
[9]
A taxonomy of Web search. SIGIR Forum. v36 i2. 3-10.
[10]
Byrne, M., John, B., Wehrle, N., & Crow, D. (1999). In The tangled Web we wove: A taskonomy of WWW use (pp. 544-551). Paper presented at the human factors in computing systems: CHI 99, May 15-20, Pittsburgh, PA.
[11]
Carmel, E., Crawford, S., & Chen, H. (1992). In Browsing in hypertext: A cognitive study (pp. 865-884). Paper presented at the IEEE transactions on systems, man and cybernetics, 5-10 October, Chicago IL.
[12]
Chi, E. H., Pirolli, P., Chen, K., & Pitkow, J. (2001). In Using information scent to model user information needs and actions on the Web (pp. 490-497). Paper presented at the ACM CHI 2001 conference on human factors in computing systems, 31 March-5 April, Seattle, WA.
[13]
Choo, C., &amp; Turnbull, D. (2000). Information seeking on the web: An integrated model of browsing and searching. First Monday, 5(2). Available from <http://firstmonday.org/issues/issue5_2/choo/index.html>.
[14]
Choo, C., Betlor, B., &amp; Turnbull, D. (1998). In A behavioral model of information seeking on the Web: Preliminary results of a study of how managers and IT specialists use the Web (pp. 290-302). Paper presented at the 61st annual meeting of the American society for information science, Pittsburgh, PA, ASIS.
[15]
I3: A new approach to the design of document retrieval systems. Journal of the American Society for Information Science. v38 i6. 389-404.
[16]
Cronen-Townsend, S., Zhou, Y., &amp; Croft, W. B. (2002). In Predicting query performance (pp. 299-306). Paper presented at the 25th annual international ACM SIGIR conference on research and development in information retrieval, 11-15 August, Tampere, Finland.
[17]
Dai, H. K., Nie, Z., Wang, L., Zhao, L., Wen, J. -R., &amp; Li, Y. (2006). In Detecting online commercial intention (OCI) (pp. 829-837). Paper presented at the World Wide Web conference (WWW2006), 23-26 May, Edinburgh, Scotland.
[18]
Interactive query expansion: A user-based evaluation in a relevance feedback environment. Journal of the American Society of Information Science and Technology. v51 i11. 989-1003.
[19]
Visual attention to online search engine results. Market Research Agency De Vos &amp; Jansen.
[20]
Cognitive perspectives of information retrieval interaction: Elements of a cognitive IR theory. Journal of Documentation. v52 i1. 3-50.
[21]
Seeking and implementing automated assistance during the search process. Information Processing &amp; Management. v41 i4. 909-928.
[22]
Using temporal patterns of interactions to design effective automated searching assistance systems. Communications of the ACM. v49 i4. 72-74.
[23]
Evaluating the effectiveness of and patterns of interactions with automated searching assistance. Journal of the American Society for Information Science and Technology. v56 i14. 1480-1503.
[24]
An analysis of Web searching by European Alltheweb.com users. Information Processing &amp; Management. v41 i2. 361-381.
[25]
How are we searching the World Wide Web? A comparison of nine search engine transaction logs. Information Processing &amp; Management. v42 i1. 248-263.
[26]
Web searcher interactions with the Dogpile.com meta-search engine. Journal of the American Society for Information Science and Technology. v58 i4. 1875-1887.
[27]
Trend analysis of AltaVista Web searching. Journal of the American Society for Information Science and Technology. v56 i6. 559-570.
[28]
Real life, real users, and real needs: A study and analysis of user queries on the Web. Information Processing &amp; Management. v36 i2. 207-227.
[29]
Kang, I., &amp; Kim, G. (2003). In Query type classification for Web document retrieval (pp. 64-71). Paper presented at the 26th annual international ACM SIGIR conference on research and development in information retrieval, 28 July-1 August, Toronto, Canada.
[30]
A field study characterizing Web-based information-seeking tasks. Journal of the American Society for Information Science and Technology. v58 i7. 999-1018.
[31]
Kelly, D., &amp; Belkin, N. J. (2001). In Reading time, scrolling and interaction: Exploring implicit sources of user preferences for relevance feedback (pp. 408-409). Paper presented at the 24th annual international ACM SIGIR conference on research and development in information retrieval, New Orleans, Louisiana, United States.
[32]
Kelly, D., &amp; Belkin, N. J. (2004). In Display time as implicit feedback: Understanding task effects (pp. 377-384). Paper presented at the 27th annual international conference on research and development in information retrieval, 25-29 July, Sheffield, United Kingdom.
[33]
Implicit feedback for inferring user preference: A bibliography. SIGIR Forum. v37 i2. 18-28.
[34]
Lee, U., Liu, Z., &amp; Cho, J. (2005). In Automatic identification of user goals in Web search (pp. 391-401). Paper presented at the World Wide Web conference, 10-14 May, Chiba, Japan.
[35]
Information seeking in electronic environments. Cambridge University Press, Cambridge.
[36]
Morrison, J. B., Pirolli, P., &amp; Card, S. K. (2001). In A taxonomic analysis of what world wide Web activities significantly impact people's decisions and actions (pp. 163-164). Paper presented at the conference on human factors in computing systems (CHI '01), 31 March-05 April, Seattle, Washington.
[37]
Navarro-Prieto, R., Scaife, M., &amp; Rogers, Y. (1999, July). Cognitive strategies in Web searching. Paper presented at the the 5th Conference on human factors and the web, Gaithersburg, Maryland.
[38]
Nettleton, D. F., Calderon, L., &amp; Baeza-Yates, R. (2006). Analysis of Web search engine query and click data from two perspectives: Query session and document. In Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2006), Philadelphia, Pennsylvania.
[39]
Oard, D., &amp; Kim, J. (2001). In Modeling information content using observable behavior (pp. 38-45). Paper presented at the 64th annual meeting of the American society for information science and technology, 31 October-4 November, Washington, DC, USA.
[40]
O'Day, V., &amp; Jeffries, R. (1993). In Orienteering in an information landscape: How information seekers get from here to there (pp. 438-445). Paper presented at the ACM InterCHI '93, Amsterdam, The Netherlands.
[41]
End user searching: A Web log analysis of NAVER, a Korean Web search engine. Library &amp; Information Science Research. v27 i2. 203-221.
[42]
Information foraging theory: Adaptive interaction with information. Oxford University Press, Oxford.
[43]
Rose, D. E., &amp; Levinson, D. (2004). In Understanding user goals in Web search (pp. 13-19). Paper presented at the World Wide Web conference (WWW 2004), 17-22 May, New York, NY, USA.
[44]
Rozanski, H. D., Bollman, G., &amp; Lipman, M. (2001). Seize the occasion! The seven-segment system for online marketing. Retrieved 3 August 2006, Available from http://faculty.msb.edu/homak/HomaHelpSite/WebHelp/Online_Segmentation_S+B_Q4_2001.htm.
[45]
Saracevic, T. (1996). In Modeling interaction in information retrieval (IR): A review and proposal: Vol. 33 (pp. 3-9). Paper presented at the 59th American society for information science annual meeting, 19-24 October, Baltimore, MD.
[46]
Saracevic, T. (1997). In Extension and application of the stratified model of information retrieval interaction: Vol. 34 (pp. 313-327). Paper presented at the annual meeting of the American society for information science, 1-6 November, Washington, DC.
[47]
Sellen, A. J., Murphy, R., &amp; Shaw, K. L. (2002). In How knowledge workers use the Web (pp. 227-234). Paper presented at the conference on human factors in computing systems (CHI '02), 20-25 April, Minneapolis, Minnesota, USA.
[48]
Analysis of a very large Web search engine query log. SIGIR Forum. v33 i1. 6-12.
[49]
Web search: Public searching of the Web. Kluwer, New York.
[50]
Basics of qualitative research: Grounded theory procedures and techniques. Sage Publications, Newbury Park, CA.
[51]
Sullivan, D. (2006). Nielsen/NetRatings search engine ratings. Retrieved 1 June 2006, Available from http://www.searchenginewatch.com/reports/netratings.html (February 23).
[52]
Question negotiation and information seeking in libraries. College &amp; Research Libraries. v28. 178-194.
[53]
Teevan, J., Alvarado, C., Ackerman, M. S., &amp; Karger, D. R. (2004). In The perfect search engine is not enough: A study of orienteering behavior in directed search (pp. 415-422). Paper presented at the CHI 2004, 24-29 April, Vienna, Austria.

Cited By

View all
  • (2024)Seeking Socially Responsible Consumers: Exploring the Intention-Search-Behaviour GapProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638324(153-164)Online publication date: 10-Mar-2024
  • (2024)General-Purpose User Modeling with Behavioral Logs: A Snapchat Case StudyProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657908(2431-2436)Online publication date: 10-Jul-2024
  • (2024)``It Is a Moving Process": Understanding the Evolution of Explainability Needs of Clinicians in Pulmonary MedicineProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642551(1-21)Online publication date: 11-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Information Processing and Management: an International Journal
Information Processing and Management: an International Journal  Volume 44, Issue 3
May, 2008
407 pages

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 May 2008

Author Tags

  1. Search engines
  2. User intent
  3. Web queries
  4. Web searching

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Seeking Socially Responsible Consumers: Exploring the Intention-Search-Behaviour GapProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638324(153-164)Online publication date: 10-Mar-2024
  • (2024)General-Purpose User Modeling with Behavioral Logs: A Snapchat Case StudyProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657908(2431-2436)Online publication date: 10-Jul-2024
  • (2024)``It Is a Moving Process": Understanding the Evolution of Explainability Needs of Clinicians in Pulmonary MedicineProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642551(1-21)Online publication date: 11-May-2024
  • (2024)Evaluation of semantic relations impact in query expansion-based retrieval systemsKnowledge-Based Systems10.1016/j.knosys.2023.111183283:COnline publication date: 11-Jan-2024
  • (2024)Comparison of information search behavior for different exploratory tasksInformation Processing and Management: an International Journal10.1016/j.ipm.2024.10379461:5Online publication date: 1-Sep-2024
  • (2023)What Web Search Behaviors Lead to Online Purchase Satisfaction?Proceedings of the 15th ACM Web Science Conference 202310.1145/3578503.3583615(324-334)Online publication date: 30-Apr-2023
  • (2023)Keyword decisions in sponsored search advertisingInformation Processing and Management: an International Journal10.1016/j.ipm.2022.10314260:1Online publication date: 20-Jan-2023
  • (2022)Integrating Keyword Segmentation and Budget Allocation Decisions in Sponsored Search AdvertisingInternational Journal of E-Business Research10.4018/IJEBR.29668718:1(1-23)Online publication date: 15-Aug-2022
  • (2022)Generating Clarifying Questions with Web Search ResultsProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531981(234-244)Online publication date: 6-Jul-2022
  • (2022)ORCAS-IProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531737(3057-3066)Online publication date: 6-Jul-2022
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media