Nothing Special   »   [go: up one dir, main page]

skip to main content
article

How are we searching the World Wide Web? A comparison of nine search engine transaction logs

Published: 01 January 2006 Publication History

Abstract

The Web and especially major Web search engines are essential tools in the quest to locate online information for many people. This paper reports results from research that examines characteristics and changes in Web searching from nine studies of five Web search engines based in the US and Europe. We compare interactions occurring between users and Web search engines from the perspectives of session length, query length, query complexity, and content viewed among the Web search engines. The results of our research shows (1) users are viewing fewer result pages, (2) searchers on US-based Web search engines use more query operators than searchers on European-based search engines, (3) there are statistically significant differences in the use of Boolean operators and result pages viewed, and (4) one cannot necessary apply results from studies of one particular Web search engine to another Web search engine. The wide spread use of Web search engines, employment of simple queries, and decreased viewing of result pages may have resulted from algorithmic enhancements by Web search engine companies. We discuss the implications of the findings for the development of Web search engines and design of online content.

References

[1]
Internet and personality. Computers in Human Behavior. v18. 1-10.
[2]
Baeza-Yates, R., & Castillo, C. (October 2000). Relating web characteristics {in Spanish} {Website}. University of Chile. Retrieved 15.07.02 from the World Wide Web: http://www.todocl.cl/stats/rbaeza.pdf.
[3]
Baeza-Yates, R., & Castillo, C. (2001). Relating web structure and user search behavior. In Proceedings of the 10th World Wide Web conference (pp. 1-2). Hong Kong, China. 1-5 May.
[4]
The use of web search engines in information science research. In: Cronin, B. (Ed.), Annual review of information science and technology, Vol. 33. Information Today, Medford, NY, USA. pp. 231-288.
[5]
Where should the person stop and the information search interface start?. Information Processing & Management. v26 i5. 575-591.
[6]
Beitzel, S. M., Jensen, E. C., Chowdhury, A., Grossman, D., & Frieder, O. (2004). Hourly analysis of a very large topically categorized Web query log. In Proceedings of the 27th annual international conference on Research and development in information retrieval (pp. 321-328). Sheffield, UK, 25-29 July.
[7]
Cases, scripts, and information-seeking strategies: on the design of interactive information retrieval systems. Expert Systems with Applications. v9 i3. 379-395.
[8]
Cacheda, F., & Viña, Á. (2001a). Experiences retrieving information in the World Wide Web. In Proceedings of the 6th IEEE Symposium on Computers and Communications (pp. 72-79). Hammamet, Tunisia. July.
[9]
Cacheda, F., & Viña, Á. (2001b). Understanding how people use search engines: a statistical analysis for e-business. In Proceedings of the e-Business and e-Work Conference and Exhibition 2001 (pp. 319-325). Venice, Italy, October.
[10]
Callan, J., & Smeaton, A. (2003). Personalisation and recommender systems in digital libraries. Joint NSF_EU_DELOS working group report. Joint NSF-EU DELOS Working Group Report. Retrieved 1.1.02 from the World Wide Web: http://www-2.cs.cmu.edu/~callan/papers/personalisation03-wg.pdf.
[11]
Using clustering techniques to detect usage patterns in a web-based information system. Journal of the American Society for Information Science and Technology. v52 i11. 888-904.
[12]
Stochastic modeling of usage patterns in a web-based information system. Journal of the American Society for Information Science and Technology. v53 i7. 536-548.
[13]
Cole, J. I., Suman, M., Schramm, P., Lunn, R., & Aquino, J. S. (February 2003) The ucla internet report surveying the digital future year three {Website}. UCLA Center for Communication Policy. Retrieved 1.2.2003 from the World Wide Web: http://www.ccp.ucla.edu/pdf/ucla-internet-report-year-three.pdf.
[14]
Cyber Atlas. (1999). US Top 50 internet properties, December 1999, at home/work combined {Website}. CyberAtlas. Retrieved 1.7.2000 from the World Wide Web: http://cyberatlas.internet.com.
[15]
Cyber Atlas. (2001). US Top 50 internet properties, may 2001, at home/work combined {Website}. CyberAtlas. Retrieved 1.7.2000 from the World Wide Web: http://cyberatlas.internet.com.
[16]
Web searching: a process-oriented experimental study of three interactive search paradigms. Journal of the American Society for Information Science and Technology. v53 i2. 120-133.
[17]
Dumais, S. T. (2002). Web experiments and test collections {Presentation}. Retrieved 20.4.03 from the World Wide Web: http://www2002.org/presentations/dumais.pdf.
[18]
Feedback and interaction in information retrieval. In: Oppenheim, C. (Ed.), Perspectives in information management, Butterworths, London. pp. 257-272.
[19]
Eiron, N., & McCurley, K. (2003). Analysis of anchor text for web search. In Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (pp. 459-460). Toronto, Canada. 28 July-1 August.
[20]
Fox, S. (2002, July 2002). Search engines {website} The Pew Internet &American Life Project. Retrieved 15.10.2002 from the World Wide Web: http://www.pewinternet.org/reports/toc.asp.
[21]
Hansen, M. H., & Shriver, E. (2001). Using navigation data to improve ir functions in the context of web search. In Proceedings of the tenth international conference on information and knowledge management (pp. 135-142). Atlanta, Georgia, USA. October.
[22]
Combining evidence for automatic web session identification. Information Processing & Management. v38 i5. 727-742.
[23]
Web search behavior of internet experts and newbies. International Journal of Computer and Telecommunications Networking. v33 i1-6. 337-346.
[24]
Research on web search behavior. Library & Information Science Research. v23 i1. 168-185.
[25]
Relevant term suggestion in interactive web search based on contextual information in query session logs. Journal of the American Society for Information Science and Technology. v54 i7. 638-649.
[26]
Web user studies: a review and framework for future work. Journal of the American Society of Information Science and Technology. v52 i3. 235-246.
[27]
Jansen, B. J., & Spink, A. (2003). An analysis of web information seeking and use: documents retrieved versus documents viewed. In Proceedings of the 4th international conference on Internet computing (pp. 65-69). Las Vegas, Nevada. 23-26 June.
[28]
Jansen, B. J., & Spink, A. (Forthcoming). An analysis of web searching by European alltheweb.Com users. Information Processing &Management.
[29]
Real life, real users, and real needs: a study and analysis of user queries on the web. Information Processing & Management. v36 i2. 207-227.
[30]
Exploring behavior of e-journal users in science and technology: transaction log analysis of Elsevier's Sciencedirect onsite in Taiwan. Library & Information Science Research. v24 i1. 265-291.
[31]
Lalmas, M., & Ruthven, I. (1999). A framework for investigating the interaction in information retrieval. In Proceedings of 9th European-Japanese conferences on information modeling and knowledge bases (pp. 222-239). Iwate, Japan. 24-28 May.
[32]
Accessibility of information on the web. Nature. v400. 107-109.
[33]
Lempel, R., & Moran, S. (2003). Predictive caching and prefetching of query results in search engines. In Proceedings of the twelfth international conference on World Wide Web (pp. 19-28). Budapest, Hungary.
[34]
An investigation of user attitudes toward search engines as an information retrieval tool. Computers in Human Behavior. v19. 751-765.
[35]
Lin, S.-J. (2002). Design space of personalized indexing: enhancing successive web searching for transmuting information problems. In Proceedings of the eighth Americas conference on information systems (pp. 1092-1100). Dallas, Texas. 9-11 August.
[36]
Online study behavior of 100,000 students preparing for the SAT, ACT, and GRE. Journal of Educational Computing Research. v30 i3. 255-262.
[37]
Identifying web browsing trends and patterns. IEEE Computer. v34 i7. 94-95.
[38]
Munarriz, R. A. (1997). How did it double? Daily double. Retrieved 10.11.2002 from the World Wide Web: http://www.fool.com/ddouble/1997/ddouble970812.htm.
[39]
National Telecommunications and Information Administration. (2002). A nation online: How Americans are expanding their use of the internet. Washington, DC: US Department of Commerce.
[40]
Nielsen Media. (1997). Search engines most popular method of surfing the web {Website}. Commerce Net/Nielsen Media. Retrieved 30.8.2000 from the World Wide Web: http://www.commerce.net/news/press/0416.html.
[41]
Ozmutlu, H. C., & Cavdur, F. (Forthcoming). Application of automatic topic identification on excite web search engine data logs. Information Processing &Management.
[42]
Park, M., Bae, J., & Lee, S. (Forthcoming). End user searching: a Web log analysis of NAVAR, a Korean web search engine. Library &Information Science Research, 27(2).
[43]
The history and development of transaction log analysis. Library Hi Tech. v42 i11. 41-66.
[44]
An exploratory analysis on search terms of network users in Taiwan {in Chinese}. Central Library Bulletin. v89 i1. 23-37.
[45]
Rieh, S. Y., & Xu, H. (2001). Patterns and sequences of multiple query reformulation in web searching: a preliminary study. In Proceedings of the 64th annual meeting of the American society for information science and technology, pp. 246-255.
[46]
A methodology for analyzing web-based qualitative data. Journal of Management Information Systems. v19 i4. 213-246.
[47]
End user searching on the internet: an analysis of term pair topics submitted to the excite search engine. Journal of the American Society for Information Science. v51 i10. 949-958.
[48]
Analysis of a very large web search engine query log. SIGIR Forum. v33 i1. 6-12.
[49]
Multitasking information behavior and information task switching: an exploratory study. Journal of Documentation. v60 i3. 336-345.
[50]
Web search: public searching of the web. Kluwer, New York.
[51]
From e-sex to e-commerce: Web search changes. IEEE Computer. v35 i3. 107-111.
[52]
US versus European Web searching trends. SIGIR Forum. v32 i1. 30-37.
[53]
Modeling users' successive searches in digital environments. D-Lib Magazine.
[54]
On the justification of Dempster's rule of combination. Artificial Intelligence. v48 i1. 171-197.
[55]
Mining longitudinal web queries: trends and patterns. Journal of the American Society for Information Science and Technology. v54 i8. 743-758.
[56]
Information retrieval and the virtual document. Journal of the American Society for Information Science. v50 i11. 1028-1029.
[57]
Wen, J.-R., Nie, J.-Y., & Zhang, H.-J. (2001). Clustering user queries of a search engine. In Proceedings of the 10th international conference on World Wide Web (pp. 162-168). Hong Kong. 1-5 May.
[58]
Vox populi: the public searching of the web. Journal of the American Society of Information Science and Technology. v52 i12. 1073-1074.
[59]
Xie, Y., & O'Hallaron, D. (2002). Locality in search engine queries and its implications for caching. In Proceedings of the twenty-first annual joint conference of the IEEE computer and communications societies (pp. 307-317). New York City, New York, USA. 23-27 June.
[60]
Studying e-journal user behavior using log files: the experience of superjournal. Library & Information Science Research. v22 i3. 311-338.

Cited By

View all
  • (2024)Re-evaluating the Command-and-Control Paradigm in Conversational Search InteractionsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679588(2260-2270)Online publication date: 21-Oct-2024
  • (2024)What do Users Really Ask Large Language Models? An Initial Log Analysis of Google Bard Interactions in the WildProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657914(2703-2707)Online publication date: 10-Jul-2024
  • (2023)Shop by image: characterizing visual search in e-commerceInformation Retrieval10.1007/s10791-023-09418-126:1-2Online publication date: 3-Mar-2023
  • Show More Cited By
  1. How are we searching the World Wide Web? A comparison of nine search engine transaction logs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Information Processing and Management: an International Journal
    Information Processing and Management: an International Journal  Volume 42, Issue 1
    Special issue: Formal methods for information retrieval
    January 2006
    422 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 January 2006

    Author Tags

    1. Transaction log analysis
    2. Web search engines
    3. Web searching

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Re-evaluating the Command-and-Control Paradigm in Conversational Search InteractionsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679588(2260-2270)Online publication date: 21-Oct-2024
    • (2024)What do Users Really Ask Large Language Models? An Initial Log Analysis of Google Bard Interactions in the WildProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657914(2703-2707)Online publication date: 10-Jul-2024
    • (2023)Shop by image: characterizing visual search in e-commerceInformation Retrieval10.1007/s10791-023-09418-126:1-2Online publication date: 3-Mar-2023
    • (2021)Accessing Media Via an Audio-only Communication Channel: A Log AnalysisProceedings of the 3rd Conference on Conversational User Interfaces10.1145/3469595.3469623(1-6)Online publication date: 27-Jul-2021
    • (2021)Exploring Topic-language Preferences in Multilingual Swahili Information Retrieval in TanzaniaACM Transactions on Asian and Low-Resource Language Information Processing10.1145/345867120:6(1-30)Online publication date: 12-Aug-2021
    • (2021)Window Navigation with Adaptive Probing for Executing BlockMax WANDProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3463109(2323-2327)Online publication date: 11-Jul-2021
    • (2021)Term position‐based language model for information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2443172:5(627-642)Online publication date: 10-Apr-2021
    • (2020)Informational, transactional, and navigational need of information: relevance of search intention in search engine advertisingInformation Retrieval10.1007/s10791-019-09368-723:2(117-135)Online publication date: 1-Apr-2020
    • (2020)The impact of ad positioning in search engine advertising: a multifaceted decision problemElectronic Commerce Research10.1007/s10660-018-9313-z20:4(945-968)Online publication date: 1-Dec-2020
    • (2019)A Study on Effective Measurement of Search Results from Search EnginesJournal of Global Information Management10.4018/JGIM.201901011027:1(196-221)Online publication date: 1-Jan-2019
    • Show More Cited By

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media