Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/332040.332418acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
Article
Free access

Bringing order to the Web: automatically categorizing search results

Published: 01 April 2000 Publication History

Abstract

We developed a user interface that organizes Web search results into hierarchical categories. Text classification algorithms were used to automatically classify arbitrary search results into an existing category structure on-the-fly. A user study compared our new category interface with the typical ranked list interface of search results. The study showed that the category interface is superior both in objective and subjective measures. Subjects liked the category interface much better than the list interface, and they were 50% faster at finding information that was organized into categories. Organizing search results allows users to focus on items in categories of interest rather than having to browse through all the results sequentially.

References

[1]
Allen, R. B., Two digital library interfaces that exploit hierarchical structure. In Proceedings of DAGS95: Electronic Publishing and the Information Superhighway (1995).
[2]
Chakrabarti, S., Dom, B., Agrawal, R., and Raghavan, P. Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. The VLDB Journal 7, (1998), 163-178.
[3]
Chekuri, C., Goldwasser, M., Raghavan, P. and Upfal, E. Web search using automated classification. In Sixth International World Wide Web Conference, Santa Clara, California, Apr. 1997, Poster POS725.
[4]
Chen, M., Hearst, M., Hong, J., and Lin, J. Cha-Cha: a system for organizing intranet search results. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and SYSTEMS (USITS) (Boulder CO, October 1999) (to appear).
[5]
Dumais, S. T., Platt, J., Heckerman, D. and Sahami, M. Inductive learning algorithms and representations for text categorization. In Proceedings of A CM-CIKM98, Nov. 1998.
[6]
Hearst, M., and Karadi, C. Searching and browsing text collections with large category hierarchies. In Proceedings of the A CM SIGCHI Conference on Human Factors in Computing Systems (CHI), Conference Companion (Atlanta GA, March 1997).
[7]
Hearst, M., and Pedersen, P. Reexamining the cluster hypothesis: scatter/gather on retrieval results. In Proceedings of 19th Annual International A CM/SIGIR Conference (Zurich 1996).
[8]
Hearst, M., Pedersen, J., and Karger, D. Scatter/gather as a tool for the analysis of retrieval results. Working Notes of the AAAI Fall Symposium on AI Applications in Knowledge Navigation (Cambridge MA, November 1995).
[9]
Johnson, B., and Shneiderman, B. Treemaps: a spacefilling approach to the visualization of hierarchical information structures. In Sparks of Innovation in Human-Computer Interaction. Ablex Publishitig Corporation, Norwood NJ, 1993
[10]
Landauer, T., Egan, D., Remde, J., Lesk, M., Lochbaum, C., and Ketchum, D. Enhancing the usability of text through computer delivery and formative evaluation: the SuperBook project. In Hypertext - A Psychological Perspective. Ellis Horwood, 1993.
[11]
Maarek, Y., Jacovi, M., Shtalhaim, M., Ur, S., Zernik, D., and Ben Shaul, I.Z. WebCutter: a system for dynamic and tailorable site mapping. In Proceedings of the 6th International World Wide Web Conference (Santa-Clara CA, April 1997).
[12]
Marchionini, G., Plaisant, C., and Komlodi, A. Interfaces and tools for the Library of Congress national digital library program. Information Processing and Management, 34, 535-555, 1998.
[13]
Mladenic, D. Turning Yahoo into an automatic web page classifier. In Proceedings of the 13th European Conference on Artificial Intelligence (ECAI'98) 473- 474.
[14]
Platt, J. Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel Methods -Support Vector Learning. B. Sch61kopf, C. Burges, and A. Smola, eds., MIT Press, (1999).
[15]
Pratt, W. Dynamic organization of search results using the umls. In American Medical lnformatics Association Fall Symposium, 1997.
[16]
Pratt, W., Hearst, M. and Fagan, L. A knowledge-based approach to organizing retrieved documents. In Proceedings of AAAI-99.
[17]
Shneiderman, B., Feldman, D. and Rose, A. Visualizing digital library search results with categorical and hierarchical axes. CS-TR-3993, UMIACS-TR-99-12. ftp ://ftp.cs.umd. edu/pub/hcil/Reports-Abstracts- Bibliography/99-03html/99-03.html
[18]
Wittenburg, K. and Sigman, E. Integration of browsing, searching and filtering in an applet for information access. In Proceedings of A CM CH197: Human Factors in Computing Systems, (Atlanta GA, March 1997).
[19]
Zamir, O., and Etzioni, O. Grouper: A dynamic clustering interface to web search results. In Proceedings of WWW8 (Toronto, Canada, May 1999).
[20]
Zamir, O., and Etzioni, O. Web document clustering: a feasibility demonstration. In Proceedings of the 19th International A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '98), 46- 54.
[21]
http://cha-cha.berkeley.edu/
[22]
http://search.msn.com/
[23]
http ://www.inktomi.com/new/press/directory.html/
[24]
http://www.looksmart.com/
[25]
http ://www. northernlight.com/
[26]
http://www.snap.com/
[27]
http ://www. yahoo.com/

Cited By

View all
  • (2024)AI-UNet: Attention Information-based deep URL Network for adult webpage classificationNeural Computing and Applications10.1007/s00521-024-10408-7Online publication date: 7-Dec-2024
  • (2024)Minimizing Web Diversion Using Query Classification and Text MiningData Intelligence and Cognitive Informatics10.1007/978-981-99-7962-2_12(151-165)Online publication date: 7-Jan-2024
  • (2024)Fine-Grained Entity Classification Technology for Data Standard AdaptationAdvances in Mechanical Design10.1007/978-981-97-0922-9_109(1711-1719)Online publication date: 20-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CHI '00: Proceedings of the SIGCHI conference on Human Factors in Computing Systems
April 2000
587 pages
ISBN:1581132166
DOI:10.1145/332040
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 April 2000

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. World Wide Web
  2. classification
  3. search
  4. support vector machine
  5. text categorization
  6. text categrization
  7. user interface
  8. user study

Qualifiers

  • Article

Conference

CHI00
Sponsor:
CHI00: Human Factors in Computing Systems
April 1 - 6, 2000
The Hague, The Netherlands

Acceptance Rates

CHI '00 Paper Acceptance Rate 72 of 336 submissions, 21%;
Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Upcoming Conference

CHI 2025
ACM CHI Conference on Human Factors in Computing Systems
April 26 - May 1, 2025
Yokohama , Japan

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)151
  • Downloads (Last 6 weeks)28
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)AI-UNet: Attention Information-based deep URL Network for adult webpage classificationNeural Computing and Applications10.1007/s00521-024-10408-7Online publication date: 7-Dec-2024
  • (2024)Minimizing Web Diversion Using Query Classification and Text MiningData Intelligence and Cognitive Informatics10.1007/978-981-99-7962-2_12(151-165)Online publication date: 7-Jan-2024
  • (2024)Fine-Grained Entity Classification Technology for Data Standard AdaptationAdvances in Mechanical Design10.1007/978-981-97-0922-9_109(1711-1719)Online publication date: 20-Jun-2024
  • (2024)How Order and Omission of Web Content Can Vary Unintentionally Across User Cohorts: A ReviewUniversal Access in Human-Computer Interaction10.1007/978-3-031-60881-0_6(80-99)Online publication date: 1-Jun-2024
  • (2023)Query Sub-intent Mining by Incorporating Search Results with Query Logs for Information Retrieval2023 IEEE 8th International Conference on Big Data Analytics (ICBDA)10.1109/ICBDA57405.2023.10104948(180-186)Online publication date: 3-Mar-2023
  • (2021)Recommendations and Results Organization in Netflix SearchProceedings of the 15th ACM Conference on Recommender Systems10.1145/3460231.3474602(577-579)Online publication date: 13-Sep-2021
  • (2021)CoNotate: Suggesting Queries Based on Notes Promotes Knowledge DiscoveryProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445618(1-14)Online publication date: 6-May-2021
  • (2021)Web Content Authentication: A Machine Learning Approach to Identify Fake and Authentic Web Pages on InternetInformation and Communication Technology for Competitive Strategies (ICTCS 2020)10.1007/978-981-16-0882-7_6(85-103)Online publication date: 6-Jul-2021
  • (2021)Teens’ Conceptual Understanding of Web Search Engines: The Case of Google Search Engine Result Pages (SERPs)Human-Computer Interaction. Design and User Experience Case Studies10.1007/978-3-030-78468-3_18(253-270)Online publication date: 3-Jul-2021
  • (2021)Effective Seed-Guided Topic Labeling for Dataless Hierarchical Short Text ClassificationWeb Engineering10.1007/978-3-030-74296-6_21(271-285)Online publication date: 18-May-2021
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media