Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Information gathering and searching approaches on the Web

  • Tutorial Series on Web-Computing 1
  • Published:
New Generation Computing Aims and scope Submit manuscript

Abstract

The information accessible through the Internet is increasing explosively as the Web is getting more and more widespread. In this situation, the Web is indispensable information resource for both of information gathering and information searching. Though traditional information retrieval techniques have been applied to information gathering and searching in the Web, they are insufficient for this new form of information source. Fortunately some Al techniques can be straightforwardly applicable to such tasks in the Web, and many researchers are trying this approach. In this paper, we attempt to describe the current state of information gathering and searching technologies in the Web, and the application of AI techniques in the fields. Then we point out limitations of these traditional and AI approaches and introduce two aapproaches: navigation planning and a Mondou search engine for overcoming them. The navigation planning system tries to collect systematic knowledge, rather than Web pages, which are only pieces of knowledge. The Mondou search engine copes with the problems of the query expansion/modification based on the techniques of text/web mining and information visualization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Agrawal, R., and Srikant, R., “Fast Algorithms for Mining Association, Rules,” inProc. of the 20th International Conference on Very Large Data Bases, pp. 487–499, Santiago, Chile, 1994.

  2. Chaomei, C.,Information Visualisation and Virtual Environments, Springer-Verlag, 1999.

  3. Cheong, F. C.,Internet Agents: Spiders, Wanderers, Brokers, and Bots, New Riders, 1996.

  4. Doorenbos, R. B., Etzioni, O. and Weld, D. S., “A Scalable Comparison-shopping Agent for the World-Wide Web,” inProc. of the First International Conference on Autonomous Agent, pp. 39–48, 1997.

  5. Etzioni, O. and Weld, D., “A SoftBot-based Interface to the Internet,”Communication of the ACM, 37, 7, pp. 72–76, 1994.

    Article  Google Scholar 

  6. Feldman, R., “Practical Text Mining,” inSecond Symposium on Principles of Data Mining and Knowledge Discovery (PKDD-97), Nantes, France, 1998.

  7. Fikes, R. E. and Nilsson, N. J., “STRIPS: a New Approach to the Application of Theorem Proving to Problem Solving,”Artificial Intelligence, 2, pp. 189–208, 1971.

    Article  MATH  Google Scholar 

  8. Howe, A. E. and Dreillinger, D., “Savvy Search: a Metasearch Engine that Learns Which Search Engines to Query,”AI Magazine, 18, 2, pp. 19–25, 1997.

    Google Scholar 

  9. Jamsa, K., Lalani, S. and Weakley, S.,Web Programming, Jamsa Press, 1996.

  10. Joachims, T., Freitag, D. and Mitchell, T., “Webwatcher: a Tour Guide for the World Wide Web,” inProc. of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 770–775, 1997.

  11. Kawahara, M. and Kawano, H., “Performance Evaluation of Bibliographic Navigation System with Association Rules from Roc Convex Hull Method,”Transactions of the IPSJ: Database, 40 (SIG3(TOD1)) pp. 105–113, 1999.

    Google Scholar 

  12. Kawano, H., “Mondou: Web Search Engine with Textual Data Mining,” inProc. of IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, pp. 402–405, 1997.

  13. Kawano, H. and Kawahara, M., “Mondou: Information Navigator with Visual Interface,” inData Warehousing and Knowledge, Discovery, Second International Conference, DaWaK 2000, pp. 425–430, London, UK, Sep. 2000.

  14. Knoblock, C. A., “Planning, Executing, Sensing, and Replanning for Information Gathering,” inProc. of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1686–1693, 1995.

  15. Kwok, C. T. and Weld, D. S., “Planning to Gather Information,” inProc. of the Thirteenth National Conference on Artificial Intelligence, pp. 32–39, 1996.

  16. Lieberman, H., “Letizia: a Agent that Assists Web Browsing,” inProc. of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 924–929, 1995.

  17. Menczer, F., “ARACHNID: Adaptive Retrieval Agents Choosing Heuristic Neighborhoods for Information Discovery,” inProc. of the Fourteenth International Conference on Machine Learning, pp. 227–235, 1997.

  18. Menczer, F. and Monge, A. E., “Scalable Web Search by Adaptive Online Agents: an Inforspiders Case Study,” inIntelligent Information Agents, pp. 323–347. Springer, 1999.

  19. Ohsawa, Y., Benson, N. E. and Yachida, M., “KeyGraph: Automatic Indexing by Co-occurrence Graph Based on Building Construction Metaphor,” inProc. of IEEE Advanced Digital Library Conference, pp. 12–18, 1998.

  20. Provost, F. and Fawcett, T., “Analysis and Visualization of Classifier Performance: Comparison Under Imprecise Class and Cost Distributions,” inProc. of 3rd Int’l Conference on Knowledge Discovery and Data Mining (KDD-97), pp. 43–48, 1997.

  21. Russell, S. and Norvig, P.,Artificial Intelligence-A Modern Approach-, Prentice-Hall, 1995.

  22. Salton, G. and Buckley, C., “Term-weighting Approaches in Automatic Text Retrieval,” inReadings in Information Retrieval (Jones, K. S. and Willet, P., eds.),Morgan Kaufmann, pp. 323–328. Morgan Kaufmann, 1997.

  23. Selberg, E. and Etzioni, O., “Multi-service Search and Comparison Using the Metacrawler,” inthe 1995 World Wide Web Conference, 1995.

  24. Selberg, E. and Etzioni, O., “the Metacrawler Architecture for Resource Aggregation on the Web,” inIEEE Expert, IEEE, January-February, pp. 11–14, 1997.

  25. Yamada, S. and Ohsawa, Y., “Navigation Planning to Guide Concept Understanding in the World Wide Web,” inProc. of the Fourth International Conference on Autonomous Agent, pp. 114–115, 2000.

  26. Yamada, S. and Osawa, Y., “Planning to Guide Concept Understanding in the WWW,” inAAAI 1998 Workshop on AI and Information Integration, pp. 121–126, 1998.

  27. Yamana, H., Tamura, K., Kawano, H., Kamei, S., Harada, M., Nishimura, H., Asai, I., Kusumoto, H., Shinoda, Y. and Muraoka, Y., “Experiments of Collecting WWW Information Using Distributed WWW Robots,” inProc. of SIGIR’98, pp. 379–380, Melbourne, Australia, 1998.

  28. Yates, R. B. and Neto, B. R.,Modern Information Retrieval, Addison Wesley, 1999.

Download references

Author information

Authors and Affiliations

Authors

Additional information

Seiji Yamada, Dr. Eng.: He received the B.S., M.S. and Ph.S. degrees in control engineering and artificial intelligence from Osaka University, Osaka, Japan, in 1984, 1986 and 1989, respectively. From 1989 to 1991, he served as a Research Associate in the Department of Control Engineering at Osaka University. From 1991 to 1996, he served as a Lecturer in the Institute of Scientific and Industrial Research at Osaka University. In 1996, he joined the Department of Computational Intelligence and Systems Science at Tokyo Institute of Technology, Yokohama, Japan, as an Associate Professor. His research interests include artificial intelligence, planning, machine learning for a robotics, intelligent information retrieval in the WWW, human computer interaction, He is a member of AAAI, IEEE, JSAI, RSJ and IEICE.

Hiroyuki Kawano, Dr.Eng.: He is an Associate Professor at the Department of Systems Science, Graduate School of Informatics, Kyoto University, Japan. He obtained his B.Eng. and M.Eng. degrees in Applied Mathematics and Physics, and his Dr.Eng. degree in Applied Systems Science from Kyoto University. His research interests are in advanced database technologies, such as data mining, data warehousing, knowledge discovery and web search engine (Mondou). He has served on the program committees of several conferences in the areas of Data Base Systems, and technical committes of advanced information systems.

About this article

Cite this article

Yamada, S., Kawano, H. Information gathering and searching approaches on the Web. NGCO 19, 195–208 (2001). https://doi.org/10.1007/BF03037255

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF03037255

Keywords

Navigation