Information gathering and searching approaches on the Web

Seiji Yamada¹ &
Hiroyuki Kawano²

116 Accesses
5 Citations
Explore all metrics

Abstract

The information accessible through the Internet is increasing explosively as the Web is getting more and more widespread. In this situation, the Web is indispensable information resource for both of information gathering and information searching. Though traditional information retrieval techniques have been applied to information gathering and searching in the Web, they are insufficient for this new form of information source. Fortunately some Al techniques can be straightforwardly applicable to such tasks in the Web, and many researchers are trying this approach. In this paper, we attempt to describe the current state of information gathering and searching technologies in the Web, and the application of AI techniques in the fields. Then we point out limitations of these traditional and AI approaches and introduce two aapproaches: navigation planning and a Mondou search engine for overcoming them. The navigation planning system tries to collect systematic knowledge, rather than Web pages, which are only pieces of knowledge. The Mondou search engine copes with the problems of the query expansion/modification based on the techniques of text/web mining and information visualization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Journey is the Reward - Towards New Paradigms in Web Search

Searching the Web of Data

Semantic Web Search and Inductive Reasoning

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Agrawal, R., and Srikant, R., “Fast Algorithms for Mining Association, Rules,” inProc. of the 20th International Conference on Very Large Data Bases, pp. 487–499, Santiago, Chile, 1994.
Chaomei, C.,Information Visualisation and Virtual Environments, Springer-Verlag, 1999.
Cheong, F. C.,Internet Agents: Spiders, Wanderers, Brokers, and Bots, New Riders, 1996.
Doorenbos, R. B., Etzioni, O. and Weld, D. S., “A Scalable Comparison-shopping Agent for the World-Wide Web,” inProc. of the First International Conference on Autonomous Agent, pp. 39–48, 1997.
Etzioni, O. and Weld, D., “A SoftBot-based Interface to the Internet,”Communication of the ACM, 37, 7, pp. 72–76, 1994.
Article Google Scholar
Feldman, R., “Practical Text Mining,” inSecond Symposium on Principles of Data Mining and Knowledge Discovery (PKDD-97), Nantes, France, 1998.
Fikes, R. E. and Nilsson, N. J., “STRIPS: a New Approach to the Application of Theorem Proving to Problem Solving,”Artificial Intelligence, 2, pp. 189–208, 1971.
Article MATH Google Scholar
Howe, A. E. and Dreillinger, D., “Savvy Search: a Metasearch Engine that Learns Which Search Engines to Query,”AI Magazine, 18, 2, pp. 19–25, 1997.
Google Scholar
Jamsa, K., Lalani, S. and Weakley, S.,Web Programming, Jamsa Press, 1996.
Joachims, T., Freitag, D. and Mitchell, T., “Webwatcher: a Tour Guide for the World Wide Web,” inProc. of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 770–775, 1997.
Kawahara, M. and Kawano, H., “Performance Evaluation of Bibliographic Navigation System with Association Rules from Roc Convex Hull Method,”Transactions of the IPSJ: Database, 40 (SIG3(TOD1)) pp. 105–113, 1999.
Google Scholar
Kawano, H., “Mondou: Web Search Engine with Textual Data Mining,” inProc. of IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, pp. 402–405, 1997.
Kawano, H. and Kawahara, M., “Mondou: Information Navigator with Visual Interface,” inData Warehousing and Knowledge, Discovery, Second International Conference, DaWaK 2000, pp. 425–430, London, UK, Sep. 2000.
Knoblock, C. A., “Planning, Executing, Sensing, and Replanning for Information Gathering,” inProc. of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1686–1693, 1995.
Kwok, C. T. and Weld, D. S., “Planning to Gather Information,” inProc. of the Thirteenth National Conference on Artificial Intelligence, pp. 32–39, 1996.
Lieberman, H., “Letizia: a Agent that Assists Web Browsing,” inProc. of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 924–929, 1995.
Menczer, F., “ARACHNID: Adaptive Retrieval Agents Choosing Heuristic Neighborhoods for Information Discovery,” inProc. of the Fourteenth International Conference on Machine Learning, pp. 227–235, 1997.
Menczer, F. and Monge, A. E., “Scalable Web Search by Adaptive Online Agents: an Inforspiders Case Study,” inIntelligent Information Agents, pp. 323–347. Springer, 1999.
Ohsawa, Y., Benson, N. E. and Yachida, M., “KeyGraph: Automatic Indexing by Co-occurrence Graph Based on Building Construction Metaphor,” inProc. of IEEE Advanced Digital Library Conference, pp. 12–18, 1998.
Provost, F. and Fawcett, T., “Analysis and Visualization of Classifier Performance: Comparison Under Imprecise Class and Cost Distributions,” inProc. of 3rd Int’l Conference on Knowledge Discovery and Data Mining (KDD-97), pp. 43–48, 1997.
Russell, S. and Norvig, P.,Artificial Intelligence-A Modern Approach-, Prentice-Hall, 1995.
Salton, G. and Buckley, C., “Term-weighting Approaches in Automatic Text Retrieval,” inReadings in Information Retrieval (Jones, K. S. and Willet, P., eds.),Morgan Kaufmann, pp. 323–328. Morgan Kaufmann, 1997.
Selberg, E. and Etzioni, O., “Multi-service Search and Comparison Using the Metacrawler,” inthe 1995 World Wide Web Conference, 1995.
Selberg, E. and Etzioni, O., “the Metacrawler Architecture for Resource Aggregation on the Web,” inIEEE Expert, IEEE, January-February, pp. 11–14, 1997.
Yamada, S. and Ohsawa, Y., “Navigation Planning to Guide Concept Understanding in the World Wide Web,” inProc. of the Fourth International Conference on Autonomous Agent, pp. 114–115, 2000.
Yamada, S. and Osawa, Y., “Planning to Guide Concept Understanding in the WWW,” inAAAI 1998 Workshop on AI and Information Integration, pp. 121–126, 1998.
Yamana, H., Tamura, K., Kawano, H., Kamei, S., Harada, M., Nishimura, H., Asai, I., Kusumoto, H., Shinoda, Y. and Muraoka, Y., “Experiments of Collecting WWW Information Using Distributed WWW Robots,” inProc. of SIGIR’98, pp. 379–380, Melbourne, Australia, 1998.
Yates, R. B. and Neto, B. R.,Modern Information Retrieval, Addison Wesley, 1999.

Download references

Author information

Authors and Affiliations

Department of Computational Intelligence and Systems Science Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, 226-8502, Yokohama, Japan
Seiji Yamada
Department of Systems Science Graduate School of Informatics, Kyoto University, Sakyou-ku, Yoshida-honmachi, 606-8501, Kyoto, Japan
Hiroyuki Kawano

Authors

Seiji Yamada
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Kawano
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Seiji Yamada, Dr. Eng.: He received the B.S., M.S. and Ph.S. degrees in control engineering and artificial intelligence from Osaka University, Osaka, Japan, in 1984, 1986 and 1989, respectively. From 1989 to 1991, he served as a Research Associate in the Department of Control Engineering at Osaka University. From 1991 to 1996, he served as a Lecturer in the Institute of Scientific and Industrial Research at Osaka University. In 1996, he joined the Department of Computational Intelligence and Systems Science at Tokyo Institute of Technology, Yokohama, Japan, as an Associate Professor. His research interests include artificial intelligence, planning, machine learning for a robotics, intelligent information retrieval in the WWW, human computer interaction, He is a member of AAAI, IEEE, JSAI, RSJ and IEICE.

Hiroyuki Kawano, Dr.Eng.: He is an Associate Professor at the Department of Systems Science, Graduate School of Informatics, Kyoto University, Japan. He obtained his B.Eng. and M.Eng. degrees in Applied Mathematics and Physics, and his Dr.Eng. degree in Applied Systems Science from Kyoto University. His research interests are in advanced database technologies, such as data mining, data warehousing, knowledge discovery and web search engine (Mondou). He has served on the program committees of several conferences in the areas of Data Base Systems, and technical committes of advanced information systems.

About this article

Cite this article

Yamada, S., Kawano, H. Information gathering and searching approaches on the Web. NGCO 19, 195–208 (2001). https://doi.org/10.1007/BF03037255

Download citation

Received: 15 November 2000
Issue Date: June 2001
DOI: https://doi.org/10.1007/BF03037255

Information gathering and searching approaches on the Web

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

The Journey is the Reward - Towards New Paradigms in Web Search

Searching the Web of Data

Semantic Web Search and Inductive Reasoning

References

Author information

Authors and Affiliations

Additional information

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Information gathering and searching approaches on the Web

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

The Journey is the Reward - Towards New Paradigms in Web Search

Searching the Web of Data

Semantic Web Search and Inductive Reasoning

Explore related subjects

References

Author information

Authors and Affiliations

Additional information

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation