A Practical Agent-Based Method to Extract Semantic Information from the Web

J. L. Arjona, R. Corchuelo, A. Ruiz & M. Toro

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2348)

Included in the following conference series:

International Conference on Advanced Information Systems Engineering


Abstract

The Semantic Web will bring meaning to the Internet, making it possible for web agents to understand the information it contains. However, current trends suggest that it is unlikely to be adopted in the coming years. As a result, extracting meaningful information from the web remains an obstacle for web agents. In this article, we present a framework for the automatic extraction of semantically meaningful information from the current web. Separating the extraction process from the business logic of an agent enhances modularity, adaptability, and maintainability. Our approach is novel in that it combines different technologies to extract information, surf the web, and automatically adapt to some changes.

The work reported in this article was supported by the Spanish Inter-ministerial Commission on Science and Technology under grant TIC2000-1106-C02-01.
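The separation of extraction from business logic mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' framework: the names Book, Extractor, RegexExtractor, cheapest, and the sample page are all hypothetical, and a hand-written regular expression stands in for whatever extraction technology an agent would actually use. The point is only that the agent's logic depends on an interface, so the extractor can be replaced when a site changes.

```python
from __future__ import annotations

import re
from dataclasses import dataclass
from typing import Optional, Protocol


@dataclass(frozen=True)
class Book:
    """Hypothetical domain concept the agent reasons about."""
    title: str
    price: float


class Extractor(Protocol):
    """Assumed interface: turns raw page text into Book instances."""

    def extract(self, page: str) -> list[Book]: ...


class RegexExtractor:
    """One possible extractor, based on a hand-written regular expression."""

    _pattern = re.compile(
        r"<li>(?P<title>[^<]+?)\s*-\s*\$(?P<price>\d+(?:\.\d+)?)</li>"
    )

    def extract(self, page: str) -> list[Book]:
        # Each match yields one Book; unmatched markup is simply ignored.
        return [
            Book(m["title"], float(m["price"]))
            for m in self._pattern.finditer(page)
        ]


def cheapest(extractor: Extractor, page: str) -> Optional[Book]:
    """Business logic: uses only the Extractor interface, never the markup."""
    books = extractor.extract(page)
    return min(books, key=lambda b: b.price, default=None)


# Hypothetical page content, for illustration only.
PAGE = "<ul><li>Web Agents - $30.50</li><li>Ontologies - $24</li></ul>"
best = cheapest(RegexExtractor(), PAGE)
print(best)
```

If the page layout changes, only RegexExtractor needs to be rewritten or swapped for another implementation of the same interface; cheapest stays untouched.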



Author information

Authors and Affiliations

Departamento de Lenguajes y Sistemas Informáticos, Escuela Técnica Superior de Ingeniería Informática de la Universidad de Sevilla, Avda. de la Reina Mercedes, s/n, Sevilla, Spain
J. L. Arjona, R. Corchuelo, A. Ruiz & M. Toro

Editor information

Editors and Affiliations

School of Computer Science, University of Waterloo, 200 University Avenue West, Waterloo, Ontario, N2L 3G1, Canada
Anne Banks Pidduck & M. Tamer Ozsu
University of Toronto, Pratt Building 6 King’s College Road, Toronto, Ontario, M5S 3H5
John Mylopoulos
Faculty of Commerce and Business Administration, University of British Columbia, 2053 Main Mall, Vancouver, B.C., V6T 1Z2, Canada
Carson C. Woo

About this paper

Cite this paper

Arjona, J.L., Corchuelo, R., Ruiz, A., Toro, M. (2002). A Practical Agent-Based Method to Extract Semantic Information from the Web. In: Pidduck, A.B., Ozsu, M.T., Mylopoulos, J., Woo, C.C. (eds) Advanced Information Systems Engineering. CAiSE 2002. Lecture Notes in Computer Science, vol 2348. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47961-9_48

DOI: https://doi.org/10.1007/3-540-47961-9_48
Published: 29 May 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43738-3
Online ISBN: 978-3-540-47961-1
eBook Packages: Springer Book Archive
