Abstract
Efficient software infrastructures for the design and implementation of Intelligent Information Systems (IIS) are very important, especially in the area of intelligent NLP-based systems. Recently several approaches have been proposed in literature. However, the emphasis is usually centred on the integration of heterogeneous linguistic processors and the problem of the representation of linguistic data in vivo is left in the shadow. In this paper an object oriented architecture for a NLP-based IIS devoted to information extraction tasks will be discussed. An application of this model to a distributed document categorisation framework, employed within an existing system, TREVI [9], will be discussed as a relevant case study.
Preview
Unable to display preview. Download preview PDF.
References
The common object request broker: Architecture and specification, ver. 2.0. Technical document ptc/96-03-0, OMG, 1995.
McKelvie D., Brew C., and Thompson H. Using sgml as a basis for data-intensive nlp. In ANLP97, 1997.
Gamma E., Helm R., Johnson R., Vlissides J., and Booch G. (Foreword), editors. Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley Professional Computing, October 1994.
EAGLES. Evaluation of natural language processing systems. In EAG-EWG-PR.2, 1994.
Miller G. Wordnet: an on-line lexical database. International Journal of Lexicography, 3:656–691, 1994.
Cunningham H., Humphreys K., Gaizauskas R., and Wilks Y. Software infrastructure for natural language processing. In ANLP97, 1997.
Mazzucchelli L. and Marabello M.V. Specification of the overall toolkit architecture. In EP 23311 TREVI Project Deliverable 7D1, 1997.
Fowler M., Scott K. (Contributor), and Booch G., editors. Uml Distilled: Applying the Standard Object Modeling Language. Addison-Wesley Object Technology Series, June 1997.
Basili R., Mazzucchelli L. Di Nanni M., Marabello M.V., and Pazienza, M.T. Nlp for text classification: the trevi experience. In Proceedings of the Second International Conference on Natural Language Processing and Industrial Applications, Universite’ de Moncton, New Brunswick (Canada), August 1998.
Grishman R. and CAWG. Tipster text phase ii: Architecture design. Technical report, New York University, 1996.
Zajac R., Carper M., and Sharples N. An open distributed architecture for reuse and integration of heterogeneous nlp component. In ANLP97, 1997.
Peters W., Cunningham H., McCauley C., Bontcheva K., and Wilks Y. Uniform language resource access and distribution. In ICLRE98, 1998.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Basili, R., Di Nanni, M., Pazienza, M.T. (1999). Representing document content via an object-oriented paradigm. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095105
Download citation
DOI: https://doi.org/10.1007/BFb0095105
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive