Abstract
In this paper, a system, RitroveRAI, addressing the general problem of enriching a multimedia news stream with semantic metadata is presented. News metadata here are explicitly derived from transcribed sentences or implicitly expressed into a topical category automatically detected. The enrichment process is accomplished by searching the same news expressed by different agencies reachable over the Web. Metadata extraction from the alternative sources (i.e. Web pages) is similarly applied and finally integration of the sources (according to some heuristic of pertinence) is carried out. Performance evaluation of the current system prototype has been carried out on a large scale. It confirms the viability of the RitroveRAI approach for realistic (i.e. 24 hours) applications and continuous monitoring and metadata extraction from multimedia news data.
The research work presented in this paper has been partially funded by the PrestoSpace IST Integrated Project, n. IST-FP6-507336.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Barzilay, R., Elhadad, M.: Using Lexical Chains for Text Summarization. In: The Proceedings of the Intelligent Scalable Text Summarization Workshop (ISTS 1997), ACL, Madrid (1997)
Basili, R., Pazienza, M.T., Zanzotto, F.M.: Efficient Parsing for Information Extraction. In: Proceedings of the European Conference on Artificial Intelligence (ECAI 1998), Brighton, UK (1998)
Basili, R., Moschitti, A., Pazienza, M.T.: NLP-driven IR: Evaluating performance over a text classification task. In: Proceeding of the 10th International Joint Conference of Artificial Intelligence (IJCAI 2001), Seattle, Washington, USA, August 4 (2001)
Basili, R., Zanzotto, F.M.: Parsing Engineering and Empirical Robustness. Journal of Language Engineering 8(2/3), 97โ120 (2002)
Choi, F.Y.Y., Wiemer-Hastings, P., Moore, J.: Latent semantic analysis for text segmentation. In: Proceedings of the 6th Conference on Empirical Methods in Natural Language Processing, pp. 109โ117 (2001)
Hatzivassiloglou, V., Klavans, J., Eskin, E.: Detecting text similarity over short passages: Exploring linguistic feature combinations via machine learning (1999)
Gospodnetic, O.: Advanced Text Indexing with Lucene (2003), http://lucene.apache.org
Ittner, D.J., Lewis, D.D., Ahn, D.D.: Text categorization of low quality images. In: Proceedings of SDAIR 1995, 4th Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, US, pp. 301โ315 (1995)
Landauer, T.K., Foltz, P.W., Laham, D.: Introduction to Latent Semantic Analysis. Discourse Processes 25, 259โ284 (1998)
Marcu, D.: The automatic construction of large-scale corpora for summarization research. In: Proceedings of SIGIR 1999 (1999)
Popov, B., Kiryakov, A., Ognyanoff, D., Manov, D., Kirilov, A., Goranov, M.: KIM โ Semantic Annotation Platform. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 834โ849. Springer, Heidelberg (2003)
Jing, H.: Using hidden Markov modeling to decompose human-written summaries. Computational Linguistics 28(4), 527โ543 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
ยฉ 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Basili, R., Cammisa, M., Donati, E. (2005). RitroveRAI: A Web Application for Semantic Indexing and Hyperlinking of Multimedia News. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds) The Semantic Web โ ISWC 2005. ISWC 2005. Lecture Notes in Computer Science, vol 3729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11574620_10
Download citation
DOI: https://doi.org/10.1007/11574620_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29754-3
Online ISBN: 978-3-540-32082-1
eBook Packages: Computer ScienceComputer Science (R0)