Abstract
This paper addresses the structural comparison of multimedia documents. Most systems that process multimedia documents exploit only their textual part. However, text is no longer the only means of conveying information. The major challenge is to extend these systems to other modalities, notably the image, which is one of the basic components of multimedia documents. Because multimedia documents are complex and multi-structured by nature, a structural representation in the form of trees is not sufficient; a representation in the form of graphs is required. Graphs are well suited to describing these documents: for example, they can describe the components of a scene in an image, the relations between these components, their positions (spatial relations), etc.
We propose a new graph similarity measure based on a univocal matching between the graphs being compared. Our approach takes into account both structural information and the specific features of multimedia information. We evaluate the measure on a set of multi-structured documents drawn from the INEX 2007 corpus.
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Idarrou, A., Mammass, D., Dupuy, C.S., Valles-Parlangeau, N. (2010). Classification of Multi-structured Documents: A Comparison Based on Media Image. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D., Meunier, J. (eds) Image and Signal Processing. ICISP 2010. Lecture Notes in Computer Science, vol 6134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13681-8_50
DOI: https://doi.org/10.1007/978-3-642-13681-8_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13680-1
Online ISBN: 978-3-642-13681-8
eBook Packages: Computer Science (R0)