Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2645791.2645822acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article

A Preliminary Investigation into the Automatic EuroVoc Indexing of Greek Documents

Published: 02 October 2014 Publication History

Abstract

In this paper, we present an automatic indexing experiment of greek documents. In particular, we describe an attempt to use JEX, the JRC-developed indexing tool, in order to assign EuroVoc descriptors to a collection of Greek open data. We discuss the results and limitations of this approach and we propose solutions which take into account the particularities of the Greek language.

References

[1]
EuroVoc 2012. Multilingual thesaurus of the European Union. http://eurovoc.europa.eu/
[2]
Fellbaum C. (ed.) 1998. WordNet: An Electronic Lexical Database. MIT Press.
[3]
Geodata.gov.gr 2012. Web service for Greek open geospatial data http://www.geodata.gov.gr/geodata
[4]
JEX-JRC EuroVoc Indexer 2014. http://langtech.jrc.ec.europa.eu/Eurovoc.html
[5]
Karanikolas, N. and Skourlas, C. 2006. Text Classification: Forming Candidate Key-Phrases from Existing Shorter Ones. FACTA UNIVERSITATIS Series: Electronics and Energetics, ISSN 0353-3670, 19, 3.
[6]
Lancaster, F.W. 1998. Indexing and abstracting in theory and practice. Library Association Publishing, London.
[7]
Pouliquen, B., Steinberger, R. and Degeurnel, O. 2008. Story tracking: Linking similar news over time and across languages. In Proceedings of the 2nd workshop "Multi-source Multilingual Information Extraction and Summarization (MMIES'2008)" held at CoLing'2008 (Manchester, Aug.23, 2008).
[8]
Pouliquen, B., Steinberger, R. and Ignat, C. 2003. Automatic annotation of multilingual text collections with a conceptual thesaurus. In Proceedings of the workshop "Ontologies and Information Extraction" - at the summer school "The Semantic Web and Language Technology -- Its Potential and Practicalities (EUROLAN 2003)" (Bucharest, July 28 -- Aug. 8, 2003).
[9]
Stamou S., Oflazer K., Pala K., Christoudoulakis D., Cristea D., Tufiş D., Koeva S., Totkov G., Dutoit D., Grigoriadou M. 2002. Balkanet: A Multilingual Semantic Network for the Balkan Languages. In Proceedings of the International Wordnet Conference, January 21-25, Mysore, India, 12--14.
[10]
Steinberger, R., Ebrahim, M. and Turchi, M. 2012. JRC EuroVoc Indexer JEX -- A freely available multi-label categorisation tool. In Proceedings of the 8th Int. Conference LREC'2012, Istanbul, 798--805.
[11]
Steinberger, R., Ehrmann, M., Pajzs, J., Ebrahim, M., Steinberger, J. and Turchi, M. 2013. Multilingual media monitoring and text analysis -- Challenges for highly inflected languages. In Proceedings of the 16th Int. Conference TSD 2013, Pilsen, Springer -- Verlag, 22--33.
[12]
Tsoumakas, G. and Katakis, I. 2007. Multi-label classification: An overview, Int. J. Data Warehousing and Mining, 3, 1--13.
[13]
Vossen P. (ed.) 1998. EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers

Cited By

View all
  • (2016)Retrieval and Dissemination of Information in Distributed and Wireless EnvironmentsStrategic Innovative Marketing10.1007/978-3-319-33865-1_83(683-690)Online publication date: 27-Sep-2016

Index Terms

  1. A Preliminary Investigation into the Automatic EuroVoc Indexing of Greek Documents

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    PCI '14: Proceedings of the 18th Panhellenic Conference on Informatics
    October 2014
    355 pages
    ISBN:9781450328975
    DOI:10.1145/2645791
    • General Chairs:
    • Katsikas Sokratis,
    • Hatzopoulos Michael,
    • Apostolopoulos Theodoros,
    • Anagnostopoulos Dimosthenis,
    • Program Chairs:
    • Carayiannis Elias,
    • Varvarigou Theodora,
    • Nikolaidou Mara
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    • Greek Com Soc: Greek Computer Society
    • Univ. of Piraeus: University of Piraeus
    • National and Kapodistrian University of Athens: National and Kapodistrian University of Athens
    • Athens U of Econ & Business: Athens University of Economics and Business

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 October 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. EuroVoc
    2. Greek language
    3. conceptual thesaurus
    4. keyword assignment

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    PCI '14

    Acceptance Rates

    PCI '14 Paper Acceptance Rate 51 of 102 submissions, 50%;
    Overall Acceptance Rate 190 of 390 submissions, 49%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)Retrieval and Dissemination of Information in Distributed and Wireless EnvironmentsStrategic Innovative Marketing10.1007/978-3-319-33865-1_83(683-690)Online publication date: 27-Sep-2016

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media