Abstract
This article describes the first participation of the Chair of Media Informatics at Chemnitz University of Technology in the Cross Language Evaluation Forum (CLEF). It introduces an experimental prototype that implements several methods for optimizing search results. The prototype's configuration is tested with the CLEF training data. The results for the Domain-Specific Monolingual German task suggest that combining suffix-stripping stemming with decompounding is very useful. In addition, a local document clustering (LDC) approach used to improve query expansion (QE) based on pseudo-relevance feedback (PRF) appears to be quite beneficial. Nevertheless, evaluating the English task with the same configuration suggests that the quality of the results is highly language dependent.
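To make the pseudo-relevance feedback idea in the abstract more concrete, here is a minimal Python sketch of query expansion. It is an illustration under stated assumptions, not the prototype's implementation: the function name `expand_query`, its parameters, and the simple frequency-based term selection are all hypothetical, and the local document clustering step that the abstract describes as a refinement is not shown.

```python
from collections import Counter


def expand_query(query_terms, ranked_docs, top_docs=2, top_terms=2):
    """Append frequent terms from the top-ranked documents to the query.

    Hypothetical helper: `ranked_docs` is a list of tokenized documents,
    ordered by the score of an initial retrieval run.
    """
    # Pseudo-relevance feedback: treat the top-ranked documents as relevant
    # without asking the user for judgments.
    feedback = ranked_docs[:top_docs]
    counts = Counter(
        term for doc in feedback for term in doc if term not in query_terms
    )
    # Sort by descending frequency, then alphabetically, so ties are deterministic.
    candidates = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return list(query_terms) + [term for term, _ in candidates[:top_terms]]


ranked = [
    ["retrieval", "stemming", "german"],
    ["stemming", "compound", "german"],
    ["weather", "forecast"],
]
expanded = expand_query(["retrieval"], ranked)
```

In this sketch the third document is ignored because only the top two are used as feedback. A clustering-based variant would presumably select the feedback documents more carefully before counting terms, but how the prototype does so is not stated in the abstract.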
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kürsten, J., Eibl, M. (2007). Monolingual Retrieval Experiments with a Domain-Specific Document Corpus at the Chemnitz University of Technology. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_26
DOI: https://doi.org/10.1007/978-3-540-74999-8_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer Science (R0)