Abstract
This paper describes the participation of the XLDB Group in the CLEF monolingual ad hoc task for Portuguese. We present tumba!, a Portuguese search engine, and describe its architecture and underlying assumptions. We then discuss how we used tumba! in CLEF, detailing our runs and our experiments with ranking algorithms.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cardoso, N., Silva, M.J., Costa, M. (2005). The XLDB Group at CLEF 2004. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_25
DOI: https://doi.org/10.1007/11519645_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer Science, Computer Science (R0)