Open source search: a data mining platform
Commercial search engines provide a quality service at no cost to consumers thanks to embedded targeted marketing. Despite this, I argue there are still reasons why an open source effort should be encouraged in the community: as part of broader open ...
The TREC robust retrieval track
The robust retrieval track explores methods for improving the consistency of retrieval technology by focusing on poorly performing topics. The retrieval task in the track is a traditional ad hoc retrieval task where the evaluation methodology emphasizes ...
Report on the TREC 2004 genomics track
The goal of the TREC Genomics Track is to create test collections for evaluation of information retrieval (IR) and related tasks in the genomics domain. The Genomics Track differs from the other TREC tracks in that it is focused on retrieval in a ...
The TREC terabyte retrieval track
The Terabyte Retrieval Track of the Text REtrieval Conference (TREC) provides an opportunity to test retrieval techniques and evaluation methodologies in the context of a terabyte-scale corpus. Given the size of the corpus, the track also provides a ...
The second ACM international workshop on multimedia databases (MMDB 2004) held at ACM CIKM 2004
The Second ACM International Workshop on Multimedia Databases was held in Washington, DC, USA, November 13, 2004. Its aim is to bring together university researchers, scientists, industry professionals, software engineers and graduate students who need ...
Report on the 6th ACM international workshop on web information and data management (WIDM 2004) held at CIKM 2004
The 6th ACM International Workshop on Web Information and Data Management (WIDM 2004) was held at the Hyatt Arlington Hotel, in Washington DC, on November 12-13, 2004, in conjunction with the 13th ACM International Conference on Information and ...
Workshop on the evaluation of multimedia retrieval
The evaluation of multimedia retrieval is a subject that has gained momentum in the last couple of years. CWI, the National Research Institute for Mathematics and Computer Science in the Netherlands, organised a workshop on the subject on 24 ...
Report on the 27th European conference on information retrieval research (ECIR 2005)
The 27th European Conference on Information Retrieval Research (ECIR 2005), held in Santiago de Compostela (Spain) on 21-23 March 2005, was jointly organized by the University of Santiago de Compostela (Spain) and the University of Granada (Spain),...
Relevance feedback at the INEX 2004 workshop
In 2004, the INitiative for the Evaluation of XML Retrieval (INEX), in its third year of investigations into various aspects of theoretical and applied structured retrieval, added a Relevance Feedback (RF) Track. The purpose of this Track is to explore ...
Report on the INEX 2004 interactive track
As scientific data repositories, digital libraries and publishers increasingly use the eXtensible Markup Language (XML) for publication and storage, interest has arisen in exploiting this formatting for retrieval purposes. XML is attractive because it ...
The NLP task at INEX 2004
The INEX workshop is concerned with evaluating the effectiveness of XML retrieval systems. In 2004 a natural language query task was added to the INEX Ad hoc track. Standard INEX Ad hoc topic titles are specified in NEXI -- a simplified and restricted ...
Video information retrieval using objects and ostensive relevance feedback
The thesis discusses and evaluates a model of video information retrieval that incorporates a variation of Relevance Feedback and facilitates object-based interaction and ranking. Object-based feature search for video IR is one of the main novel aspects ...
Effective web crawling
The key factors for the success of the World Wide Web are its large size and the lack of centralized control over its contents. Both issues are also the most important sources of problems for locating information. The Web is a context in which ...
Searching and mining the web for personalized and specialized information
With the rapid growth of the Web, users are often faced with the problem of information overload and find it difficult to search for relevant and useful information on the Web. Besides general-purpose search engines, there exist some alternative ...
Polyphonic music retrieval: the n-gram approach
This Music Information Retrieval (MIR) study investigates the use of n-grams and textual Information Retrieval (IR) approaches for the retrieval and access of polyphonic music data. IR, synonymous with text IR, implies the task of retrieving documents ...
Two information retrieval learning environments: their design and evaluation
The design and evaluation of two information retrieval (IR) learning environments took place in a basic course of IR (6 ECTS credits) at the Department of Information Studies at the University of Tampere. The course consisted of lectures, web exercises ...
Variations on language modeling for information retrieval
Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM) for IR, which views both queries and documents as ...
A relational vector-space model of information retrieval adapted to images
The increase in digital image acquisition devices, combined with the growth of the Web, requires the definition of Information Retrieval (IR) models and systems that provide fast access to the images users search for among large amounts of data.
Verification of bibliometric methods' applicability for thesaurus construction
The doctoral dissertation work concerns the development and exploration of a semi-automatic thesaurus construction approach based on bibliometric methods.
Automatic summarization focusing on document genre and text structure
This dissertation proposes a new automatic summarization method focusing on document genre and text structure, and verifies its effectiveness. "Document genre" refers to the type of document, such as a diary or a report. "Text structure" refers to the ...
Automated word sense disambiguation for web information retrieval
A word in the English language is considered ambiguous if, regardless of context, it can have more than one possible interpretation or meaning. Many words exhibit lexical ambiguity, suggesting that it has the potential to impact upon the performance of ...
Using generative probabilistic models for multimedia retrieval
This thesis discusses information retrieval from multimedia archives, focusing on documents containing visual material. We investigate search and retrieval in collections of images and video, where video is defined as a sequence of still images. No ...
Implicit feedback for interactive information retrieval
Searchers can find the construction of query statements for submission to Information Retrieval (IR) systems a problematic activity. These problems are confounded by uncertainty about the information they are searching for, or an unfamiliarity with the ...
Aggregated feature video retrieval for MPEG-7 via clustering
MPEG-7 is a generic standard used to encode information about multimedia content and often, different MPEG-7 Descriptor Schemas are instantiated for different representations of a shot such as text annotations and visual features. Our work focuses on ...