default search action
27th SIGIR 2004: Sheffield, UK
- Mark Sanderson, Kalervo Järvelin, James Allan, Peter Bruza:
SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, UK, July 25-29, 2004. ACM 2004, ISBN 1-58113-881-4 - Gordon Bell, Jim Gemmell, Roger Lueder:
Challenges in using lifetime personal information stores. 1
Opening session
- Chirag Shah, W. Bruce Croft:
Evaluating high accuracy retrieval techniques. 2-9 - Einat Amitay, David Carmel, Ronny Lempel, Aya Soffer:
Scaling IR-system evaluation using term relevance sets. 10-17 - Fernando Diaz, Rosie Jones:
Using temporal profiles of queries for precision prediction. 18-24
Test collections
- Chris Buckley, Ellen M. Voorhees:
Retrieval evaluation with incomplete information. 25-32 - Mark Sanderson, Hideo Joho:
Forming test collections with no system pooling. 33-40 - Douglas W. Oard, Dagobert Soergel, David S. Doermann, Xiaoli Huang, G. Craig Murray, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, Samuel Gustman, James Mayfield, Liliya Kharevych, Stephanie M. Strassel:
Building an information retrieval test collection for spontaneous conversational speech. 41-48
Formal models-1
- Hui Fang, Tao Tao, ChengXiang Zhai:
A formal study of information retrieval heuristics. 49-56 - Ji-Rong Wen, Ni Lao, Wei-Ying Ma:
Probabilistic model for contextual retrieval. 57-63 - Ramesh Nallapati:
Discriminative models for information retrieval. 64-71
XML retrieval
- Gabriella Kazai, Mounia Lalmas, Arjen P. de Vries:
The overlap problem in content-oriented XML retrieval evaluation. 72-79 - Jaap Kamps, Maarten de Rijke, Börkur Sigurbjörnsson:
Length normalization in XML retrieval. 80-87 - Shaorong Liu, Qinghua Zou, Wesley W. Chu:
Configurable indexing and ranking for XML information retrieval. 88-95
Dimensionality reduction
- Xiaofei He, Deng Cai, Haifeng Liu, Wei-Ying Ma:
Locality preserving indexing for document representation. 96-103 - Effrosini Kokiopoulou, Yousef Saad:
Polynomial filtering in latent semantic indexing for information retrieval. 104-111 - Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu:
On scaling latent semantic indexing for large peer-to-peer systems. 112-121
Formal models-2
- John F. Canny:
GaP: a factor model for discrete data. 122-129 - Raymond Y. K. Lau, Peter Bruza, Dawei Song:
Belief revision for adaptive information retrieval. 130-137 - Weiguo Fan, Ming Luo, Li Wang, Wensi Xi, Edward A. Fox:
Tuning before feedback: combining ranking discovery and blind feedback for robust retrieval. 138-145
Cross-language information retrieval
- Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, Lee-Feng Chien:
Translating unknown queries with web corpora for cross-language information retrieval. 146-153 - Monica Rogati, Yiming Yang:
Resource selection for domain-specific cross-lingual IR. 154-161 - Ying Zhang, Phil Vines:
Using the web for automated translation extraction in cross-language information retrieval. 162-169
Language models
- Jianfeng Gao, Jian-Yun Nie, Guangyuan Wu, Guihong Cao:
Dependence language model for information retrieval. 170-177 - Djoerd Hiemstra, Stephen E. Robertson, Hugo Zaragoza:
Parsimonious language models for information retrieval. 178-185 - Xiaoyong Liu, W. Bruce Croft:
Cluster-based retrieval using language models. 186-193 - Oren Kurland, Lillian Lee:
Corpus structure, language models, and ad hoc information retrieval. 194-201
Clustering
- Wei Xu, Yihong Gong:
Document clustering by concept factorization. 202-209 - Hua-Jun Zeng, Qi-Cai He, Zheng Chen, Wei-Ying Ma, Jinwen Ma:
Learning to cluster web search results. 210-217 - Tao Li, Sheng Ma, Mitsunori Ogihara:
Document clustering via adaptive subspace iteration. 218-225 - Stefan Siersdorfer, Sergej Sizov:
Restrictive clustering and metaclustering for self-organizing document collections. 226-233
Text classification
- Dunja Mladenic, Janez Brank, Marko Grobelnik, Natasa Milic-Frayling:
Feature selection using linear classifier weights: interaction with classification models. 234-241 - Dou Shen, Zheng Chen, Qiang Yang, Hua-Jun Zeng, Benyu Zhang, Yuchang Lu, Wei-Ying Ma:
Web-page classification through summarization. 242-249 - Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovitch:
Parameterized generation of labeled datasets for text categorization based on a hierarchical directory. 250-257
Disambiguation
- Sang-Bum Kim, Hee-Cheol Seo, Hae-Chang Rim:
Information retrieval using word senses: root sense tagging approach. 258-265 - Shuang Liu, Fang Liu, Clement T. Yu, Weiyi Meng:
An effective approach to document retrieval via utilizing WordNet and recognizing phrases. 266-272 - Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer:
Web-a-where: geotagging web content. 273-280
Recognising and using named entities
- Li Zhang, Yue Pan, Tong Zhang:
Focused named entity recognition using machine learning. 281-288 - Wai Lam, Ruizhang Huang, Pik-Shan Cheung:
Learning phonetic similarity for matching named entity translations and mining new translations. 289-296 - Giridhar Kumaran, James Allan:
Text classification and named entities for new event detection. 297-304
Efficiency and scaling
- Fabrizio Silvestri, Salvatore Orlando, Raffaele Perego:
Assigning identifiers to documents to enhance the clustering property of fulltext indexes. 305-312 - Christos Tryfonopoulos, Manolis Koubarakis, Yannis Drougas:
Filtering algorithms for information retrieval models with named attributes and proximity operators. 313-320 - Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David A. Grossman, Ophir Frieder:
Hourly analysis of a very large topically categorized web query log. 321-328
Content-based filtering & collaborative filtering
- Matthew R. McLaughlin, Jonathan L. Herlocker:
A collaborative filtering algorithm and evaluation metric that accurately model the user experience. 329-336 - Rong Jin, Joyce Y. Chai, Luo Si:
An automatic weighting scheme for collaborative filtering. 337-344 - Yi Zhang:
Using bayesian priors to combine classifiers for adaptive filtering. 345-352 - Kai Yu, Volker Tresp, Shipeng Yu:
A nonparametric hierarchical bayesian framework for information filtering. 353-360
Image retrieval, users and usability
- Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu:
Automatic image annotation by using concept-sensitive salient objects for image content representation. 361-368 - Toni M. Rath, R. Manmatha, Victor Lavrenko:
A search engine for historical manuscript images. 369-376 - Diane Kelly, Nicholas J. Belkin:
Display time as implicit feedback: understanding task effects. 377-384 - Mingfang Wu, Gheorghe Muresan, Alistair McLean, Muh-Chyun (Morris) Tang, Ross Wilkinson, Yuelin Li, Hyuk-Jin Lee, Nicholas J. Belkin:
Human versus machine in the topic distillation task. 385-392 - Peter Willett:
Chemoinformatics: an application domain for information retrieval techniques. 393
Machine learning for IR
- Wensi Xi, Jesper Lind, Eric Brill:
Learning effective ranking functions for newsgroup search. 394-401 - Leah S. Larkey, Fangfang Feng, Margaret E. Connell, Victor Lavrenko:
Language-specific models in multilingual topic tracking. 402-409 - Dell Zhang, Wee Sun Lee:
Web taxonomy integration through co-bootstrapping. 410-417
Natural language processing
- Jinxi Xu, Ralph M. Weischedel, Ana Licuanan:
Evaluation of an extraction-based approach to answering definitional questions. 418-424 - Hai Leong Chieu, Yoong Keok Lee:
Query based event extraction along a timeline. 425-432 - Korinna Grabski, Tobias Scheffer:
Sentence completion. 433-439
Web structure
- Deng Cai, Xiaofei He, Ji-Rong Wen, Wei-Ying Ma:
Block-level link analysis. 440-447 - Vassilis Plachouras, Iadh Ounis:
Usefulness of hyperlink structure for query-biased topic distillation. 448-455 - Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma:
Block-based web search. 456-463
Posters
- William P. Doran, Nicola Stokes, Eamonn Newman, John Dunnion, Joe Carthy:
A hybrid statistical/linguistic model for generating news story gists. 464-465 - Mark Sanderson, Robert C. Pasley:
Image based gisting in CLIR. 466-467 - Edel Greevy, Alan F. Smeaton:
Classifying racist texts using a support vector machine. 468-469 - Azreen Azman, Iadh Ounis:
Discovery of aggregate usage profiles based on clustering information needs. 470-471 - Jie Lu, Jamie Callan:
Merging retrieval results in hierarchical peer-to-peer networks. 472-473 - Tetsuya Sakai, Yoshimi Saito, Yumi Ichimura, Tomoharu Kokubu, Makoto Koyama:
The effect of back-formulating questions in question answering evaluation. 474-475 - Jesse Montgomery, Luo Si, Jamie Callan, David A. Evans:
Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop "bf_numdocs" experiment suite. 476-477 - Laura A. Granka, Thorsten Joachims, Geri Gay:
Eye-tracking analysis of user behavior in WWW search. 478-479 - Raman Chandrasekar, Harr Chen, Simon Corston-Oliver, Eric Brill:
Subwebs for specialized search. 480-481 - Zhenmei Gu, Ming Luo:
Comparison of using passages and documents for blind relevance feedback in information retrieval. 482-483 - Paul D. Clough, Mark Sanderson:
Measuring pseudo relevance feedback & CLIR. 484-485 - Tao Tao, ChengXiang Zhai:
A two-stage mixture model for pseudo feedback. 486-487 - Eric Crestan, Claude de Loupy:
Natural language processing for browse help. 488-489 - James Mayfield, Paul McNamee:
Triangulation without translation. 490-491 - Smitha Sriram, Xuehua Shen, ChengXiang Zhai:
A session-based search engine. 492-493 - Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David A. Grossman, Ophir Frieder:
Evaluation of filtering current news search results. 494-495 - Eduard Hoenkamp, Dawei Song:
The document as an ergodic markov chain. 496-497 - Raymond J. D'Amore:
Expertise community detection. 498-499 - Dmitri Roussinov, Jose Antonio Robles-Flores:
Learning patterns to answer open domain questions on the web. 500-501 - Anton Leuski:
Email is a stage: discovering people roles from email archives. 502-503 - Gauri Shah, Tanveer Fathima Syeda-Mahmood:
Searching databases for sematically-related schemas. 504-505 - Chris Buckley:
Topic prediction based on comparative retrieval rankings. 506-507 - Elizabeth D. Liddy, Anne Diekema, Özgür Yilmazel:
Context-based question-answering evaluation. 508-509 - Yixing Sun, David J. Harper, Stuart N. K. Watt:
Design of an e-book user interface and visualizations to support reading for comprehension. 510-511 - David Hawking, Trystan Upstill, Nick Craswell:
Toward better weighting of anchors. 512-513 - Jiamin Ye, Alan F. Smeaton:
Aggregated feature retrieval for MPEG-7 via clustering. 514-515 - Andrés Corrada-Emmanuel, W. Bruce Croft:
Answer models for question answering passage retrieval. 516-517 - Harris Wu, Michael D. Gordon:
Collaborative filing in a document repository. 518-519 - Ryen W. White, Joemon M. Jose:
A study of topic similarity measures. 520-521 - Hui Yang, Tat-Seng Chua:
Effectiveness of web page classification on finding list answers. 522-523 - Ying Zhang, Phil Vines:
Detection and translation of OOV terms prior to query time. 524-525 - Yael Nemeth, Bracha Shapira, Meirav Taieb-Maimon:
Evaluation of the real and perceived value of automatic and interactive query expansion. 526-527 - Donna Harman, Chris Buckley:
The NRRC reliable information access (RIA) workshop. 528-529 - Ian Soboroff:
On evaluating web search with very few relevant documents. 530-531 - Qing Li, Byeong Man Kim, Donghai Guan, Duk whan Oh:
A music recommender based on audio features. 532-533 - Liping Ma, John Shepherd:
Information extraction using two-phase pattern discovery. 534-535 - Yue Lu, Li Zhang, Chew Lim Tan:
A search engine for imaged documents in PDF files. 536-537 - Yan Liu, Jaime G. Carbonell, Judith Klein-Seetharaman, Vanathi Gopalakrishnan:
Context sensitive vocabulary and its application in protein secondary structure prediction. 538-539 - Donald Metzler, Victor Lavrenko, W. Bruce Croft:
Formal multiple-bernoulli models for language modeling. 540-541 - Leif Azzopardi, Mark A. Girolami, Cornelis Joost van Rijsbergen:
User biased document language modelling. 542-543 - Kevyn Collins-Thompson, Jamie Callan:
Information retrieval for language tutoring: an overview of the REAP project. 544-545 - Yinghui Xu, Kyoji Umemura:
A unified model of literal mining and link analysis for ranking web resources. 546-547 - Xiaoyong Liu, W. Bruce Croft, Paul Oh, David M. Hart:
Automatic recognition of reading levels from user queries. 548-549 - Justin Basilico, Thomas Hofmann:
A joint framework for collaborative and content filtering. 550-551 - Hee-Soo Kim, Ikkyu Choi, Minkoo Kim:
Refining term weights of documents using term dependencies. 552-553 - Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke:
Multiple sources of evidence for XML retrieval. 554-555 - Yuen-Hsien Tseng, William John Teahan:
Verifying a Chinese collection for text categorization. 556-557 - Yih-Ling Hedley, Muhammad Younas, Anne E. James, Mark Sanderson:
Query-related data extraction of hidden web documents. 558-559 - Atsushi Fujii, Makoto Iwayama, Noriko Kando:
The patent retrieval task in the fourth NTCIR workshop. 560-561 - Ellen M. Voorhees:
Measuring ineffectiveness. 562-563 - Philip J. Cowans:
Information retrieval using hierarchical dirichlet processes. 564-565 - Abduelbaset Goweder, Massimo Poesio, Anne N. De Roeck:
Broken plural detection for arabic information retrieval. 566-567 - Rong Jin, Luo Si:
A study of methods for normalizing user ratings in collaborative filtering. 568-569 - Robert H. Warren, Ting Liu:
A review of relevance feedback experiments at the 2003 reliable information access (RIA) workshop. 570-571 - Bicheng Liu, David J. Harper, Stuart N. K. Watt:
Supporting federated information sharing communities. 572-573 - Kevyn Collins-Thompson, Jamie Callan, Egidio L. Terra, Charles L. A. Clarke:
The effect of document retrieval quality on factoid question answering performance. 574-575 - Trystan Upstill, Stephen E. Robertson:
Exploiting hyperlink recommendation evidence in navigational web search. 576-577 - D. S. Hunnisett, W. J. Teahan:
Context-based methods for text categorisation. 578-579 - Manu Aery, Sharma Chakravarthy:
eMailSift: mining-based approaches to email classification. 580-581 - Jack G. Conrad, Cindy P. Schriber:
Constructing a text corpus for inexact duplicate detection. 582-583 - Chris Buckley:
Why current IR engines fail. 584-585 - Manuel Zahariev:
Automatic sense disambiguation for acronyms. 586-587 - Gabriel Somlo, Adele E. Howe:
Filtering for personal web information agents. 588-589 - Michael G. Christel, Neema Moraveji, Chang Huang:
Evaluating content-based filters for image and video retrieval. 590-591 - Jianping Fan, Hangzai Luo:
Semantic video classification by integrating unlabeled samples for classifier training. 592-593 - Susan T. Dumais, Edward Cutrell, Raman Sarin, Eric Horvitz:
Implicit queries (IQ) for contextualized search. 594 - Ryen W. White, Joemon M. Jose:
An implicit system for predicting interests. 595 - Fredric C. Gey, Aitao Chen, Ray R. Larson, Kim Carl:
Geotemporal querying of multilingual documents. 596 - Xuehua Shen, Smitha Sriram, ChengXiang Zhai:
ACES: a contextual engine for search. 597 - Sam Chapman, Alexiei Dingli, Fabio Ciravegna:
Armadillo: harvesting information for the semantic web. 598 - Udo Kruschwitz, Hala Al-Bakour:
UKSearch: search with automatically acquired domain knowledge. 599 - Ray R. Larson, Patricia Frontiera:
Geographic information retrieval (GIR): searching where and what. 600
Doctorial consortium
- Melanie Gnasa:
Sharing knowledge online (abstract only): a dream or reality? 602 - Bicheng Liu:
Supporting federated information sharing communities (abstract only). 602 - Razvan Stefan Bot:
Improving document representation by accumulating relevance feedback (abstract only): the relevance feedback accumulation algorithm. 602 - Jochen L. Leidner:
Toponym resolution in text (abstract only): "which sheffield is it?". 602 - Paul Ogilvie:
Understanding combination of evidence using generative probabilistic models for information retrieval (abstract only). 603 - Yixing Sun:
Discovering and representing the contextual and narrative structure of e-books to support reading and comprehension (abstract only). 603 - Melanie J. Martin:
Reliability and verification of natural language text on the world wide web (abstract only). 603 - Andrew Trotman:
An artificial intelligence approach to information retrieval (abstract only). 603 - Xiaojun Yuan:
Supporting multiple information-seeking strategies in a single system framework (abstract only). 604
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.