Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
SIGIR-AP 2024: Proceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific RegionPages 54–63https://doi.org/10.1145/3673791.3698420Cognitive biases are systematic deviations in thinking that lead to irrational judgments and problematic decision-making, extensively studied across various fields. Recently, large language models (LLMs) have shown advanced understanding capabilities but ...
- research-articleAugust 2023
Learning to Relate to Previous Turns in Conversational Search
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 1722–1732https://doi.org/10.1145/3580305.3599411Conversational search allows a user to interact with a search system in multiple turns. A query is strongly dependent on the conversation context. An effective way to improve retrieval effectiveness is to expand the current query with historical ...
- short-paperJuly 2023
A Preference Judgment Tool for Authoritative Assessment
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 3100–3104https://doi.org/10.1145/3539618.3591801Preference judgments have been established as an effective method for offline evaluation of information retrieval systems with advantages to graded or binary relevance judgments. Graded judgments assign each document a pre-defined grade level, while ...
- research-articleFebruary 2023
Understanding Relevance Judgments in Legal Case Retrieval
ACM Transactions on Information Systems (TOIS), Volume 41, Issue 3Article No.: 76, Pages 1–32https://doi.org/10.1145/3569929Legal case retrieval, which aims to retrieve relevant cases given a query case, has drawn increasing research attention in recent years. While much research has worked on developing automatic retrieval models, how to characterize relevance in this ...
- short-paperOctober 2022
Extreme Systematic Reviews: A Large Literature Screening Dataset to Support Environmental Policymaking
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge ManagementPages 4029–4033https://doi.org/10.1145/3511808.3557600The United States Environmental Protection Agency (EPA) periodically releases Integrated Science Assessments (ISAs) that synthesize the latest research on each of six air pollutants to inform environmental policymaking. To guarantee the best possible ...
-
- research-articleOctober 2021
Evaluating Relevance Judgments with Pairwise Discriminative Power
CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge ManagementPages 261–270https://doi.org/10.1145/3459637.3482428Relevance judgments play an essential role in the evaluation of information retrieval systems. As many different relevance judgment settings have been proposed in recent years, an evaluation metric to compare relevance judgments in different annotation ...
- short-paperJuly 2020
Investigating Reading Behavior in Fine-grained Relevance Judgment
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information RetrievalPages 1889–1892https://doi.org/10.1145/3397271.3401305A better understanding of users' reading behavior helps improve many information retrieval (IR) tasks, such as relevance estimation and document ranking. Existing research has already leveraged eye movement information to investigate user's reading ...
- research-articleJanuary 2020
Crowd Worker Strategies in Relevance Judgment Tasks
- Lei Han,
- Eddy Maddalena,
- Alessandro Checco,
- Cristina Sarasua,
- Ujwal Gadiraju,
- Kevin Roitero,
- Gianluca Demartini
WSDM '20: Proceedings of the 13th International Conference on Web Search and Data MiningPages 241–249https://doi.org/10.1145/3336191.3371857Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown how collecting high quality labels from a ...
- research-articleOctober 2019
Order effect on relevance judgment: An exploratory study on the detection of quantum interference
Proceedings of the Association for Information Science and Technology (PRA2), Volume 56, Issue 1Pages 803–804https://doi.org/10.1002/pra2.179ABSTRACTThis poster explores the existence of order effect on relevance judgment, especially the interference of one document on the other while evaluating the relevance. A between‐subject experiment with 78 undergraduate students was reported. The ...
- research-articleSeptember 2019
Effort-based information retrieval evaluation with varied evaluation depth and topic sizes
ICBIM '19: Proceedings of the 3rd International Conference on Business and Information ManagementPages 143–147https://doi.org/10.1145/3361785.3361794The information retrieval accessed globally is a vital productivity boost for most organization. However, the outcome of information retrieval system evaluation does not agree with the real user's satisfaction. Information retrieval systems retrieving ...
- research-articleJuly 2019
Investigating Passage-level Relevance and Its Role in Document-level Relevance Judgment
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information RetrievalPages 605–614https://doi.org/10.1145/3331184.3331233The understanding of the process of relevance judgment helps to inspire the design of retrieval models. Traditional retrieval models usually estimate relevance based on document-level signals. Recent works consider a more fine-grain, passage-level ...
- short-paperMarch 2019
Lessons Learned from Users Reading Highlighted Abstracts in a Digital Library
CHIIR '19: Proceedings of the 2019 Conference on Human Information Interaction and RetrievalPages 271–275https://doi.org/10.1145/3295750.3298950Finding relevant documents is essential for researchers of all disciplines. We investigated an approach for supporting searchers in their relevance decision in a digital library by automatically highlighting the most important keywords in abstracts. We ...
- research-articleApril 2018
Search Process as Transitions Between Neural States
WWW '18: Proceedings of the 2018 World Wide Web ConferencePages 1683–1692https://doi.org/10.1145/3178876.3186080Search is one of the most performed activities on the World Wide Web. Various conceptual models postulate that the search process can be broken down into distinct emotional and cognitive states of searchers while they engage in a search process. These ...
- research-articleAugust 2017
Comparing In Situ and Multidimensional Relevance Judgments
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 405–414https://doi.org/10.1145/3077136.3080840To address concerns of TREC-style relevance judgments, we explore two improvements. The first one seeks to make relevance judgments contextual, collecting in situ feedback of users in an interactive search session and embracing usefulness as the primary ...
- research-articleMarch 2017Best Student Paper
Understanding Ephemeral State of Relevance
CHIIR '17: Proceedings of the 2017 Conference on Conference Human Information Interaction and RetrievalPages 137–146https://doi.org/10.1145/3020165.3020176Despite its dynamic nature, relevance is often measured in a context-independent manner in information retrieval practice. We look into this discrepancy. We propose a contextual relevance/usefulness measurement called ephemeral state of relevance (ESR), ...
- short-paperAugust 2015
Differences in Eye-Tracking Measures Between Visits and Revisits to Relevant and Irrelevant Web Pages
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 811–814https://doi.org/10.1145/2766462.2767795This short paper presents initial results from a project, in which we investigated differences in how users view relevant and irrelevant Web pages on their visits and revisits. The users' viewing of Web pages was characterized by eye-tracking measures, ...
- short-paperAugust 2015
A Test Collection for Spoken Gujarati Queries
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 919–922https://doi.org/10.1145/2766462.2767791The development of a new test collection is described in which the task is to search naturally occurring spoken content using naturally occurring spoken queries. To support research on speech retrieval for low-resource settings, the collection includes ...
- research-articleJune 2015
Debugging a Crowdsourced Task with Low Inter-Rater Agreement
JCDL '15: Proceedings of the 15th ACM/IEEE-CS Joint Conference on Digital LibrariesPages 101–110https://doi.org/10.1145/2756406.2757741In this paper, we describe the process we used to debug a crowdsourced labeling task with low inter-rater agreement. In the labeling task, the workers' subjective judgment was used to detect high-quality social media content-interesting tweets-with the ...
- research-articleJuly 2014
Multidimensional relevance modeling via psychometrics and crowdsourcing
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrievalPages 435–444https://doi.org/10.1145/2600428.2609577While many multidimensional models of relevance have been posited, prior studies have been largely exploratory rather than confirmatory. Lacking a methodological framework to quantify the relationships among factors or measure model fit to observed data,...
- research-articleDecember 2013
Choices in batch information retrieval evaluation
ADCS '13: Proceedings of the 18th Australasian Document Computing SymposiumPages 74–81https://doi.org/10.1145/2537734.2537745Web search tools are used on a daily basis by billions of people. The commercial providers of these services spend large amounts of money measuring their own effectiveness and benchmarking against their competitors; nothing less than their corporate ...