Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3664647.3681658acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article
Open access

Query Augmentation with Brain Signals

Published: 28 October 2024 Publication History

Abstract

In the information retrieval scenario, query augmentation is an essential technique to refine semantically imprecise queries to align more closely with users' actual information needs. Traditional methods typically rely on extracting signals from user interactions such as browsing or clicking behaviors to augment the queries, which may not accurately reflect the actual user intent due to inherent noise and the dependency on initial user interactions. To overcome these limitations, we introduce Brain-Aug, a novel approach that decodes semantic information directly from brain signals of users to augment query representation. Brain-Aug builds on three techniques: (i) Structurally, an adapter network is utilized to project brain signals into the embedding space of a language model, allowing query augmentation conditioned on both the users' initial query and their brain signals. (ii) During training, we use a next token prediction task for query augmentation and adopt prompt tuning to efficiently train the brain adapter. (iii)At the inference stage, a ranking-oriented decoding strategy is implemented, enabling Brain-Aug to generate augmentations that improve ranking performance. We evaluate our approach on multiple functional magnetic resonance imaging (fMRI) datasets, demonstrating that Brain-Aug not only produces semantically richer queries but also significantly improves document ranking accuracy, particularly for ambiguous queries. These results validate the effectiveness of Brain-Aug, and reveal the potential of using internal cognitive states to understand and augment text-based queries. Supplementary materials and code are available at https://github.com/YeZiyi1998/Brain-Query-Augmentation.

Supplemental Material

MP4 File - Query Augmentation with Brain Signals
This is the video presenting our paper titled "Query Augmentation with Brain Signals". We propose Brain-Aug, which decodes semantic information directly from brain signals and augments the query by LLM-based generation. We demonstrate that Brain-Aug not only produces semantically richer queries but also significantly improves document ranking accuracy, particularly for ambiguous queries. This advances the BCI-based search system and brings it closer to becoming a reality.

References

[1]
Wasi Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. 2019. Context Attentive Document Ranking and Query Suggestion. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 385--394.
[2]
Marco Allegretti, Yashar Moshfeghi, Maria Hadjigeorgieva, Frank E. Pollick, Joemon M. Jose, and Gabriella Pasi. 2015. When Relevance Judgement Is Happening? An EEG-based Study. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. 719--722.
[3]
Keping Bi, Qingyao Ai, and W Bruce Croft. 2019. Iterative Relevance Feedback for Answer Passage Retrieval with Pasage-level Semantic Match. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14--18, 2019, Proceedings, Part I 41. Springer, 558--572.
[4]
Jia Chen, Jiaxin Mao, Yiqun Liu, Ziyi Ye, Weizhi Ma, Chao Wang, Min Zhang, and Shaoping Ma. 2021. A Hybrid Framework for Session Context Modeling. ACM Transactions on Information Systems, Vol. 39, 3 (2021), 1--35.
[5]
Xuesong Chen, Ziyi Ye, Xiaohui Xie, Yiqun Liu, Xiaorong Gao, Weihang Su, Shuqi Zhu, Yike Sun, Min Zhang, and Shaoping Ma. 2022. Web Search via an Efficient and Effective Brain-Machine Interface. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 1569--1572.
[6]
Steve Cronen-Townsend, Yun Zhou, and W Bruce Croft. 2002. Predicting Query Performance. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 299--306.
[7]
Alexandre Défossez, Charlotte Caucheteux, Jérémy Rapin, Ori Kabeli, and Jean-Rémi King. 2023. Decoding Speech Perception from Non-invasive Brain Recordings. Nature Machine Intelligence (2023), 1--11.
[8]
Manuel J.A. Eugster, Tuukka Ruotsalo, Michiel M Spapé, Oswald Barral, Niklas Ravaja, Giulio Jacucci, and Samuel Kaski. 2016. Natural Brain-information Interfaces: Recommending Information by Relevance Inferred from Human Brain Signals. Scientific Reports, Vol. 6, 1 (2016), 38580.
[9]
Manuel J.A. Eugster, Tuukka Ruotsalo, Michiel M Spapé, Ilkka Kosunen, Oswald Barral, Niklas Ravaja, Giulio Jacucci, and Samuel Kaski. 2014. Predicting Term-relevance from Brain Signals. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. 425--434.
[10]
Kunihiko Fukushima. 1980. Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position. Biological Cybernetics, Vol. 36, 4 (1980), 193--202.
[11]
Jacek Gwizdka, Rahilsadat Hosseini, Michael Cole, and Shouyi Wang. 2017. Temporal dynamics of eye-tracking and EEG during reading and relevance decisions. Journal of the Association for Information Science and Technology, Vol. 68, 10 (2017), 2299--2312.
[12]
Gautier Izacard, Mathilde Caron, Lucas Hosseini, Sebastian Riedel, Piotr Bojanowski, Armand Joulin, and Edouard Grave. 2021. Unsupervised Dense Information Retrieval with Contrastive Learning. arXiv preprint arXiv:2112.09118 (2021).
[13]
Kalervo Järvelin and Jaana Kekäläinen. 2002. Cumulated Gain-based Evaluation of IR Techniques. ACM Transactions on Information Systems, Vol. 20, 4 (2002), 422--446.
[14]
Kalervo Järvelin and Jaana Kekäläinen. 2017. IR Evaluation Methods for Retrieving Highly Relevant Documents. In ACM SIGIR Forum, Vol. 51. ACM New York, NY, USA, 243--250.
[15]
Emilia Kacprzak, Laura M Koesten, Luis-Daniel Ibá nez, Elena Simperl, and Jeni Tennison. 2017. A Query Log Analysis of Dataset Search. In Web Engineering: 17th International Conference, ICWE 2017, Rome, Italy, June 5--8, 2017, Proceedings 17. Springer, 429--436.
[16]
Victor Lavrenko and W Bruce Croft. 2017. Relevance-based Language Models. In ACM SIGIR Forum, Vol. 51. ACM New York, NY, USA, 260--267.
[17]
Amanda LeBel, Lauren Wagner, Shailee Jain, Aneesh Adhikari-Desai, Bhavin Gupta, Allyson Morgenthal, Jerry Tang, Lixiang Xu, and Alexander G. Huth. 2023. A Natural Language fMRI Dataset for Voxelwise Encoding Models. Scientific Data, Vol. 10, 1 (2023), 555.
[18]
Kenton Lee, Ming-Wei Chang, and Kristina Toutanova. 2019. Latent retrieval for weakly supervised open domain question answering. arXiv preprint arXiv:1906.00300 (2019).
[19]
Hang Li, Harrisen Scells, and Guido Zuccon. 2020. Systematic Review Automation Tools for End-to-end Query Formulation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2141--2144.
[20]
Chin-Yew Lin. 2004. Rouge: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out. 74--81.
[21]
Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. 2023. Visual Instruction Tuning. arXiv preprint arXiv:2304.08485 (2023).
[22]
Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, and Jie Tang. 2023. GPT Understands, Too. AI Open (2023).
[23]
Yizhuo Lu, Changde Du, Qiongyi Zhou, Dianpeng Wang, and Huiguang He. 2023. MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion. In Proceedings of the 31st ACM International Conference on Multimedia. 5899--5908.
[24]
Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, and Jimmy Lin. 2023. Fine-tuning LLaMA for Multi-stage Text Retrieval. arXiv preprint arXiv:2310.08319 (2023).
[25]
Qiaozhu Mei, Dengyong Zhou, and Kenneth Church. 2008. Query Suggestion Using Hitting Time. In Proceedings of the 17th ACM Conference on Information and Knowledge Management. 469--478.
[26]
Clara Meister and Ryan Cotterell. 2021. Language Model Evaluation Beyond Perplexity. arXiv preprint arXiv:2106.00085 (2021).
[27]
Chuan Meng, Negar Arabzadeh, Mohammad Aliannejadi, and Maarten de Rijke. 2023. Query Performance Prediction: From Ad-hoc to Conversational Search. arXiv preprint arXiv:2305.10923 (2023).
[28]
Dominika Michalkova, Mario Parra Rodriguez, and Yashar Moshfeghi. 2024. Understanding Feeling-of-Knowing in Information Search: An EEG Study. ACM Transactions on Information Systems, Vol. 42, 3 (2024), 1--30.
[29]
Tom M. Mitchell, Svetlana V. Shinkareva, Andrew Carlson, Kai-Min Chang, Vicente L. Malave, Robert A. Mason, and Marcel Adam Just. 2008. Predicting Human Brain Activity Associated with the Meanings of Nouns. science, Vol. 320, 5880 (2008), 1191--1195.
[30]
Yashar Moshfeghi, Peter Triantafillou, and Frank E Pollick. 2016. Understanding Information Need: An fMRI Study. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. 335--344.
[31]
Javed Mostafa and Jacek Gwizdka. 2016. Deepening the Role of the User: Neuro-physiological Evidence as a Basis for Studying and Improving Search. In Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval. 63--70.
[32]
Samuel A. Nastase, Yun-Fei Liu, Hanna Hillman, Asieh Zadbood, Liat Hasenfratz, Neggin Keshavarzian, Janice Chen, Christopher J. Honey, Yaara Yeshurun, Mor Regev, et al. 2021. The 'Narratives' fMRI Dataset for Evaluating Models of Naturalistic Language Comprehension. Scientific Data, Vol. 8, 1 (2021), 250.
[33]
Francisco Pereira, Bin Lou, Brianna Pritchett, Samuel Ritter, Samuel J. Gershman, Nancy Kanwisher, Matthew Botvinick, and Evelina Fedorenko. 2018. Toward a Universal Decoder of Linguistic Meaning from Brain Activation. Nature Communications, Vol. 9, 1 (2018), 963.
[34]
Mateus Pereira, Elham Etemad, and Fernando Paulovich. 2020. Iterative Learning to Rank from Explicit Relevance Feedback. In Proceedings of the 35th Annual ACM Symposium on Applied Computing. 698--705.
[35]
Zuzana Pinkosova, William J. McGeown, and Yashar Moshfeghi. 2020. The Cortical Activity of Graded Relevance. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 299--308.
[36]
Yansheng Qiu, Ziyuan Zhao, Hongdou Yao, Delin Chen, and Zheng Wang. 2023. Modal-aware Visual Prompting for Incomplete Multi-modal Brain Tumor Segmentation. In Proceedings of the 31st ACM International Conference on Multimedia. 3228--3239.
[37]
Stephen Robertson. 2004. Understanding Inverse Document Frequency: On Theoretical Arguments for IDF. Journal of Documentation, Vol. 60, 5 (2004), 503--520.
[38]
Stephen Robertson and Hugo Zaragoza. 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval, Vol. 3, 4 (2009), 333--389.
[39]
Joseph John Rocchio Jr. 1971. Relevance Feedback in Information Retrieval. The SMART Retrieval System: Experiments in Automatic Document Processing (1971).
[40]
Anna Shtok, Oren Kurland, David Carmel, Fiana Raiber, and Gad Markovits. 2012. Predicting Query Performance by Query-drift Estimation. ACM Transactions on Information Systems, Vol. 30, 2 (2012), 1--35.
[41]
Nandan Thakur, Nils Reimers, Andreas Rücklé, Abhishek Srivastava, and Iryna Gurevych. 2021. BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models. arXiv preprint arXiv:2104.08663 (2021).
[42]
Yuki Yano, Yukihiro Tagami, and Akira Tajima. 2016. Quantifying Query Ambiguity with Topic Distributions. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 1877--1880.
[43]
Ziyi Ye, Qingyao Ai, Yiqun Liu, Min Zhang, Christina Lioma, and Tuukka Ruotsalo. 2023. Language Generation from Human Brain Activities. arXiv preprint arXiv:2311.09889 (2023).
[44]
Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su, and Min Zhang. 2024. Relevance Feedback with Brain Signals. ACM Transactions on Information Systems, Vol. 42, 4 (2024), Article No. 93.
[45]
Ziyi Ye, Xiaohui Xie, Yiqun Liu, Zhihong Wang, Xuesong Chen, Min Zhang, and Shaoping Ma. 2022. Brain Topography Adaptive Network for Satisfaction Modeling in Interactive Information Access System. In Proceedings of the 30th ACM International Conference on Multimedia. 90--100.
[46]
Ziyi Ye, Xiaohui Xie, Yiqun Liu, Zhihong Wang, Xuesong Chen, Min Zhang, and Shaoping Ma. 2022. Towards a Better Understanding of Human Reading Comprehension with Brain Signals. In Proceedings of the ACM Web Conference 2022. 380--391.
[47]
Ziyi Ye, Xiaohui Xie, Yiqun Liu, Zhihong Wang, Xuancheng Li, Jiaji Li, Xuesong Chen, Min Zhang, and Shaoping Ma. 2022. Why Don't You Click: Understanding Non-Click Results in Web Search with Brain Signals. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 633--645.
[48]
Shuxian Zou, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. 2021. Towards Brain-to-Text Generation: Neural Decoding with Pre-trained Encoder-Decoder Models. In NeurIPS 2021 AI for Science Workshop.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
October 2024
11719 pages
ISBN:9798400706868
DOI:10.1145/3664647
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Check for updates

Author Tags

  1. brain-computer interface (bci)
  2. prompt tunning
  3. query augmentation

Qualifiers

  • Research-article

Funding Sources

  • the Horizon 2020 FET program of the EU through the ERA-NET Cofund
  • the Dutch Research Council
  • the Academy of Finland
  • the European Union's Horizon Europe program
  • QuanCheng Laboratory

Conference

MM '24
Sponsor:
MM '24: The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 188
    Total Downloads
  • Downloads (Last 12 months)188
  • Downloads (Last 6 weeks)66
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media