Online Label Aggregation: A Variational Bayesian Approach

Authors:

Amirmasoud Ghiassi,

Lydia Y. Chen

WWW '21: Proceedings of the Web Conference 2021

Pages 1904 - 1915

https://doi.org/10.1145/3442381.3449933

Published: 03 June 2021

Abstract

Noisily labeled data is more the norm than a rarity in crowdsourced content. Aggregating the results from crowd workers is an effective way to filter out noise and infer the correct labels. To keep results timely and to cope with slow worker responses, online label aggregation is increasingly in demand, calling for solutions that can incrementally infer the true label distribution from subsets of data items. In this paper, we propose a novel online label aggregation framework, BiLA, which employs variational Bayesian inference and a novel stochastic optimization scheme for incremental training. BiLA can accommodate any generating distribution of labels through the exact computation of its posterior distribution. We also derive a convergence bound for the proposed optimizer. We compare BiLA with state-of-the-art methods based on minimax entropy, neural networks, and expectation maximization on synthetic and real-world datasets. Our evaluation on various online scenarios shows that BiLA can effectively infer the true labels, reducing the error rate by at least 10 and 1.5 percentage points on synthetic and real-world datasets, respectively.
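To make the online setting concrete, the following is a minimal sketch of incremental label aggregation over a stream of item batches. It is not BiLA and does not use the paper's optimizer: it assumes a simplified Dawid-Skene-style worker model with Dirichlet pseudo-counts, a uniform class prior, and soft-count updates. The class name `OnlineAggregator`, its parameters, and the worker and item identifiers are all hypothetical and chosen only for illustration.

```python
import math


class OnlineAggregator:
    """Streaming Dawid-Skene-style aggregator (illustrative, NOT BiLA).

    Assumption: each worker has a K x K matrix of Dirichlet pseudo-counts,
    where row t, column l counts how often the worker answered l when the
    true label was t.
    """

    def __init__(self, num_classes, prior_correct=2.0, prior_wrong=1.0):
        self.k = num_classes
        self.prior_correct = prior_correct  # pseudo-count on the diagonal
        self.prior_wrong = prior_wrong      # pseudo-count off the diagonal
        self.counts = {}                    # worker id -> K x K pseudo-counts

    def _matrix(self, worker):
        # Lazily create the prior pseudo-counts for a worker seen for the first time.
        if worker not in self.counts:
            self.counts[worker] = [
                [self.prior_correct if t == l else self.prior_wrong
                 for l in range(self.k)]
                for t in range(self.k)
            ]
        return self.counts[worker]

    def posterior(self, votes):
        """votes: list of (worker, label). Returns P(true label = t | votes)."""
        log_p = [0.0] * self.k  # uniform class prior (an assumption)
        for worker, label in votes:
            m = self._matrix(worker)
            for t in range(self.k):
                # Mean of the Dirichlet row as the worker's response probability.
                log_p[t] += math.log(m[t][label] / sum(m[t]))
        top = max(log_p)
        weights = [math.exp(v - top) for v in log_p]  # stable normalization
        total = sum(weights)
        return [w / total for w in weights]

    def update(self, batch):
        """batch: dict item -> list of (worker, label). Returns item -> label."""
        # Infer soft labels for the whole batch with the current worker model.
        posts = {item: self.posterior(votes) for item, votes in batch.items()}
        # Fold the soft labels back into the workers' pseudo-counts.
        for item, votes in batch.items():
            for worker, label in votes:
                m = self._matrix(worker)
                for t in range(self.k):
                    m[t][label] += posts[item][t]
        return {item: max(range(self.k), key=lambda t: p[t])
                for item, p in posts.items()}


# Usage: labels arrive in small batches and are aggregated as they come in.
agg = OnlineAggregator(num_classes=2)
stream = [
    {"item1": [("a", 0), ("b", 0), ("c", 1)]},
    {"item2": [("a", 1), ("c", 0)]},
]
for batch in stream:
    print(agg.update(batch))
```

Each batch is processed once and then discarded, so only the per-worker pseudo-counts are kept between batches. A full variational treatment would typically use expected log-probabilities under the Dirichlet rather than the row means used here; that simplification is deliberate to keep the sketch short.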

Cited By

Yang Y, Zhao Z, Wu G, Zhuo X, Liu Q, Bai Q, Li W. (2024). A Lightweight, Effective, and Efficient Model for Label Aggregation in Crowdsourcing. ACM Transactions on Knowledge Discovery from Data 18:4 (1-27). Online publication date: 13-Feb-2024. https://dl.acm.org/doi/10.1145/3630102

Wu G, Zhuo X, Bao X, Hu X, Hong R, Wu X. (2023). Crowdsourcing Truth Inference via Reliability-Driven Multi-View Graph Embedding. ACM Transactions on Knowledge Discovery from Data 17:5 (1-26). Online publication date: 27-Feb-2023. https://dl.acm.org/doi/10.1145/3565576

Published In

WWW '21: Proceedings of the Web Conference 2021

April 2021

4054 pages

ISBN:9781450383127

DOI:10.1145/3442381

Editors: Jure Leskovec (Stanford), Marko Grobelnik (Jožef Stefan Institute), Marc Najork (Google), Jie Tang (Tsinghua University), Leila Zia (Wikimedia Foundation)

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '21

Sponsor:

SIGWEB

WWW '21: The Web Conference 2021

April 19 - 23, 2021

Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
