Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Learning from binary labels with instance-dependent noise

Published: 01 September 2018 Publication History

Abstract

Supervised learning has seen numerous theoretical and practical advances over the last few decades. However, its basic assumption of identical train and test distributions often fails to hold in practice. One important example of this is when the training instances are subject to label noise: that is, where the observed labels do not accurately reflect the underlying ground truth. While the impact of simple noise models has been extensively studied, relatively less attention has been paid to the practically relevant setting of instance-dependent label noise. It is thus unclear whether one can learn, both in theory and in practice, good models from data subject to such noise, with no access to clean labels. We provide a theoretical analysis of this issue, with three contributions. First, we prove that for instance-dependent (but label-independent) noise, any algorithm that is consistent for classification on the noisy distribution is also consistent on the noise-free distribution. Second, we prove that consistency also holds for the area under the ROC curve, assuming the noise scales (in a precise sense) with the inherent difficulty of an instance. Third, we show that the Isotron algorithm can efficiently and provably learn from noisy samples when the noise-free distribution is a generalised linear model. We empirically confirm our theoretical findings, which we hope may stimulate further analysis of this important learning setting.

Cited By

View all
  • (2024)Weakly Supervised Solar Panel Mapping via Uncertainty Adjusted Label Transition in Aerial ImagesIEEE Transactions on Image Processing10.1109/TIP.2023.333617033(881-896)Online publication date: 1-Jan-2024
  • (2024)Efficient and robust active learning methods for interactive database explorationThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00816-x33:4(931-956)Online publication date: 1-Jul-2024
  • (2023)Class-Conditional Label Noise in Astroparticle PhysicsMachine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track10.1007/978-3-031-43427-3_2(19-35)Online publication date: 18-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Machine Language
Machine Language  Volume 107, Issue 8-10
September 2018
428 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 September 2018

Author Tags

  1. Consistency
  2. Instance-dependent noise
  3. Label noise

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Weakly Supervised Solar Panel Mapping via Uncertainty Adjusted Label Transition in Aerial ImagesIEEE Transactions on Image Processing10.1109/TIP.2023.333617033(881-896)Online publication date: 1-Jan-2024
  • (2024)Efficient and robust active learning methods for interactive database explorationThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00816-x33:4(931-956)Online publication date: 1-Jul-2024
  • (2023)Class-Conditional Label Noise in Astroparticle PhysicsMachine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track10.1007/978-3-031-43427-3_2(19-35)Online publication date: 18-Sep-2023
  • (2022)Beyond confusion matrix: learning from multiple annotators with awareness of instance featuresMachine Language10.1007/s10994-022-06211-x112:3(1053-1075)Online publication date: 7-Jul-2022
  • (2022)Enhanced Data-Recalibration: Utilizing Validation Data to Mitigate Instance-Dependent Noise in ClassificationImage Analysis and Processing – ICIAP 202210.1007/978-3-031-06427-2_52(621-632)Online publication date: 23-May-2022
  • (2021)Efficiently learning halfspaces with Tsybakov noiseProceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing10.1145/3406325.3450998(88-101)Online publication date: 15-Jun-2021
  • (2020)SIGUAProceedings of the 37th International Conference on Machine Learning10.5555/3524938.3525313(4006-4016)Online publication date: 13-Jul-2020
  • (2020)Learning with bounded instance- and label-dependent label noiseProceedings of the 37th International Conference on Machine Learning10.5555/3524938.3525105(1789-1799)Online publication date: 13-Jul-2020
  • (2020)Learnability with indirect supervision signalsProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3496491(9145-9155)Online publication date: 6-Dec-2020
  • (2020)Part-dependent label noiseProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3496361(7597-7610)Online publication date: 6-Dec-2020

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media