Computer Science > Machine Learning

arXiv:2404.04871 (cs)

[Submitted on 7 Apr 2024]

Title:Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels

Abstract:In the realm of continual learning, the presence of noisy labels within data streams represents a notable obstacle to model reliability and fairness. We focus on the data stream scenario outlined in pertinent literature, characterized by fuzzy task boundaries and noisy labels. To address this challenge, we introduce a novel and intuitive sampling method called Noisy Test Debiasing (NTD) to mitigate noisy labels in evolving data streams and establish a fair and robust continual learning algorithm. NTD is straightforward to implement, making it feasible across various scenarios. Our experiments benchmark four datasets, including two synthetic noise datasets (CIFAR10 and CIFAR100) and real-world noise datasets (mini-WebVision and Food-101N). The results validate the efficacy of NTD for online continual learning in scenarios with noisy labels in data streams. Compared to the previous leading approach, NTD achieves a training speedup enhancement over two times while maintaining or surpassing accuracy levels. Moreover, NTD utilizes less than one-fifth of the GPU memory resources compared to previous leading methods.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.04871 [cs.LG]
	(or arXiv:2404.04871v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.04871

Submission history

From: Yu-Hsi Chen [view email]
[v1] Sun, 7 Apr 2024 08:32:16 UTC (134 KB)

Computer Science > Machine Learning

Title:Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators