Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.04379 (cs)

[Submitted on 7 May 2023 (v1), last revised 6 Jun 2023 (this version, v5)]

Title:Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Authors:Xin Shen, Praful Agrawal, Zhongwei Cheng

View PDF

Abstract:Multi-label classification models have a wide range of applications in E-commerce, including visual-based label predictions and language-based sentiment classifications. A major challenge in achieving satisfactory performance for these tasks in the real world is the notable imbalance in data distribution. For instance, in fashion attribute detection, there may be only six 'puff sleeve' clothes among 1000 products in most E-commerce fashion catalogs. To address this issue, we explore more data-efficient model training techniques rather than acquiring a huge amount of annotations to collect sufficient samples, which is neither economic nor scalable. In this paper, we propose a state-of-the-art weighted objective function to boost the performance of deep neural networks (DNNs) for multi-label classification with long-tailed data distribution. Our experiments involve image-based attribute classification of fashion apparels, and the results demonstrate favorable performance for the new weighting method compared to non-weighted and inverse-frequency-based weighting mechanisms. We further evaluate the robustness of the new weighting mechanism using two popular fashion attribute types in today's fashion industry: sleevetype and archetype.

Comments:	We have identified a substantial error in the experimental results and a potentially misleading explanation of the algorithm. We kindly request that you consider withdrawing this version to mitigate the risk of disseminating inaccurate information
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.04379 [cs.CV]
	(or arXiv:2305.04379v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.04379

Submission history

From: Xin Shen [view email]
[v1] Sun, 7 May 2023 21:25:09 UTC (1,640 KB)
[v2] Tue, 9 May 2023 07:54:52 UTC (1,640 KB)
[v3] Mon, 15 May 2023 21:01:09 UTC (1,640 KB)
[v4] Sat, 3 Jun 2023 19:23:08 UTC (1 KB) (withdrawn)
[v5] Tue, 6 Jun 2023 07:33:13 UTC (1,641 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators