Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3651781.3651818acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Robust Contrastive Learning: PSSI - Image Entropy Positive Samples with Semantic Invariance

Published: 30 May 2024 Publication History

Abstract

Data augmentation is a popular technique used to increase the amount of training data and improve model generalization in the contrastive learning framework. However, traditional enhancement methods often struggle to balance the preservation of semantic information and the enhancement of generalization ability. In this paper, we propose a lightweight data augmentation method called Positive Samples with Semantic Invariance (PSSI) that aims to preserve image semantics and model generalization to the greatest extent possible. PSSI uses an innovative edge blurring technique to emphasize the model’s learning of high-level semantic information while maintaining image entropy. We show that PSSI outperforms other methods without increasing model complexity and is more robust in sparse sample scenarios. Specifically, our experiments on popular image classification datasets demonstrate that PSSI improves model performance by 1.72% with no missing samples and 4.05% with missing samples. Our results highlight the effectiveness of PSSI in balancing the preservation of semantic information and the enhancement of generalization ability, which can help improve model performance in various applications.

References

[1]
Srikanth Bethu, M. Neelakantappa, A. Swami Goud, B. Hari Krishna, and P. N. V. Syamala Rao M. 2023. An Approach for Person Detection along with Object Using Machine Learning. Journal of Advances in Information Technology 14, 3 (2023), 411–417.
[2]
Ting Chen, Kornblith Simon, Norouzi Mohammad, and Hinton Geoffrey. 2020. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597–1607.
[3]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
[4]
Geoffrey E Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, and Ruslan R Salakhutdinov. 2012. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012).
[5]
Xu Ji, Joao F Henriques, and Andrea Vedaldi. 2019. Invariant information clustering for unsupervised image classification and segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9865–9874.
[6]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980–2988.
[7]
Byong Seok Min, Dong Kyun Lim, Seung Jong Kim, and Joo Heung Lee. 2013. A novel method of determining parameters of CLAHE based on image entropy. International Journal of Software Engineering and Its Applications 7, 5 (2013), 113–120.
[8]
Luyl-Da Quach, Khang Nguyen Quoc, Anh Nguyen Quynh, and Hoang Tran Ngoc. 2023. Evaluating the Effectiveness of YOLO Models in Different Sized Object Detection and Feature-Based Classification of Small Objects. Journal of Advances in Information Technology 14, 5 (2023), 907–917.
[9]
Fan-Yun Sun, Jordan Hoffmann, Vikas Verma, and Jian Tang. 2019. Infograph: Unsupervised and semi-supervised graph-level representation learning via mutual information maximization. arXiv preprint arXiv:1908.01000 (2019).
[10]
Becker Suzanna and Hinton Geoffrey E. 1992. Self-organizing neural network that discovers surfaces in random-dot stereograms. Nature 355, 6356 (1992), 161–163.
[11]
Du-Yih Tsai, Yongbum Lee, and Eri Matsuyama. 2008. Information entropy measure for evaluation of image quality. Journal of digital imaging 21, 3 (2008), 338–347.
[12]
Vikas Verma, Meng Qu, Alex Lamb, Yoshua Bengio, Juho Kannala, and Jian Tang. 2019. GraphMix: Regularized training of graph neural networks for semi-supervised learning. arxiv e-prints, art. arXiv preprint arXiv:1909.11715 (2019).
[13]
Adedeji Olugboja Zenghui Wang and Yanxia Sun. 2021. Parallel Convolutional Neural Networks for Object Detection. Journal of Advances in Information Technology 12, 4 (2021), 279–286.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICSCA '24: Proceedings of the 2024 13th International Conference on Software and Computer Applications
February 2024
395 pages
ISBN:9798400708329
DOI:10.1145/3651781
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Image entropy
  2. computer vision
  3. contrastive learning
  4. data augmentation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSCA 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 7
    Total Downloads
  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media