Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3393822.3432321acmotherconferencesArticle/Chapter ViewAbstractPublication PagesesseConference Proceedingsconference-collections
research-article

Dog and Cat Classification with Deep Residual Network

Published: 21 December 2020 Publication History

Abstract

I With the development of artificial intelligence, the deep neural network(DNN) has achieved excellent results in image processing domain such as image classification[1] and objct detection[2]. The convolution neural networks(CNN) [3] is a Representative algorithm of DNN which have the representation learning ability. According to its convolutional structure, input information is extracted with translation invariance. Based on the widely used CNN, there are many efficient models. For image classification there are Lenet-5[4], VGG[5], Resnet[6] and so on. For object detection, the yolo series[7] is well-known. Also few well known datasets are proposed to measure their performance such as ImageNet and Cifar-10[8]. These data sets are dedicated to the classification of multiple objects in natural scenes. Nowadays, pets play an increasingly important role in our life, so we built a cat and dog dataset, each of which categories with 12500 samples which is larger then 1260 in Imagenet. For our dataset, we trained an image classification model. We focus on the performance of distinguish dog and cat In different scenes, lighting and noise. Our method achieved an accuracy of 92.7 percent and remained robust under adversarial attack.

References

[1]
Krizhevsky, Alex, I. Sutskever, and G. Hinton. "ImageNet Classification with Deep Convolutional Neural Networks." Advances in neural information processing systems 25.2(2012)
[2]
Everingham, Mark, et al. "The Pascal Visual Object Classes (VOC) Challenge." International Journal of Computer Vision 88.2(2010):p.303--338.
[3]
Szegedy, Christian, et al. "Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning." (2016).
[4]
Ahmed El-Sawy, Hazem EL-Bakry, and Mohamed Loey. "CNN for Handwritten Arabic Digits Recognition Based on LeNet-5." (2016).
[5]
Simonyan, Karen, and A. Zisserman. "Very Deep Convolutional Networks for Large-Scale Image Recognition." Computer ence (2014).
[6]
He, Kaiming, et al. "Deep Residual Learning for Image Recognition." 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) IEEE, 2016.
[7]
Redmon, Joseph, and A. Farhadi. "YOLOv3: An Incremental Improvement." (2018).
[8]
Recht, Benjamin, et al. "Do CIFAR-10 Classifiers Generalize to CIFAR-10." (2018).
[9]
Joachims, Thorsten. "Making large-scale SVM learning practical." Technical Reports 8.3(1998): 499--526.
[10]
Deng, L. "The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]. " Signal Processing Magazine, IEEE 29.6(2012):p.141--142.
[11]
Park, Dongwon, and S. Y. Chun. "Classification based Grasp Detection using Spatial Transformer Network." (2018).
[12]
Boer, Pieter Tjerk De, et al. "A Tutorial on the Cross-Entropy Method. " Annals of Operations Research 134.1(2005): 19--67.
[13]
Kingma, Diederik P, and J. Ba. "Adam: A Method for Stochastic Optimization." Computer ence (2014).
[14]
Goodfellow, Ian J, Shlens, Jonathon, and Szegedy, Christian. "Explaining and Harnessing Adversarial Examples." Computer ence (2014).
[15]
Park, M. "L1-regularization path algorithm for generalized linear models." Journal of the Royal Statistical Society 69.4(2007): 659--677.
[16]
Yu, Wei, et al. "Visualizing and Comparing Convolutional Neural Networks." Computer Science (2014).

Cited By

View all
  • (2024)ClipArtGAN: An Application of Pix2Pix Generative Adversarial Network for Clip Art GenerationMultimedia Tools and Applications10.1007/s11042-024-20361-1Online publication date: 22-Oct-2024
  • (2023)Image Classification of Occluded and Non-Occluded Cats and Dogs Datasets with Machine Learning2023 IEEE Symposium on Computers & Informatics (ISCI)10.1109/ISCI58771.2023.10391871(19-24)Online publication date: 14-Oct-2023
  • (2023)SRViT: A Vision Transformer via Super-Resolution2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528557(1387-1394)Online publication date: 10-Nov-2023
  • Show More Cited By

Index Terms

  1. Dog and Cat Classification with Deep Residual Network

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ESSE '20: Proceedings of the 2020 European Symposium on Software Engineering
    November 2020
    220 pages
    ISBN:9781450377621
    DOI:10.1145/3393822
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • UNIBO: University of Bologna

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 December 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Neural network
    2. adversarial attack
    3. convolution neural networks(CNN)
    4. image classification

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ESSE 2020

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)56
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 20 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)ClipArtGAN: An Application of Pix2Pix Generative Adversarial Network for Clip Art GenerationMultimedia Tools and Applications10.1007/s11042-024-20361-1Online publication date: 22-Oct-2024
    • (2023)Image Classification of Occluded and Non-Occluded Cats and Dogs Datasets with Machine Learning2023 IEEE Symposium on Computers & Informatics (ISCI)10.1109/ISCI58771.2023.10391871(19-24)Online publication date: 14-Oct-2023
    • (2023)SRViT: A Vision Transformer via Super-Resolution2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528557(1387-1394)Online publication date: 10-Nov-2023
    • (2022)PVGAN: a generative adversarial network for object simplification in prosthetic visionJournal of Neural Engineering10.1088/1741-2552/ac8acf19:5(056007)Online publication date: 7-Sep-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media