research-article

Dog and Cat Classification with Deep Residual Network

Author:

Yao Yu chenAuthors Info & Claims

ESSE '20: Proceedings of the 2020 European Symposium on Software Engineering

Pages 137 - 141

https://doi.org/10.1145/3393822.3432321

Published: 21 December 2020 Publication History

Abstract

I With the development of artificial intelligence, the deep neural network(DNN) has achieved excellent results in image processing domain such as image classification[1] and objct detection[2]. The convolution neural networks(CNN) [3] is a Representative algorithm of DNN which have the representation learning ability. According to its convolutional structure, input information is extracted with translation invariance. Based on the widely used CNN, there are many efficient models. For image classification there are Lenet-5[4], VGG[5], Resnet[6] and so on. For object detection, the yolo series[7] is well-known. Also few well known datasets are proposed to measure their performance such as ImageNet and Cifar-10[8]. These data sets are dedicated to the classification of multiple objects in natural scenes. Nowadays, pets play an increasingly important role in our life, so we built a cat and dog dataset, each of which categories with 12500 samples which is larger then 1260 in Imagenet. For our dataset, we trained an image classification model. We focus on the performance of distinguish dog and cat In different scenes, lighting and noise. Our method achieved an accuracy of 92.7 percent and remained robust under adversarial attack.

References

[1]

Krizhevsky, Alex, I. Sutskever, and G. Hinton. "ImageNet Classification with Deep Convolutional Neural Networks." Advances in neural information processing systems 25.2(2012)

[2]

Everingham, Mark, et al. "The Pascal Visual Object Classes (VOC) Challenge." International Journal of Computer Vision 88.2(2010):p.303--338.

[3]

Szegedy, Christian, et al. "Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning." (2016).

[4]

Ahmed El-Sawy, Hazem EL-Bakry, and Mohamed Loey. "CNN for Handwritten Arabic Digits Recognition Based on LeNet-5." (2016).

[5]

Simonyan, Karen, and A. Zisserman. "Very Deep Convolutional Networks for Large-Scale Image Recognition." Computer ence (2014).

[6]

He, Kaiming, et al. "Deep Residual Learning for Image Recognition." 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) IEEE, 2016.

[7]

Redmon, Joseph, and A. Farhadi. "YOLOv3: An Incremental Improvement." (2018).

[8]

Recht, Benjamin, et al. "Do CIFAR-10 Classifiers Generalize to CIFAR-10." (2018).

[9]

Joachims, Thorsten. "Making large-scale SVM learning practical." Technical Reports 8.3(1998): 499--526.

[10]

Deng, L. "The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]. " Signal Processing Magazine, IEEE 29.6(2012):p.141--142.

[11]

Park, Dongwon, and S. Y. Chun. "Classification based Grasp Detection using Spatial Transformer Network." (2018).

[12]

Boer, Pieter Tjerk De, et al. "A Tutorial on the Cross-Entropy Method. " Annals of Operations Research 134.1(2005): 19--67.

[13]

Kingma, Diederik P, and J. Ba. "Adam: A Method for Stochastic Optimization." Computer ence (2014).

[14]

Goodfellow, Ian J, Shlens, Jonathon, and Szegedy, Christian. "Explaining and Harnessing Adversarial Examples." Computer ence (2014).

[15]

Park, M. "L1-regularization path algorithm for generalized linear models." Journal of the Royal Statistical Society 69.4(2007): 659--677.

[16]

Yu, Wei, et al. "Visualizing and Comparing Convolutional Neural Networks." Computer Science (2014).

Cited By

Elnabawy RAbdennadher SHellwich OEldawlatly S(2024)ClipArtGAN: An Application of Pix2Pix Generative Adversarial Network for Clip Art GenerationMultimedia Tools and Applications10.1007/s11042-024-20361-1Online publication date: 22-Oct-2024
https://doi.org/10.1007/s11042-024-20361-1
Rumpungan JOn CHijazi MKarim S(2023)Image Classification of Occluded and Non-Occluded Cats and Dogs Datasets with Machine Learning2023 IEEE Symposium on Computers & Informatics (ISCI)10.1109/ISCI58771.2023.10391871(19-24)Online publication date: 14-Oct-2023
https://doi.org/10.1109/ISCI58771.2023.10391871
Song WWang ZGao YHe X(2023)SRViT: A Vision Transformer via Super-Resolution2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528557(1387-1394)Online publication date: 10-Nov-2023
https://doi.org/10.1109/ACAIT60137.2023.10528557
Show More Cited By

Index Terms

Dog and Cat Classification with Deep Residual Network
1. Applied computing
  1. Computer forensics
    1. Data recovery

Recommendations

A dyadic multi-resolution deep convolutional neural wavelet network for image classification

For almost the past four decades, image classification has gained a lot of attention in the field of pattern recognition due to its application in various fields. Given its importance, several approaches have been proposed up to now. In this paper, we ...
Adder Wavelet for Better Image Classification under Adder Neural Network
CSAI '22: Proceedings of the 2022 6th International Conference on Computer Science and Artificial Intelligence

Adder neural network (AdderNet) is a new kind of deep learning model in which the original massive multiplications in convolutions are replaced by additions. The overall energy consumption using adder network is reduced significantly. However, there is ...
Richer feature for image classification with super and sub kernels based on deep convolutional neural network

The concepts of super convolutional kernel and sub convolutional kernel are proposed inspired by the process that human observes objects.A novel Parallel Crossing Deep Convolutional Neural Network (PCDCNN) model is designed to simulate the human ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

ESSE '20: Proceedings of the 2020 European Symposium on Software Engineering

November 2020

220 pages

ISBN:9781450377621

DOI:10.1145/3393822

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

UNIBO: University of Bologna

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 December 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ESSE 2020

ESSE 2020: 2020 European Symposium on Software Engineering

November 6 - 8, 2020

Rome, Italy

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
195
Total Downloads

Downloads (Last 12 months)56
Downloads (Last 6 weeks)13

Reflects downloads up to 20 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Elnabawy RAbdennadher SHellwich OEldawlatly S(2024)ClipArtGAN: An Application of Pix2Pix Generative Adversarial Network for Clip Art GenerationMultimedia Tools and Applications10.1007/s11042-024-20361-1Online publication date: 22-Oct-2024
https://doi.org/10.1007/s11042-024-20361-1
Rumpungan JOn CHijazi MKarim S(2023)Image Classification of Occluded and Non-Occluded Cats and Dogs Datasets with Machine Learning2023 IEEE Symposium on Computers & Informatics (ISCI)10.1109/ISCI58771.2023.10391871(19-24)Online publication date: 14-Oct-2023
https://doi.org/10.1109/ISCI58771.2023.10391871
Song WWang ZGao YHe X(2023)SRViT: A Vision Transformer via Super-Resolution2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528557(1387-1394)Online publication date: 10-Nov-2023
https://doi.org/10.1109/ACAIT60137.2023.10528557
Elnabawy RAbdennadher SHellwich OEldawlatly S(2022)PVGAN: a generative adversarial network for object simplification in prosthetic visionJournal of Neural Engineering10.1088/1741-2552/ac8acf19:5(056007)Online publication date: 7-Sep-2022
https://doi.org/10.1088/1741-2552/ac8acf

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents