Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.13829 (cs)

[Submitted on 28 Jun 2022]

Title: Cross-Forgery Analysis of Vision Transformers and CNNs for Deepfake Image Detection

Authors: Davide Alessandro Coccomini, Roberto Caldelli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato

Abstract: Deepfake generation techniques are evolving at a rapid pace, making it possible to create realistic manipulated images and videos and endangering the serenity of modern society. The continual emergence of new and varied techniques raises a further problem: deepfake detection models must be updated promptly so that they can identify manipulations produced by even the most recent methods. This is an extremely complex problem to solve, as training a model requires large amounts of data, which are difficult to obtain if the deepfake generation method is too recent. Moreover, continuously retraining a network would be unfeasible. In this paper, we ask whether, among the various deep learning techniques, there is one that can generalise the concept of deepfake to such an extent that it does not remain tied to the specific deepfake generation methods used in the training set. We compared a Vision Transformer with an EfficientNetV2 in a cross-forgery context based on the ForgeryNet dataset. From our experiments, it emerges that EfficientNetV2 has a greater tendency to specialize, often obtaining better results on the methods seen during training, while Vision Transformers exhibit a superior generalization ability that makes them more competent even on images generated with new methodologies.
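
As an illustration of what a cross-forgery comparison measures, the following minimal sketch scores a detector separately on forgery methods seen during training and on methods it never saw. It is not the authors' code: the function name `cross_forgery_accuracy`, the record layout, and the method names `method_A` to `method_C` are hypothetical, and the toy predictions are made up for demonstration only.

```python
from collections import defaultdict


def cross_forgery_accuracy(records, train_methods):
    """Split detection accuracy into seen and unseen forgery methods.

    records: iterable of (method, true_label, predicted_label) tuples
             (hypothetical layout chosen for this sketch).
    train_methods: set of forgery methods used to train the detector.
    Returns (per_method_accuracy, mean_seen_accuracy, mean_unseen_accuracy).
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for method, y_true, y_pred in records:
        total[method] += 1
        correct[method] += int(y_true == y_pred)

    per_method = {m: correct[m] / total[m] for m in total}

    def mean(values):
        # Return NaN when a group is empty instead of dividing by zero.
        return sum(values) / len(values) if values else float("nan")

    seen = [acc for m, acc in per_method.items() if m in train_methods]
    unseen = [acc for m, acc in per_method.items() if m not in train_methods]
    return per_method, mean(seen), mean(unseen)


# Toy predictions (made up): label 1 = fake, 0 = pristine.
toy_records = [
    ("method_A", 1, 1), ("method_A", 1, 1), ("method_A", 1, 0), ("method_A", 1, 1),
    ("method_B", 1, 1), ("method_B", 1, 0),
    ("method_C", 1, 0), ("method_C", 1, 0), ("method_C", 1, 1), ("method_C", 1, 1),
]
per_method, seen_acc, unseen_acc = cross_forgery_accuracy(toy_records, {"method_A"})
print(per_method, seen_acc, unseen_acc)
```

A model that specializes would show a large gap between the seen and unseen averages, while a model that generalizes would keep the two closer together.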

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.13829 [cs.CV]
	(or arXiv:2206.13829v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.13829
Related DOI:	https://doi.org/10.1145/3512732.3533582

Submission history

From: Davide Alessandro Coccomini
[v1] Tue, 28 Jun 2022 08:50:22 UTC (2,821 KB)
