A major challenge in DeepFake forgery detection is that state-of-the-art algorithms are mostly trained to detect a specific fake method. As a result, these approaches show poor generalization across different types of facial manipulations, e.g., from face swapping to facial reenactment. To this end, we introduce ID-Reveal, a new approach that learns temporal facial features, specific of how a person moves while talking, by means of metric learning coupled with an adversarial training strategy. The advantage is that we do not need any training data of fakes, but only train on real videos. Moreover, we utilize high-level semantic features, which enables robustness to widespread and disruptive forms of post-processing. We perform a thorough experimental analysis on several publicly available benchmarks. Compared to state of the art, our method improves generalization and is more robust to low-quality videos, that are usually spread over social networks. In particular, we obtain an average improvement of more than 15% in terms of accuracy for facial reenactment on high compressed videos.

ID-Reveal: Identity-aware deepfake video detection / Cozzolino, Davide; Rössler, Andreas; Thies, Justus; Nießner, Matthias; Verdoliva, Luisa. - (2021). (Intervento presentato al convegno IEEE/CVF International Conference on Computer Vision (ICCV) nel 10-17 Ottobre) [10.1109/ICCV48922.2021.01483].

ID-Reveal: Identity-aware deepfake video detection

Davide Cozzolino;Luisa Verdoliva
2021

Abstract

A major challenge in DeepFake forgery detection is that state-of-the-art algorithms are mostly trained to detect a specific fake method. As a result, these approaches show poor generalization across different types of facial manipulations, e.g., from face swapping to facial reenactment. To this end, we introduce ID-Reveal, a new approach that learns temporal facial features, specific of how a person moves while talking, by means of metric learning coupled with an adversarial training strategy. The advantage is that we do not need any training data of fakes, but only train on real videos. Moreover, we utilize high-level semantic features, which enables robustness to widespread and disruptive forms of post-processing. We perform a thorough experimental analysis on several publicly available benchmarks. Compared to state of the art, our method improves generalization and is more robust to low-quality videos, that are usually spread over social networks. In particular, we obtain an average improvement of more than 15% in terms of accuracy for facial reenactment on high compressed videos.
2021
ID-Reveal: Identity-aware deepfake video detection / Cozzolino, Davide; Rössler, Andreas; Thies, Justus; Nießner, Matthias; Verdoliva, Luisa. - (2021). (Intervento presentato al convegno IEEE/CVF International Conference on Computer Vision (ICCV) nel 10-17 Ottobre) [10.1109/ICCV48922.2021.01483].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/877759
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 35
social impact