Publication detail

Deepfake Speech Detection: A Spectrogram Analysis

FIRC, A. MALINKA, K. HANÁČEK, P.

Original Title

Deepfake Speech Detection: A Spectrogram Analysis

Type

article in a collection out of WoS and Scopus

Language

English

Original Abstract

The current voice biometric systems have no natural mechanics to defend against deepfake spoofing attacks. Thus, supporting these systems with a deepfake detection solution is necessary. One of the latest approaches to deepfake speech detection is representing speech as a spectrogram and using it as an input for a deep neural network. This work thus analyzes the feasibility of different spectrograms for deepfake speech detection. We compare types of them regarding their performance, hardware requirements, and speed. We show the majority of the spectrograms are feasible for deepfake detection. However, there is no general, correct answer to selecting the best spectrogram. As we demonstrate, different spectrograms are suitable for different needs.

Keywords

Deepfake, Speech, Image-based, Deepfake Detection, Spectrogram

Authors

FIRC, A.; MALINKA, K.; HANÁČEK, P.

Released

8. 4. 2024

Pages count

9

BibTex

@inproceedings{BUT188028,
  author="Anton {Firc} and Kamil {Malinka} and Petr {Hanáček}",
  title="Deepfake Speech Detection: A Spectrogram Analysis",
  booktitle="SAC '24: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing",
  year="2024",
  pages="9"
}