Diffuse or Confuse: A Diffusion Deepfake Speech Dataset

2410.06796

Total: 1

#1 Diffuse or Confuse: A Diffusion Deepfake Speech Dataset [PDF] [Copy] [Kimi] [REL]

Authors: Anton Firc, Kamil Malinka, Petr Hanáček

Advancements in artificial intelligence and machine learning have significantly improved synthetic speech generation. This paper explores diffusion models, a novel method for creating realistic synthetic speech. We create a diffusion dataset using available tools and pretrained models. Additionally, this study assesses the quality of diffusion-generated deepfakes versus non-diffusion ones and their potential threat to current deepfake detection systems. Findings indicate that the detection of diffusion-based deepfakes is generally comparable to non-diffusion deepfakes, with some variability based on detector architecture. Re-vocoding with diffusion vocoders shows minimal impact, and the overall speech quality is comparable to non-diffusion methods.

Subjects: Cryptography and Security , Artificial Intelligence , Machine Learning , Sound

Publish: 2024-10-09 11:51:08 UTC