Processing math: 100%

2408.05366

Total: 1

#1 DeepSpeak Dataset v1.0 [PDF] [Copy] [Kimi¹] [REL]

Authors: Sarah Barrington, Matyas Bohacek, Hany Farid

We describe a large-scale dataset--{\em DeepSpeak}--of real and deepfake footage of people talking and gesturing in front of their webcams. The real videos in this first version of the dataset consist of $9$ hours of footage from $220$ diverse individuals. Constituting more than 25 hours of footage, the fake videos consist of a range of different state-of-the-art face-swap and lip-sync deepfakes with natural and AI-generated voices. We expect to release future versions of this dataset with different and updated deepfake technologies. This dataset is made freely available for research and non-commercial uses; requests for commercial use will be considered.

Subject: Computer Vision and Pattern Recognition

Publish: 2024-08-09 22:29:43 UTC

2408.05366

#1 DeepSpeak Dataset v1.0 [PDF] [Copy] [Kimi1] [REL]

#1 DeepSpeak Dataset v1.0 [PDF] [Copy] [Kimi¹] [REL]