FlowFake: Liquid Networks for Audio Deepfake Detection

#1 FlowFake: Liquid Networks for Audio Deepfake Detection [PDF] [Copy] [Kimi] [REL]

Authors: Shivaay Dhondiyal, Divyansh Sharma, Dinesh Kumar Vishwakarma

Audio deepfakes generated by neural text-to-speech and voice-cloning systems threaten speaker verification and public discourse at scale. The core challenge is cross-dataset generalization: detectors trained on one synthesis pipeline collapse on unseen forgeries. We argue that this failure is primarily because of structural synthetic speech artifacts which are multi-timescale trajectory anomalies. Though every existing detector aggregates a fixed-window frame statistics, this misaligns the architecture with the signal. We propose FlowFake, a Liquid Time-Constant (LTC) architecture whose hidden state evolves via a learned ODE, with per-neuron adaptive time constants simultaneously resolving spectral (10ms) and prosodic (2s) cues. At only 34K parameters FlowFake achieves formal BIBO stability and O(dt^4) integration error. On a four-dataset cross domain benchmark (ASVspoof2019-LA, FakeOrReal, InTheWild, MLAAD), FlowFake reaches 75.29% on ASVspoof2019 trained only on FakeOrReal and 79.97% trained only on MLAAD. It outperforms RawGAT-ST and Whisper-DF on every evaluated pair and matching SSL Wav2vec2 (300x larger) at 0.01% of its parameter count. The source code is available on : https://github.com/GhostRider2023/FlowFake

Subjects: Sound , Artificial Intelligence

Publish: 2026-06-17 20:32:32 UTC

2606.19579

#1 FlowFake: Liquid Networks for Audio Deepfake Detection [PDF] [Copy] [Kimi] [REL]