MeanSE: Efficient Generative Speech Enhancement with Mean Flows

2509.21214


Authors: Jiahe Wang, Hongyu Wang, Wei Wang, Lei Yang, Chenda Li, Wangyou Zhang, Lufen Tan, Yanmin Qian

Speech enhancement (SE) improves the quality of degraded speech, and generative models such as flow matching are gaining attention for their outstanding perceptual quality. However, flow-based models need a large number of function evaluations (NFE) to achieve stable and satisfactory performance, which leads to a high computational load and poor performance with a single function evaluation (1-NFE). In this paper, we propose MeanSE, an efficient generative speech enhancement model using mean flows, which models the average velocity field to achieve high-quality 1-NFE enhancement. Experimental results demonstrate that MeanSE significantly outperforms the flow matching baseline with a single NFE and exhibits substantially better out-of-domain generalization.
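
To illustrate why modeling an average velocity can make a single function evaluation sufficient, the following is a minimal toy sketch, not the MeanSE model itself. It assumes a hypothetical one-dimensional velocity field dz/dt = -z, for which the average velocity over an interval has a closed form; in an actual mean-flow model that quantity would be predicted by a trained network. All function names here are illustrative assumptions.

```python
import math

def velocity(z, t):
    # Hypothetical toy instantaneous velocity field: dz/dt = -z.
    return -z

def average_velocity(z_start, t_start, t_end):
    # Average of the toy velocity along the trajectory from t_start to t_end.
    # For dz/dt = -z the trajectory is z(t_end) = z_start * exp(-(t_end - t_start)),
    # so the average velocity is the total displacement divided by the elapsed time.
    dt = t_end - t_start
    return z_start * (math.exp(-dt) - 1.0) / dt

def euler_sample(z0, nfe):
    # Integrate the instantaneous velocity from t=0 to t=1 with `nfe` Euler steps.
    z, dt = z0, 1.0 / nfe
    for i in range(nfe):
        z = z + dt * velocity(z, i * dt)
    return z

def mean_flow_sample(z0):
    # One evaluation of the average velocity covers the whole interval [0, 1].
    return z0 + 1.0 * average_velocity(z0, 0.0, 1.0)

z0 = 2.0
exact = z0 * math.exp(-1.0)              # true endpoint of the toy trajectory
one_step_euler = euler_sample(z0, 1)     # single step on instantaneous velocity
many_step_euler = euler_sample(z0, 100)  # many steps on instantaneous velocity
one_step_mean = mean_flow_sample(z0)     # single step on average velocity

print(f"exact={exact:.4f}  euler(1)={one_step_euler:.4f}  "
      f"euler(100)={many_step_euler:.4f}  mean(1)={one_step_mean:.4f}")
```

In this toy setting the single Euler step lands far from the true endpoint, while the single average-velocity step reproduces it. That mirrors the trade-off the abstract describes, with the caveat that a learned average velocity is only approximate.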

Subject: Audio and Speech Processing

Publish: 2025-09-25 14:23:23 UTC
