popov20@interspeech_2020@ISCA

Fast and Lightweight On-Device TTS with Tacotron2 and LPCNet

Authors: Vadim Popov ; Stanislav Kamenev ; Mikhail Kudinov ; Sergey Repyevsky ; Tasnima Sadekova ; Vitalii Bushaev ; Vladimir Kryzhanovskiy ; Denis Parkhomenko

We present a fast and lightweight on-device text-to-speech system based on state-of-the-art methods of feature and speech generation, namely Tacotron2 and LPCNet. We show that modifying the basic pipeline, combined with hardware-specific optimizations and extensive use of parallelization, enables running a TTS service even on low-end devices with faster-than-real-time waveform generation. Moreover, the system preserves high speech quality, with no noticeable degradation in Mean Opinion Score compared to the non-optimized baseline. While the system is mostly aimed at low-to-mid-range hardware, we believe it can also be used in any CPU-based environment.