2505.21173

Topological Deep Learning for Speech Data

Author: Zhiwang Yu

Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by the work of Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating the actions of orthogonal groups on kernels, we establish a fiber-bundle decomposition of matrix spaces, which enables new methods of filter generation. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals the potential of TDA in neural network optimization and opens new avenues for interdisciplinary research between mathematics and deep learning.

Subjects: Machine Learning, Computer Vision and Pattern Recognition, Sound, Audio and Speech Processing

Published: 2025-05-27 13:26:05 UTC