Hidden Markov models (HMMs) are valuable for their ability to provide exact and tractable inference. However, learning an HMM in an unsupervised manner involves a non-convex optimization problem that is plagued by poor local optima. Recent work on scaling up HMMs to perform competitively as language models has indicated that this challenge only increases with larger hidden state sizes. Several techniques to address this problem have been proposed, but they have not been evaluated comprehensively. This study provides a comprehensive empirical analysis of two recent strategies that use neural networks to enhance HMM optimization: neural reparameterization and neural initialization. We find that (1) these techniques work effectively for scaled HMM language modeling, (2) linear reparameterizations can be as effective as non-linear ones, and (3) the strategies are complementary.
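To make the contrast between linear and non-linear reparameterization concrete, the sketch below shows one plausible way to generate an HMM transition matrix from state embeddings rather than optimizing it as a free probability table. This is a hypothetical illustration, not the paper's implementation: the class name, embedding dimension, and MLP architecture are assumptions introduced here for clarity.

```python
# Hypothetical sketch of neural reparameterization for HMM transitions.
# Names, dimensions, and the MLP design are illustrative assumptions,
# not the exact setup used in the study.
import torch
import torch.nn as nn


class NeuralReparamHMMTransitions(nn.Module):
    """Produce a row-stochastic (K x K) transition matrix from state
    embeddings instead of optimizing it as a free parameter table."""

    def __init__(self, num_states: int, embed_dim: int = 64, nonlinear: bool = True):
        super().__init__()
        self.src = nn.Parameter(torch.randn(num_states, embed_dim))
        self.dst = nn.Parameter(torch.randn(num_states, embed_dim))
        if nonlinear:
            # Non-linear reparameterization: embeddings pass through an MLP.
            self.f = nn.Sequential(
                nn.Linear(embed_dim, embed_dim),
                nn.ReLU(),
                nn.Linear(embed_dim, embed_dim),
            )
        else:
            # Linear reparameterization: a single linear map over embeddings.
            self.f = nn.Linear(embed_dim, embed_dim)

    def forward(self) -> torch.Tensor:
        logits = self.f(self.src) @ self.dst.t()   # (K, K) unnormalized scores
        return torch.log_softmax(logits, dim=-1)   # each row is a log-distribution


# Usage: the log-transition matrix plugs into a standard forward algorithm;
# gradients flow back into the embeddings (and the MLP, if non-linear).
trans = NeuralReparamHMMTransitions(num_states=1024, nonlinear=False)
log_A = trans()  # shape (1024, 1024)
```

Under this kind of parameterization, the linear variant differs from the non-linear one only in whether an MLP transforms the embeddings before the dot product, which is the distinction behind finding (2) above.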