2507.16695

Total: 1

#1 Interpretable Topic Extraction and Word Embedding Learning using row-stochastic DEDICOM [PDF] [Copy] [Kimi3] [REL]

Authors: Lars Hillebrand, David Biesner, Christian Bauckhage, Rafet Sifa

The DEDICOM algorithm provides a uniquely interpretable matrix factorization method for symmetric and asymmetric square matrices. We employ a new row-stochastic variation of DEDICOM on the pointwise mutual information matrices of text corpora to identify latent topic clusters within the vocabulary and simultaneously learn interpretable word embeddings. We introduce a method to efficiently train a constrained DEDICOM algorithm and a qualitative evaluation of its topic modeling and word embedding performance.

Subjects: Computation and Language , Artificial Intelligence , Machine Learning

Publish: 2025-07-22 15:30:32 UTC