ELBO, regularized maximum likelihood, and their common one-sample approximation for training stochastic neural networks

#1 ELBO, regularized maximum likelihood, and their common one-sample approximation for training stochastic neural networks [PDF] [Copy] [Kimi] [REL]

Authors: Sina Däubener, Simon Damm, Asja Fischer

Monte Carlo approximations are central to the training of stochastic neural networks in general, and Bayesian neural networks (BNNs) in particular. We observe that the common one-sample approximation of the standard training objective can be viewed both as maximizing the Evidence Lower Bound (ELBO) and as maximizing a regularized log-likelihood of a compound distribution. This latter approach differs from the ELBO only in the order of the logarithm and expectation, and is theoretically grounded in PAC-Bayes theory. We argue theoretically and demonstrate empirically that training with the regularized maximum likelihood increases prediction variance, enhancing performance in misspecified settings, adversarial robustness, and strengthening out-of-distribution (OOD) detection. Our findings help reconcile previous contradictions in the literature by providing a detailed analysis of how training objectives and Monte Carlo sample sizes affect uncertainty quantification in stochastic neural networks.

Subject: UAI.2025 - Poster

j4yFTNrdrO@OpenReview

#1 ELBO, regularized maximum likelihood, and their common one-sample approximation for training stochastic neural networks [PDF] [Copy] [Kimi] [REL]