uzFQpkEzOo@OpenReview

Total: 1

#1 Depth Separation with Multilayer Mean-Field Networks [PDF¹] [Copy] [Kimi] [REL]

Authors: Yunwei Ren, Mo Zhou, Rong Ge

Depth separation—why a deeper network is more powerful than a shallow one—has been a major problem in deep learning theory. Previous results often focus on representation power, for example, Safran et al. (2019) constructed a function that is easy to approximate using a 3-layer network but not approximable by any 2-layer network. In this paper, we show that this separation is in fact algorithmic: one can learn the function constructed by Safran et al. (2019) using an overparametrized network with polynomially many neurons efﬁciently. Our result relies on a new way of extending the mean-ﬁeld limit to multilayer networks, and a decomposition of loss that factors out the error introduced by the discretization of inﬁnite-width mean-ﬁeld networks.

Subject: ICLR.2023 - Notable-top-25%

uzFQpkEzOo@OpenReview

#1 Depth Separation with Multilayer Mean-Field Networks [PDF1] [Copy] [Kimi] [REL]

#1 Depth Separation with Multilayer Mean-Field Networks [PDF¹] [Copy] [Kimi] [REL]