Processing math: 100%

2504.03405

Total: 1

#1 On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent [PDF] [Copy] [Kimi] [REL]

Nonparametric regression with random design is considered. The $L_2$ error with integration with respect to the design measure is used as the error criterion. An over-parametrized deep neural network regression estimate with logistic activation function is defined, where all weights are learned by gradient descent. It is shown that the estimate achieves a nearly optimal rate of convergence in case that the regression function is $(p,C)$ --smooth.

Subject: Statistics Theory

Publish: 2025-04-04 12:28:54 UTC