Speaker
Description
The choice of the step size (or learning rate) in stochastic optimization algorithms, such as stochastic gradient descent, plays a central role in the training of machine learning models.
Both theoretical investigations and empirical analyses emphasize that an optimal step size must account not only for the nonlinearity of the underlying problem but also for the local variance of the search directions. In this presentation, we introduce a novel method that estimates these fundamental quantities and uses the estimates to derive an adaptive step size for stochastic gradient descent. The proposed approach yields a nearly hyperparameter-free variant of stochastic gradient descent with provable convergence guarantees.
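To make the general idea concrete, the following is a minimal, hypothetical sketch of an SGD loop whose step size is adapted from running estimates of the gradient variance and a secant-type curvature ratio. The function names, moment estimates, and the specific update rule are illustrative assumptions, not the method presented in the talk.

```python
import numpy as np


def adaptive_sgd(grad_fn, x0, n_steps=1000, beta=0.9, eps=1e-12):
    """Illustrative SGD with a step size adapted from running estimates of the
    gradient variance and a crude curvature ratio. This is a generic sketch,
    not the speaker's algorithm; grad_fn(x) returns a stochastic gradient."""
    x = np.asarray(x0, dtype=float)
    g_mean = np.zeros_like(x)   # running mean of stochastic gradients
    g_sq = np.zeros_like(x)     # running mean of squared gradients
    prev_x, prev_g = None, None
    eta = 1e-3                  # initial step size (assumed default)

    for _ in range(n_steps):
        g = grad_fn(x)

        # Running first and second moments give a local variance estimate.
        g_mean = beta * g_mean + (1 - beta) * g
        g_sq = beta * g_sq + (1 - beta) * g * g
        var = np.maximum(g_sq - g_mean ** 2, 0.0)

        if prev_x is not None:
            # Secant-type (Barzilai-Borwein-style) curvature estimate along
            # the trajectory.
            s = x - prev_x
            y = g - prev_g
            s_ty = float(np.dot(s, y))
            s_ts = float(np.dot(s, s))
            curv = abs(s_ty) / (s_ts + eps) if s_ts > eps else 1.0

            # Variance-aware damping: large noise relative to the mean
            # gradient shrinks the step.
            noise_ratio = float(np.sum(var)) / (float(np.sum(g_mean ** 2)) + eps)
            eta = 1.0 / (curv + eps) / (1.0 + noise_ratio)

        prev_x, prev_g = x.copy(), g.copy()
        x = x - eta * g

    return x


if __name__ == "__main__":
    # Toy usage: a noisy quadratic objective 0.5 * x^T A x.
    rng = np.random.default_rng(0)
    A = np.diag([1.0, 10.0])

    def noisy_grad(x):
        return A @ x + 0.1 * rng.standard_normal(2)

    print(adaptive_sgd(noisy_grad, np.array([5.0, -3.0])))
```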
We provide a convergence analysis for both the ideal and the approximated step sizes.
In addition, we perform numerical experiments on classical image classification tasks. Remarkably, our algorithm exhibits truly problem-adaptive behavior on these problems, even though they lie beyond the scope of our theoretical assumptions. Moreover, our framework allows the incorporation of a preconditioner, enabling adaptive step sizes for stochastic second-order optimization methods.
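As a rough illustration of how a preconditioner could enter such an update, the hypothetical snippet below scales the stochastic gradient by a diagonal preconditioner built from a running mean of squared gradients; the function and its arguments are assumptions for illustration, not part of the presented framework.

```python
import numpy as np


def preconditioned_step(x, g, g_sq, eta, eps=1e-12):
    """Illustrative preconditioned update: scale the stochastic gradient g by
    a diagonal inverse preconditioner derived from a running mean of squared
    gradients g_sq. A generic sketch, not the speaker's method."""
    p_inv = 1.0 / (np.sqrt(g_sq) + eps)  # diagonal inverse preconditioner
    return x - eta * p_inv * g
```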