Towards an Optimal Control Perspective of ResNet Training

Püttschneider, Jens; Heilig, Simon; Fischer, Asja; Faulwasser, Timm

Files

TRR_WP-6.pdf (365.56 KB)

Date

2025

Abstract

We propose a training formulation for ResNets that mirrors an optimal control problem and applies to standard architectures and general loss functions. We suggest bridging the two fields by penalizing intermediate outputs of hidden states, which correspond to stage cost terms in optimal control. For standard ResNets, we obtain intermediate outputs by propagating the state through the subsequent skip connections and the output layer. We demonstrate that our training dynamics bias the weights of the unnecessary deeper residual layers to vanish. This indicates the potential for a theory-grounded layer pruning strategy.
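The idea of penalizing intermediate outputs can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the toy residual block `x + tanh(W @ x)`, the linear output layer `V`, the squared-error loss, the `stage_weight` factor, and all function names are assumptions made for illustration. The sketch also assumes identity skip connections, so that propagating a hidden state through the subsequent skip connections reduces to applying the output layer to that state directly.

```python
import numpy as np

def forward_states(x0, weights):
    """Return the hidden states x_0, ..., x_N of a toy ResNet.

    Assumed residual update: x_{k+1} = x_k + tanh(W_k @ x_k).
    """
    states = [x0]
    for W in weights:
        x = states[-1]
        states.append(x + np.tanh(W @ x))
    return states

def stage_cost_loss(x0, target, weights, V, stage_weight=0.1):
    """Terminal loss plus a weighted sum of losses on intermediate outputs.

    With identity skips (an assumption), the intermediate output of a
    hidden state x_k is V @ x_k, i.e. the output layer applied to x_k.
    """
    states = forward_states(x0, weights)

    def output_loss(x):
        # Squared error of the output layer applied to state x (assumed loss).
        return 0.5 * np.sum((V @ x - target) ** 2)

    # Stage costs over x_0, ..., x_{N-1}; terminal cost on x_N.
    stage = sum(output_loss(x) for x in states[:-1])
    terminal = output_loss(states[-1])
    return terminal + stage_weight * stage

# Usage: three residual layers with zero weights leave the state unchanged,
# so every intermediate output equals the final output.
x0 = np.array([1.0, 2.0])
target = np.zeros(2)
V = np.eye(2)
weights = [np.zeros((2, 2)) for _ in range(3)]
total = stage_cost_loss(x0, target, weights, V)
```

In this toy setting, a layer whose residual branch no longer reduces the output loss adds nothing to either the stage or the terminal term. That is consistent with the pruning intuition described in the abstract, although the sketch does not show the weights of deeper layers vanishing during training.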

Keywords

ResNets, optimal control, regularization, network depth

URI

http://hdl.handle.net/2003/43792
http://dx.doi.org/10.17877/DE290R-25566

Collections

Sonderforschungsbereich (SFB) Transregio 391
