Tutorials on optimizers for deep neural networks

amkatrutsa/dl-opt


Optimization methods for training neural networks

This repository collects tutorials based on my experience in training large neural networks. They cover the distinctive features of different optimizers, models, and regularization techniques, as well as different training setups. Everything related to convex deterministic optimization is excluded; the focus is on stochastic methods that address data-processing problems from different domains.

  1. Basic concepts: models, autograd, generalization, local minima and their features
  2. Ingredients of basic optimizers
  3. Key elements of models
  4. Federated learning
  5. Few-bit optimizers
  6. Privacy-aware optimizers
  7. From first-order stochastic methods to higher-order optimizers
  8. Parallelism in training large neural networks
  9. Meta optimizers
  10. Challenges and perspectives
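As a taste of the "ingredients of basic optimizers" topic above, here is a minimal sketch of SGD with heavy-ball momentum on a toy quadratic. The function `sgd_momentum`, the toy objective, and all hyperparameter values are illustrative assumptions, not code from these tutorials:

```python
import numpy as np

def sgd_momentum(grad, x0, lr=0.1, beta=0.9, n_steps=100, seed=0):
    """Minimal SGD with heavy-ball momentum (an illustrative sketch).

    grad: callable returning a stochastic gradient estimate at x.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    v = np.zeros_like(x)
    for _ in range(n_steps):
        g = grad(x, rng)
        v = beta * v + g   # accumulate an exponentially weighted velocity
        x = x - lr * v     # move against the velocity direction
    return x

# Toy problem: minimize f(x) = 0.5 * ||x||^2; the exact gradient is x,
# and we add Gaussian noise to mimic a stochastic mini-batch gradient.
def noisy_grad(x, rng):
    return x + 0.01 * rng.standard_normal(x.shape)

x_star = sgd_momentum(noisy_grad, x0=np.ones(3))
```

With these settings the iterates contract toward the minimizer at the origin, up to a small noise floor set by the gradient noise and the step size; the tutorials discuss how such ingredients (velocity, step size, noise) interact in real training runs.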