Implementation of the OpenAI paper "An Empirical Model of Large-Batch Training" for Fastai V2.
The code is based on the batch size finder implementation for Fastai V1 by DanyWind (repo V1 / blog / discussion).
This implementation differs in the following ways:
- By default, it implements the method exactly as described in the original article rather than an approximation (see the formulas below).
- It fixes a couple of bugs in the noise and scale values. However, these bugs did not affect the Simple Noise Scale value.
However, you can still use the DanyWind approximation by setting simulate_multi_gpus to False. The DanyWind approximation is faster but numerically less stable, and it finds a smaller Simple Noise Scale than the original one.
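For context, here is a brief sketch of what "exactly the original article" refers to. The paper estimates the Simple Noise Scale from gradient norms measured at a small batch size B_small and a large batch size B_big; as I read the paper's appendix, the estimators are:

```latex
% Estimators from "An Empirical Model of Large-Batch Training" (Appendix A),
% computed from gradients at two batch sizes B_small < B_big:
%   |G|^2 : unbiased estimate of the squared norm of the true gradient
%   S     : unbiased estimate of the trace of the per-example gradient covariance
\[
|\mathcal{G}|^2 \approx \frac{1}{B_{\text{big}} - B_{\text{small}}}
  \left( B_{\text{big}}\, |G_{B_{\text{big}}}|^2 - B_{\text{small}}\, |G_{B_{\text{small}}}|^2 \right)
\]
\[
\mathcal{S} \approx \frac{1}{1/B_{\text{small}} - 1/B_{\text{big}}}
  \left( |G_{B_{\text{small}}}|^2 - |G_{B_{\text{big}}}|^2 \right)
\]
% Simple Noise Scale reported by the finder:
\[
B_{\text{simple}} = \frac{\mathcal{S}}{|\mathcal{G}|^2}
\]
```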
It is tested with fastai 2.1 and should work with fastai>=2.0.
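A minimal usage sketch follows. The data and learner setup use standard fastai calls; the `bs_find` method name and its `lr` argument are assumptions borrowed from the fastai V1 implementation by DanyWind, while `simulate_multi_gpus` is the parameter described above.

```python
from fastai.vision.all import *

# Standard fastai setup on a small sample dataset.
path = untar_data(URLs.MNIST_SAMPLE)
dls = ImageDataLoaders.from_folder(path)
learn = cnn_learner(dls, resnet18, metrics=accuracy)

# Hypothetical call: `bs_find` is assumed to be patched onto Learner by this
# implementation (name borrowed from the fastai V1 batch size finder).
# simulate_multi_gpus=True (the default) follows the original article exactly;
# set it to False for the faster but less stable DanyWind approximation.
learn.bs_find(lr=1e-3, simulate_multi_gpus=True)
```

The finder is expected to report a Simple Noise Scale that can be used, following the paper, as a guide to an efficient batch size.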
TODO:
- Port the description improvements from the fastai2 PR.