Batch Size

Gradient Descent

fastai book

image.png

image.png

Find the best learning rate

One Cycle Policy (fastai)

To find the best learning rate according to fastai and Jeremy Howard, you should use the learning rate finder, which is a key feature in the fastai library. Here's how it works:

  1. The learning rate finder runs your model for a few iterations, starting with a very small learning rate and gradually increasing it.
  2. It plots the loss against the learning rate on a log scale.
  3. You should select a learning rate that's somewhere in the middle of the sharpest downward slope on this plot, typically one magnitude lower than the point where the loss starts to increase again.