Open
Description
📚 Documentation
The README https://github.com/pytorch/examples/blob/main/imagenet/README.md is very helpful when getting started with training AlexNet.
We are able to successfully train AlexNet to approximately 56% top-1 and 79% top-5 accuracy on the validation set. But this is still a fair bit below Krizhevsky's published results of circa 83% or 85% top-5 accuracy on these training sets.
We are training with the default recommendations for a single GPU in the README for AlexNet:
python main.py -a alexnet --lr 0.01 --gpu 0 /data/datasets/imagenet/
What out-of the box accuracy should we expect when training AlexNet on ImageNet with the default PyTorch implementation?
What sort of hyperparameter changes do you recommend to duplicate Alex Krizhevsky's accuracies?