Skip to content

What accuracy should we expect when training Alexnet from scratch on ImageNet? #987

Open
@yoderj

Description

@yoderj

📚 Documentation

The README https://github.com/pytorch/examples/blob/main/imagenet/README.md is very helpful when getting started with training AlexNet.

We are able to successfully train AlexNet to approximately 56% top-1 and 79% top-5 accuracy on the validation set. But this is still a fair bit below Krizhevsky's published results of circa 83% or 85% top-5 accuracy on these training sets.

We are training with the default recommendations for a single GPU in the README for AlexNet:

python main.py -a alexnet --lr 0.01 --gpu 0 /data/datasets/imagenet/

What out-of the box accuracy should we expect when training AlexNet on ImageNet with the default PyTorch implementation?

What sort of hyperparameter changes do you recommend to duplicate Alex Krizhevsky's accuracies?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions