Initial version of classification reference scripts #819

fmassa · 2019-03-27T14:00:25Z

This PR introduces the foundations for reference training/evaluation scripts for torchvision.

The idea is that all pre-trained models will have corresponding training scripts / command-line arguments, so that reproducing a trained model should be straightforward.

This is not the final version. I'll be merging this soon; once the segmentation and detection training/evaluation scripts are added, a lot of it will be refactored and moved into torchvision.

soumith

Reviewed the train script. Didn't review classification/utils.py.

soumith · 2019-03-27T14:30:18Z

references/classification/utils.py

+
+
+def setup_for_distributed(is_master):
+    """


The printing override seems fine, but the torch.save override seems pretty sketchy. Maybe consider having a utils.save_on_master that you call instead, rather than monkey-patching torch.save.

Sounds good.

This is a fundamental feature for distributed training, so it's better to get it right.
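
For illustration, the helper suggested in this thread could look roughly like the sketch below. This is not the code that landed in the PR. It makes two assumptions so that it runs without PyTorch: the process rank is read from the `RANK` environment variable (as exported by `torch.distributed.launch`), and a pickle-based `_pickle_save` (a hypothetical stand-in) replaces `torch.save`. Only the name `save_on_master` comes from the review comment; `get_rank` and `is_main_process` are illustrative names.

```python
import os
import pickle


def get_rank():
    """Return this process's rank, or 0 when no launcher set one."""
    # Assumption: the distributed launcher exports RANK for each process.
    return int(os.environ.get("RANK", "0"))


def is_main_process():
    """True only on the process that should write files."""
    return get_rank() == 0


def _pickle_save(obj, path):
    # Hypothetical stand-in for torch.save, so the sketch has no
    # PyTorch dependency.
    with open(path, "wb") as f:
        pickle.dump(obj, f)


def save_on_master(obj, path, save_fn=_pickle_save):
    """Save a checkpoint from the main process only.

    Returns True if the file was written, False if this process skipped it.
    Callers opt in explicitly, so no global function is monkey-patched.
    """
    if is_main_process():
        save_fn(obj, path)
        return True
    return False
```

With PyTorch installed, passing `save_fn=torch.save` would write a regular checkpoint, since `torch.save(obj, f)` accepts a file path as its second argument.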

codecov-io · 2019-03-27T17:41:12Z

Codecov Report

Merging #819 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #819   +/-   ##
=======================================
  Coverage   51.58%   51.58%           
=======================================
  Files          34       34           
  Lines        3342     3342           
  Branches      536      536           
=======================================
  Hits         1724     1724           
  Misses       1486     1486           
  Partials      132      132

fmassa added 6 commits March 26, 2019 05:54

Initial version of classification reference training script

d2c93ab

Updates

9557ffe

Minor updates

ab031fd

Expose a few more options

33c70a2

Load optimizer and lr_scheduler when resuming

0d61de3

Also log the learning rate

Evaluation-only and minor improvements

e384268

Identified a bug in the reporting of the results. They need to be reduced between all processes
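
To make the reporting bug in the last commit concrete, here is a small sketch of why per-process results need to be reduced before they are reported. It does not reproduce the PR's code: a plain list of `(num_correct, num_samples)` pairs stands in for the processes, whereas the real script would presumably combine the counts with a collective such as `torch.distributed.all_reduce`. The function names and numbers are made up for illustration.

```python
def local_accuracy(num_correct, num_samples):
    """Accuracy as seen by a single process on its own shard."""
    return num_correct / num_samples


def reduced_accuracy(per_process_counts):
    """Accuracy over the whole dataset.

    per_process_counts is a list of (num_correct, num_samples) pairs,
    one per process. Summing the counts first (instead of averaging
    per-process accuracies) stays correct when shards differ in size.
    """
    total_correct = sum(correct for correct, _ in per_process_counts)
    total_samples = sum(samples for _, samples in per_process_counts)
    return total_correct / total_samples


if __name__ == "__main__":
    # Hypothetical counts for two processes with unequal shards.
    counts = [(90, 100), (30, 50)]
    print("rank 0 alone:", local_accuracy(*counts[0]))
    print("all processes:", reduced_accuracy(counts))
```

Reporting only one process's number would overstate the result in this example, which is the kind of discrepancy the reduction is meant to remove.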

fmassa mentioned this pull request Mar 27, 2019

Initial version of segmentation reference scripts #820

Merged

soumith approved these changes Mar 27, 2019

soumith reviewed Mar 27, 2019

soumith mentioned this pull request Mar 27, 2019

Add logging to ImageNet training pytorch/examples#530

Closed

Address Soumith's comment

4c8c40d

fmassa added 2 commits March 28, 2019 06:18

Fix some approximations on the evaluation metric

9036426

Flake8

8d4628b

fmassa merged commit 27ff89f into pytorch:master Mar 28, 2019

fmassa deleted the classification-v0 branch March 28, 2019 13:37
