Skip to content

Commit ae228fe

Browse files
authored
Add default training parameters to classification refrence README (#1998)
* add default parameters to README * fix vgg_*_bn
1 parent e75b497 commit ae228fe

File tree

1 file changed

+25
-1
lines changed

1 file changed

+25
-1
lines changed

references/classification/README.md

Lines changed: 25 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,31 @@ This folder contains reference training scripts for image classification.
44
They serve as a log of how to train specific models, as provide baseline
55
training and evaluation scripts to quickly bootstrap research.
66

7-
Except otherwise noted, all models have been trained on 8x V100 GPUs.
7+
Except otherwise noted, all models have been trained on 8x V100 GPUs with
8+
the following parameters:
9+
10+
| Parameter | value |
11+
| ------------------------ | ------ |
12+
| `--batch_size` | `32` |
13+
| `--epochs` | `90` |
14+
| `--lr` | `0.1` |
15+
| `--momentum` | `0.9` |
16+
| `--wd`, `--weight-decay` | `1e-4` |
17+
| `--lr-step-size` | `30` |
18+
| `--lr-gamma` | `0.1` |
19+
20+
### AlexNet and VGG
21+
22+
Since `AlexNet` and the original `VGG` architectures do not include batch
23+
normalization, the default initial learning rate `--lr 0.1` is to high.
24+
25+
```
26+
python main.py --model $MODEL --lr 1e-2
27+
```
28+
29+
Here `$MODEL` is one of `alexnet`, `vgg11`, `vgg13`, `vgg16` or `vgg19`. Note
30+
that `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, and `vgg19_bn` include batch
31+
normalization and thus are trained with the default parameters.
832

933
### ResNext-50 32x4d
1034
```

0 commit comments

Comments
 (0)