references/classification
1 file changed: +25 -1 lines changed
@@ -4,7 +4,31 @@ This folder contains reference training scripts for image classification.
They serve as a log of how to train specific models, and provide baseline
training and evaluation scripts to quickly bootstrap research.
- Except otherwise noted, all models have been trained on 8x V100 GPUs.
+ Unless otherwise noted, all models have been trained on 8x V100 GPUs with
+ the following parameters:
+
+ | Parameter                | Value  |
+ | ------------------------ | ------ |
+ | `--batch_size`           | `32`   |
+ | `--epochs`               | `90`   |
+ | `--lr`                   | `0.1`  |
+ | `--momentum`             | `0.9`  |
+ | `--wd`, `--weight-decay` | `1e-4` |
+ | `--lr-step-size`         | `30`   |
+ | `--lr-gamma`             | `0.1`  |
+
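To make the interaction of `--lr`, `--lr-step-size`, `--lr-gamma`, and `--epochs` concrete, here is a minimal sketch of the resulting learning rate per epoch. It assumes these flags drive a plain step-decay schedule (multiply the rate by gamma every `step_size` epochs), which the table itself does not state. The function name `step_lr` is hypothetical and not part of the training script.

```python
def step_lr(epoch, base_lr=0.1, step_size=30, gamma=0.1):
    """Learning rate in effect during `epoch` (0-indexed).

    Assumption: a step-decay schedule that multiplies the rate by
    `gamma` once every `step_size` epochs. Defaults mirror the table.
    """
    return base_lr * gamma ** (epoch // step_size)


# Print the rate at the start and end of each 30-epoch stage of a 90-epoch run.
for epoch in (0, 29, 30, 59, 60, 89):
    print(f"epoch {epoch:2d}: lr = {step_lr(epoch):.4g}")
```

Under that assumption, a 90-epoch run would spend 30 epochs each at 0.1, 0.01, and 0.001.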
+ ### AlexNet and VGG
+
+ Since `AlexNet` and the original `VGG` architectures do not include batch
+ normalization, the default initial learning rate `--lr 0.1` is too high.
+
+ ```
+ python main.py --model $MODEL --lr 1e-2
+ ```
+
+ Here `$MODEL` is one of `alexnet`, `vgg11`, `vgg13`, `vgg16` or `vgg19`. Note
+ that `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, and `vgg19_bn` include batch
+ normalization and thus are trained with the default parameters.
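
The rule described above (lower the learning rate only for the variants without batch normalization) can be sketched as a small helper that builds the command line per model. The names `NO_BATCHNORM_MODELS` and `training_command` are hypothetical and introduced only for illustration; only `main.py`, `--model`, and `--lr 1e-2` come from the text.

```python
import shlex

# Models the text lists as lacking batch normalization.
NO_BATCHNORM_MODELS = {"alexnet", "vgg11", "vgg13", "vgg16", "vgg19"}


def training_command(model):
    """Return the argument list for training `model` (illustrative helper)."""
    cmd = ["python", "main.py", "--model", model]
    if model in NO_BATCHNORM_MODELS:
        # The default --lr 0.1 is too high without batch normalization.
        cmd += ["--lr", "1e-2"]
    return cmd


print(shlex.join(training_command("vgg16")))
print(shlex.join(training_command("vgg16_bn")))
```

The `_bn` variants fall through to the defaults, so no `--lr` override is added for them.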
### ResNext-50 32x4d
```