File tree
38 files changed, +863 -476 lines changed
- .github/workflows
- test/datasets
- torchtitan
  - datasets
    - tokenizer
  - models
    - llama
  - parallelisms
- train_configs
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 39 | 39 | |
| 40 | 40 | |
| 41 | 41 | |
| | 42 | + |
| 42 | 43 | |
| 43 | 44 | |
Lines changed: 9 additions & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 18 | 18 | |
| 19 | 19 | |
| 20 | 20 | |
| | 21 | + |
| | 22 | + |
| | 23 | + |
| | 24 | + |
| | 25 | + |
| | 26 | + |
| | 27 | + |
| | 28 | + |
| 21 | 29 | |
| 22 | 30 | |
| 23 | 31 | |
| ... | ... | |
| 66 | 74 | |
| 67 | 75 | |
| 68 | 76 | |
| 69 | | - |
| | 77 | + |
| 70 | 78 | |
| 71 | 79 | |
| 72 | 80 | |
Lines changed: 0 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 18 | 18 | |
| 19 | 19 | |
| 20 | 20 | |
| 21 | | - |
| 22 | | - |
| 23 | 21 | |
| 24 | 22 | |
| 25 | 23 | |
Lines changed: 27 additions & 20 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 9 | 9 | |
| 10 | 10 | |
| 11 | 11 | |
| 12 | | - |
| 13 | 12 | |
| 14 | 13 | |
| 15 | | - |
| 16 | 14 | |
| 17 | | - |
| 18 | 15 | |
| 19 | 16 | |
| 20 | 17 | |
| 21 | | - |
| 22 | | - |
| 23 | | - |
| 24 | | - |
| | 18 | + |
| | 19 | + |
| | 20 | + |
| 25 | 21 | |
| | 22 | + |
| 26 | 23 | |
| 27 | | - |
| | 24 | + |
| 28 | 25 | |
| 29 | 26 | |
| 30 | 27 | |
| ... | ... | |
| 61 | 58 | |
| 62 | 59 | |
| 63 | 60 | |
| 64 | | - |
| | 61 | + |
| 65 | 62 | |
| 66 | 63 | |
| | 64 | + |
| 67 | 65 | |
| 68 | 66 | |
| 69 | 67 | |
| 70 | 68 | |
| 71 | 69 | |
| 72 | 70 | |
| 73 | 71 | |
| | 72 | + |
| 74 | 73 | |
| 75 | 74 | |
| 76 | 75 | |
| ... | ... | |
| 93 | 92 | |
| 94 | 93 | |
| 95 | 94 | |
| 96 | | - |
| | 95 | + |
| 97 | 96 | |
| 98 | | - |
| 99 | | - |
| 100 | | - |
| | 97 | + |
| | 98 | + |
| | 99 | + |
| 101 | 100 | |
| 102 | 101 | |
| 103 | 102 | |
| 104 | 103 | |
| 105 | | - |
| | 104 | + |
| | 105 | + |
| | 106 | + |
| 106 | 107 | |
| 107 | 108 | |
| 108 | 109 | |
| ... | ... | |
| 123 | 124 | |
| 124 | 125 | |
| 125 | 126 | |
| 126 | | - |
| 127 | | - |
| 128 | | - |
| | 127 | + |
| | 128 | + |
| | 129 | + |
| | 130 | + |
| 129 | 131 | |
| 130 | 132 | |
| 131 | 133 | |
| ... | ... | |
| 143 | 145 | |
| 144 | 146 | |
| 145 | 147 | |
| 146 | | - |
| | 148 | + |
| 147 | 149 | |
| 148 | 150 | |
| 149 | 151 | |
| ... | ... | |
| 170 | 172 | |
| 171 | 173 | |
| 172 | 174 | |
| 173 | | - |
| | 175 | + |
| 174 | 176 | |
| 175 | 177 | |
| 176 | 178 | |
| ... | ... | |
| 181 | 183 | |
| 182 | 184 | |
| 183 | 185 | |
| | 186 | + |
| | 187 | + |
| 184 | 188 | |
| 185 | 189 | |
| 186 | 190 | |
| | 191 | + |
| | 192 | + |
| | 193 | + |
| 187 | 194 | |
| 188 | 195 | |
| 189 | 196 | |
| ... | ... | |
| 217 | 224 | |
| 218 | 225 | |
| 219 | 226 | |
| 220 | | - |
| | 227 | + |
Lines changed: 0 additions & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 53 | 53 | |
| 54 | 54 | |
| 55 | 55 | |
| 56 | | - |
| 57 | 56 | |
| 58 | 57 | |
| 59 | 58 | |
Lines changed: 3 additions & 24 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 7 | 7 | |
| 8 | 8 | |
| 9 | 9 | |
| 10 | | - |
| 11 | | - |
| 12 | | - |
| 13 | | - |
| 14 | | - |
| 15 | 10 | |
| 16 | 11 | |
| 17 | 12 | |
| 18 | | - |
| 19 | 13 | |
| 20 | | - |
| 21 | | - |
| 22 | | - |
| 23 | 14 | |
| 24 | | - |
| 25 | | - |
| 26 | 15 | |
| 27 | 16 | |
| 28 | 17 | |
| 29 | 18 | |
| 30 | 19 | |
| 31 | 20 | |
| 32 | 21 | |
| 33 | | - |
| 34 | | - |
| 35 | | - |
| 36 | | - |
| 37 | | - |
| 38 | | - |
| 39 | | - |
| 40 | | - |
| 41 | | - |
| 42 | | - |
| 43 | | - |
| 44 | | - |
| 45 | | - |
| | 22 | + |
| | 23 | + |
| | 24 | + |
Lines changed: 26 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | 1 | + |
| | 2 | + |
| | 3 | + |
| | 4 | + |
| | 5 | + |
| | 6 | + |
| | 7 | + |
| | 8 | + |
| | 9 | + |
| | 10 | + |
| | 11 | + |
| | 12 | + |
| | 13 | + |
| | 14 | + |
| | 15 | + |
| | 16 | + |
| | 17 | + |
| | 18 | + |
| | 19 | + |
| | 20 | + |
| | 21 | + |
| | 22 | + |
| | 23 | + |
| | 24 | + |
| | 25 | + |
| | 26 | + |
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 6 | 6 | |
| 7 | 7 | |
| 8 | 8 | |
| 9 | | - |
| | 9 | + |
| 10 | 10 | |
| 11 | 11 | |
| 12 | 12 | |
| ... | ... | |
| 42 | 42 | |
| 43 | 43 | |
| 44 | 44 | |
| 45 | | - |
| | 45 | + |
| 46 | 46 | |
| 47 | 47 | |
| 48 | 48 | |
Lines changed: 27 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 46 | 46 | |
| 47 | 47 | |
| 48 | 48 | |
| | 49 | + |
| | 50 | + |
| | 51 | + |
| | 52 | + |
| | 53 | + |
| | 54 | + |
| | 55 | + |
| | 56 | + |
| | 57 | + |
| | 58 | + |
| | 59 | + |
| | 60 | + |
| | 61 | + |
| | 62 | + |
| | 63 | + |
| 49 | 64 | |
| 50 | 65 | |
| 51 | 66 | |
| ... | ... | |
| 284 | 299 | |
| 285 | 300 | |
| 286 | 301 | |
| | 302 | + |
| | 303 | + |
| | 304 | + |
| | 305 | + |
| | 306 | + |
| | 307 | + |
| | 308 | + |
| | 309 | + |
| | 310 | + |
| | 311 | + |
| 287 | 312 | |
| 288 | 313 | |
| 289 | 314 | |
| ... | ... | |
| 315 | 340 | |
| 316 | 341 | |
| 317 | 342 | |
| | 343 | + |
| | 344 | + |
| 318 | 345 | |
| 319 | 346 | |
| 320 | 347 | |
Lines changed: 42 additions & 3 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 10 | 10 | |
| 11 | 11 | |
| 12 | 12 | |
| | 13 | + |
| | 14 | + |
| 13 | 15 | |
| 14 | 16 | |
| 15 | 17 | |
| ... | ... | |
| 27 | 29 | |
| 28 | 30 | |
| 29 | 31 | |
| 30 | | - |
| | 32 | + |
| 31 | 33 | |
| 32 | 34 | |
| 33 | 35 | |
| ... | ... | |
| 41 | 43 | |
| 42 | 44 | |
| 43 | 45 | |
| | 46 | + |
| | 47 | + |
| | 48 | + |
| | 49 | + |
| | 50 | + |
| | 51 | + |
| | 52 | + |
| | 53 | + |
| | 54 | + |
| | 55 | + |
| | 56 | + |
| | 57 | + |
| | 58 | + |
| | 59 | + |
| | 60 | + |
| | 61 | + |
| | 62 | + |
| | 63 | + |
| | 64 | + |
| | 65 | + |
| | 66 | + |
| | 67 | + |
| | 68 | + |
| | 69 | + |
| | 70 | + |
| | 71 | + |
| | 72 | + |
| | 73 | + |
| | 74 | + |
| | 75 | + |
| | 76 | + |
| | 77 | + |
| | 78 | + |
| | 79 | + |
| | 80 | + |
| | 81 | + |
| | 82 | + |
| 44 | 83 | |
| 45 | 84 | |
| 46 | 85 | |
| ... | ... | |
| 124 | 163 | |
| 125 | 164 | |
| 126 | 165 | |
| | 166 | + |
| 127 | 167 | |
| 128 | 168 | |
| 129 | 169 | |
| 130 | | - |
| 131 | 170 | |
| 132 | 171 | |
| 133 | 172 | |
| ... | ... | |
| 390 | 429 | |
| 391 | 430 | |
| 392 | 431 | |
| 393 | | - |
| | 432 | + |
| 394 | 433 | |
| 395 | 434 | |
| 396 | 435 | |