Summary:
Adds config options for setting the float8 scaling type of input, weight,
and grad_output.
Performance is not ideal yet because this path has not been optimized.
Test Plan:
```
# repeat for input, weight, and grad_output
with-proxy CONFIG_FILE="./train_configs/llama3_8b.toml" ./run_llama_train.sh --training.enable_float8_linear --training.float8_scaling_type_weight delayed --training.compile
```
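The test plan asks for the run to be repeated for each of the three tensors. A minimal sketch that generates those three command lines is below. Only the `--training.float8_scaling_type_weight` flag appears in this message; the `input` and `grad_output` flag names are assumed to follow the same pattern, and `build_command` is a hypothetical helper, not part of the repository.

```python
# Tensor names used as flag suffixes. "weight" comes from the test plan;
# "input" and "grad_output" are ASSUMED to follow the same naming pattern.
TENSORS = ["input", "weight", "grad_output"]


def build_command(tensor: str, scaling_type: str = "delayed") -> str:
    """Return the test-plan command line for one tensor's scaling type."""
    parts = [
        "with-proxy",
        'CONFIG_FILE="./train_configs/llama3_8b.toml"',
        "./run_llama_train.sh",
        "--training.enable_float8_linear",
        f"--training.float8_scaling_type_{tensor}",
        scaling_type,
        "--training.compile",
    ]
    return " ".join(parts)


# One command per tensor, printed so they can be copied into a shell.
commands = [build_command(t) for t in TENSORS]
for cmd in commands:
    print(cmd)
```

The entry generated for `weight` is identical to the command in the test plan, so the other two differ only in the flag suffix.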
Reviewers:
Subscribers:
Tasks:
Tags: