Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark #2017

lisjin · 2025-04-04T16:38:46Z

We would like to merge recent changes from our open sourced library at https://github.com/facebookresearch/parq.

We have also benchmarked 4-bit Llama 3.2 1B fine-tuned for 25K steps on fineweb-edu using torchtune. We used PARQ's MaxUnifQuantizer and ProxHardQuant proximal mapping, which is equivalent to STE. Below are the relevant training config changes to the llama3_2/1B_full.yaml recipe.

batch_size: 8
epochs: 1
optimizer:
  _component_: torch.optim.AdamW
  lr: 4e-5
  weight_decay: 0.0
  betas: [0.9, 0.95]
  fused: True

lr_scheduler:
  _component_: torchtune.training.lr_schedulers.get_cosine_schedule_with_warmup
  num_warmup_steps: 2500

As shown in the table below, the resulting 4-bit model achieves well under 10% accuracy on most commonsense reasoning benchmarks relative to the pre-trained model.

Tasks	16-bit	4-bit	% diff
arc_challenge	0.3805	0.3575	-6.4
arc_easy	0.6309	0.6077	-3.7
hellaswag	0.6081	0.5423	-10.8
piqa	0.7410	0.7122	-3.9
winogrande	0.6022	0.5549	-7.8

pytorch-bot · 2025-04-04T16:38:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2017

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f4ee2d7 with merge base 6922733 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

andrewor14 · 2025-04-04T17:14:13Z

Hi @lisjin, thanks for the update! The results look great. As discussed offline, we usually only add submodules if other parts of torchao are using the submodules, which is not the case here. Do you mind making the latest changes to PARQ in the prototype version here in torchao instead?

lisjin · 2025-04-04T17:44:23Z

Do you mind making the latest changes to PARQ in the prototype version here in torchao instead?

Of course! I just removed the submodule and updated the prototype version instead.

…2017) Replace torchao.prototype.parq with facebookresearch/parq submodule

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 4, 2025

lisjin force-pushed the parq branch from 32b0899 to e69782c Compare April 4, 2025 17:24

Replace torchao.prototype.parq with facebookresearch/parq submodule

f4ee2d7

lisjin force-pushed the parq branch from e69782c to f4ee2d7 Compare April 4, 2025 17:36

lisjin changed the title ~~Replace torchao.prototype.parq with facebookresearch/parq submodule~~ Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark Apr 4, 2025

lisjin added the topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories) label Apr 4, 2025

lisjin requested a review from andrewor14 April 4, 2025 18:32

andrewor14 approved these changes Apr 4, 2025

View reviewed changes

lisjin merged commit 3bbf42a into pytorch:main Apr 4, 2025
18 of 19 checks passed

lisjin deleted the parq branch April 4, 2025 21:42

jainapurva pushed a commit that referenced this pull request Apr 8, 2025

Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark (#…

711d584

…2017) Replace torchao.prototype.parq with facebookresearch/parq submodule

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark #2017

Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark #2017

Uh oh!

lisjin commented Apr 4, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Apr 4, 2025 •

edited

Loading

Uh oh!

andrewor14 commented Apr 4, 2025

Uh oh!

lisjin commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark #2017

Update torchao.prototype.parq and add 4-bit Llama 3.2 1B benchmark #2017

Uh oh!

Conversation

lisjin commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2017

✅ No Failures

Uh oh!

andrewor14 commented Apr 4, 2025

Uh oh!

lisjin commented Apr 4, 2025

Uh oh!

Uh oh!

Uh oh!

lisjin commented Apr 4, 2025 •

edited

Loading

pytorch-bot bot commented Apr 4, 2025 •

edited

Loading