Refactor LinearActQuantizedTensor #542

jerryzh168 · 2024-07-25T22:00:08Z

Summary:

rename to LinearActivationQuantizedTensor
using implements util to implement torch function and torch dispatch overwrites
refactored tensor subclass dispatch related utility functions: _implements and added two dispatch helpers: _dispatch__torch_function__ and _dispatch__torch_dispatch__ that saves more lines of code from users

Test Plan:
CI

Reviewers:

Subscribers:

Tasks:

Tags:

pytorch-bot · 2024-07-25T22:00:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/542

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 097a61e with merge base c9f79be ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

tutorials/calibration_flow/static_quant.py

jerryzh168 · 2024-07-26T01:28:56Z

@jcaip please take a look again, the scope increased a bit to include a refactor for torch_function/torch_dispatch related utils functions

also @msaroufim @gau-nernst please take a look as well

gau-nernst

The new _dispatch__torch_function__ and _dispatch__torch_dispatch__ are very nice, reduce boilerplate. Apart from also making the changes to torchao/prototype/quant_llm/quant_llm.py, for the codes that I wrote, as long as the tests pass, they are good for me.

msaroufim

nice clean change, just double checking this won't cause any BC issues on tune side for their nf4 recipe?

Summary: * rename to LinearActivationQuantizedTensor * using `implements` util to implement torch function and torch dispatch overwrites Test Plan: CI Reviewers: Subscribers: Tasks: Tags:

jerryzh168 · 2024-07-26T16:24:56Z

nice clean change, just double checking this won't cause any BC issues on tune side for their nf4 recipe?

oh nf4 is not using these utils yet, I can do it in a separate PR

Summary: As a follow up of pytorch#542, we can simplify the code of nf4tensor by using the dispatch utils as well. Test Plan: python test/dtypes/test_nf4.py Reviewers: Subscribers: Tasks: Tags:

Summary: * rename to LinearActivationQuantizedTensor * using `implements` util to implement torch function and torch dispatch overwrites Test Plan: CI Reviewers: Subscribers: Tasks: Tags:

jerryzh168 requested review from andrewor14, msaroufim, jcaip and HDCharles July 25, 2024 22:00

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 25, 2024

jcaip approved these changes Jul 25, 2024

View reviewed changes

tutorials/calibration_flow/static_quant.py Outdated Show resolved Hide resolved

jerryzh168 force-pushed the refactor-linear-act branch 4 times, most recently from 5007567 to 5f300dd Compare July 26, 2024 01:25

jerryzh168 requested a review from gau-nernst July 26, 2024 01:29

gau-nernst approved these changes Jul 26, 2024

View reviewed changes

jerryzh168 force-pushed the refactor-linear-act branch from 5f300dd to 2ba9bfc Compare July 26, 2024 04:10

msaroufim approved these changes Jul 26, 2024

View reviewed changes

jerryzh168 force-pushed the refactor-linear-act branch 2 times, most recently from 70e760e to 1f849c2 Compare July 26, 2024 06:14

Refactor LinearActQuantizedTensor

097a61e

Summary: * rename to LinearActivationQuantizedTensor * using `implements` util to implement torch function and torch dispatch overwrites Test Plan: CI Reviewers: Subscribers: Tasks: Tags:

jerryzh168 force-pushed the refactor-linear-act branch from 1f849c2 to 097a61e Compare July 26, 2024 16:24

jerryzh168 merged commit afde175 into pytorch:main Jul 26, 2024
13 checks passed

jerryzh168 deleted the refactor-linear-act branch July 26, 2024 17:10

jerryzh168 mentioned this pull request Jul 26, 2024

Refactor NF4Tensor to use dispatch utils #543

Draft

gau-nernst mentioned this pull request Aug 5, 2024

Fix FP6-LLM API and add .to(device) op #595

Merged

jerryzh168 mentioned this pull request Aug 5, 2024

Fix FP6-LLM API and add .to(device) op #599

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor LinearActQuantizedTensor #542

Refactor LinearActQuantizedTensor #542

Uh oh!

jerryzh168 commented Jul 25, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 25, 2024 •

edited

Loading

Uh oh!

Uh oh!

jerryzh168 commented Jul 26, 2024 •

edited

Loading

Uh oh!

gau-nernst left a comment

Uh oh!

msaroufim left a comment

Uh oh!

jerryzh168 commented Jul 26, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Refactor LinearActQuantizedTensor #542

Refactor LinearActQuantizedTensor #542

Uh oh!

Conversation

jerryzh168 commented Jul 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/542

✅ No Failures

Uh oh!

Uh oh!

jerryzh168 commented Jul 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gau-nernst left a comment

Choose a reason for hiding this comment

Uh oh!

msaroufim left a comment

Choose a reason for hiding this comment

Uh oh!

jerryzh168 commented Jul 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jerryzh168 commented Jul 25, 2024 •

edited

Loading

pytorch-bot bot commented Jul 25, 2024 •

edited

Loading

jerryzh168 commented Jul 26, 2024 •

edited

Loading

jerryzh168 commented Jul 26, 2024 •

edited

Loading