@msaroufim msaroufim commented Jul 16, 2024

Revert this change after #504 lands

pytorch-bot bot commented Jul 16, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/505

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit ca92406 with merge base 1029df3:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) Jul 16, 2024
@msaroufim msaroufim changed the title from "pin nightly to 2.5.0.dev20240711+cu121" to "pin nightly to 2.5.0.dev20240709+cu121" Jul 16, 2024
@msaroufim msaroufim merged commit cc871c5 into main Jul 16, 2024
@msaroufim msaroufim deleted the msaroufim/ci branch July 16, 2024 03:31
jerryzh168 added a commit that referenced this pull request Jul 16, 2024
jerryzh168 added a commit that referenced this pull request Jul 17, 2024
jerryzh168 added a commit to jerryzh168/ao that referenced this pull request Jul 17, 2024
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
Summary:
Without this optimization, llama3 on an S22 runs at around 4 tok/sec. With the
fix, it runs at more than 7 tok/sec.

Test Plan:
./runner/build_android.sh
python3 torchchat.py download llama3
python3 torchchat.py export llama3 --output-pte-path llama3.pte --quantize config/data/mobile.json
adb push llama3.pte /data/local/tmp/
adb push tokenizer.model /data/local/tmp/
adb shell "cd /data/local/tmp/ && ./et_run llama3.pte -z tokenizer.model -t 0 -i 'Once upon' -n 124"

Reviewers:

Subscribers:

Tasks:

Tags: