Conversation

@madamczyk-intel
No description provided.

@kzawora-intel kzawora-intel merged commit 47c0c5b into HabanaAI:habana_main Jun 11, 2024
@madamczyk-intel madamczyk-intel deleted the g3_no_split branch June 11, 2024 12:05
adobrzyn pushed a commit that referenced this pull request Jun 25, 2024
tzielinski-habana added a commit that referenced this pull request Jun 27, 2024
kzawora-intel pushed a commit that referenced this pull request Jun 27, 2024
@kzawora-intel kzawora-intel added the habana Issues or PRs submitted by Habana Labs label Nov 8, 2024
michalkuligowski added a commit that referenced this pull request Jan 15, 2025
remove expert_max hard code (#47)
vLLM-Ext: Full enabling of ALiBi (#34)
Add version inference via setuptools-scm (#58)
Revert "vLLM-Ext: Full enabling of ALiBi (#34)" (#59)
Remove punica_hpu.py from vllm_hpu_extension (#66)
Removed previous (not-pipelined) pa implementation (#72)
Add flag to enable running softmax in fp32 (#71)
Update calibration readme link (#73)
allow lm_head quantization in calibration process (#65)
Pad to bmin if value is less (#67)
Update pyproject.toml (#75)

---------

Co-authored-by: Michał Kuligowski <[email protected]>
mfylcek added a commit that referenced this pull request Jan 21, 2025
yiliu30 pushed a commit that referenced this pull request Aug 8, 2025