
WoosukKwon
Collaborator

No description provided.

@WoosukKwon WoosukKwon merged commit 0f4b321 into main Apr 15, 2023
@WoosukKwon WoosukKwon deleted the block-ablation branch April 15, 2023 16:03
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
tianyil1 pushed a commit to tianyil1/vllm that referenced this pull request Jun 5, 2024
fxmarty pushed a commit to fxmarty/vllm-public that referenced this pull request Jun 12, 2024
…pes; Using attn_fwd triton kernel from ROCm/triton main_perf that does not cause triton compiler to hang (vllm-project#38)
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
heheda12345 pushed a commit to heheda12345/vllm that referenced this pull request Sep 29, 2025
* Add indexer_k_quant_and_cache_kernel

Signed-off-by: Barry Kang <[email protected]>

* Accept 3D kv_cache buffer

Signed-off-by: Barry Kang <[email protected]>

* Address review comments

Signed-off-by: Barry Kang <[email protected]>

---------

Signed-off-by: Barry Kang <[email protected]>
