[Core] Use `CpuGpuBuffer` for block table tensors #24795

njhill · 2025-09-13T06:09:43Z

In the gpu model runner input batch.

Signed-off-by: Nick Hill <[email protected]>

gemini-code-assist

Code Review

This pull request refactors the BlockTable class to use the CpuGpuBuffer helper for managing tensors that exist on both CPU and GPU, specifically for block_table and slot_mapping. This change simplifies the code by centralizing the creation and management of these paired buffers, removing redundant manual handling of CPU, GPU, and NumPy tensor versions. The changes are well-contained within vllm/v1/worker/block_table.py, and the necessary adjustments in vllm/v1/worker/gpu_model_runner.py to adapt to the updated BlockTable API have been correctly applied. The refactoring improves code clarity and maintainability without introducing any apparent issues.

DarkLight1337

cc @WoosukKwon please confirm if this is ok to avoid conflicting with your refactoring efforts

WoosukKwon

My PR will rewrite the block table entirely, but this change looks good for now.

Signed-off-by: Nick Hill <[email protected]>

…pugpubuf

Signed-off-by: Nick Hill <[email protected]>

njhill · 2025-09-16T23:03:09Z

The failing test is also failing on main :( example: https://buildkite.com/vllm/ci/builds/30944#01995329-1ec7-4d17-abdf-0283fa8115f5

vllm-project/vllm#24795 and vllm-project/vllm#24615 and vllm-project/vllm#24078 --------- Signed-off-by: Agata Dobrzyniewicz <[email protected]>

vllm-project/vllm#24795 and vllm-project/vllm#24615 and vllm-project/vllm#24078 --------- Signed-off-by: Agata Dobrzyniewicz <[email protected]> Signed-off-by: slokesha <[email protected]>

Signed-off-by: Nick Hill <[email protected]>

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: charlifu <[email protected]>

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

Signed-off-by: Nick Hill <[email protected]>

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

[Core] Use CpuGpuBuffer for block table tensors

00b91f5

Signed-off-by: Nick Hill <[email protected]>

njhill requested review from WoosukKwon, alexm-redhat, comaniac, robertgshaw2-redhat and ywang96 as code owners September 13, 2025 06:09

mergify bot added the v1 label Sep 13, 2025

gemini-code-assist bot reviewed Sep 13, 2025

View reviewed changes

DarkLight1337 reviewed Sep 13, 2025

View reviewed changes

WoosukKwon approved these changes Sep 13, 2025

View reviewed changes

WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 13, 2025

njhill added 2 commits September 15, 2025 12:55

Merge remote-tracking branch 'origin/main' into btable-cpugpubuf

b460842

fix test

3d7bfb5

Signed-off-by: Nick Hill <[email protected]>

njhill enabled auto-merge (squash) September 15, 2025 20:11

auto-merge was automatically disabled September 15, 2025 21:28
Pull Request is not mergeable

njhill added 2 commits September 15, 2025 15:22

Merge remote-tracking branch 'refs/remotes/origin/main' into btable-c…

195e69c

…pugpubuf

more test fixes

3e90179

Signed-off-by: Nick Hill <[email protected]>

mergify bot added the tpu Related to Google TPUs label Sep 15, 2025

Merge remote-tracking branch 'origin/main' into btable-cpugpubuf

b2d4f74

simon-mo merged commit eeb135e into vllm-project:main Sep 17, 2025
38 of 40 checks passed

njhill deleted the btable-cpugpubuf branch September 17, 2025 02:23

adobrzyn mentioned this pull request Sep 17, 2025

CI fix vllm-project/vllm-gaudi#186

Merged

xuechendi pushed a commit to vllm-project/vllm-gaudi that referenced this pull request Sep 17, 2025

CI fix (#186)

a3dce5c

vllm-project/vllm#24795 and vllm-project/vllm#24615 and vllm-project/vllm#24078 --------- Signed-off-by: Agata Dobrzyniewicz <[email protected]>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Core] Use CpuGpuBuffer for block table tensors (vllm-project#24795)

7b1d4b4

Signed-off-by: Nick Hill <[email protected]>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[Core] Use CpuGpuBuffer for block table tensors (vllm-project#24795)

5e11eeb

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: charlifu <[email protected]>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Core] Use CpuGpuBuffer for block table tensors (vllm-project#24795)

dea89b4

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Core] Use CpuGpuBuffer for block table tensors (vllm-project#24795)

a565b7e

Signed-off-by: Nick Hill <[email protected]>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[Core] Use CpuGpuBuffer for block table tensors (vllm-project#24795)

a24b355

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Core] Use `CpuGpuBuffer` for block table tensors #24795

[Core] Use `CpuGpuBuffer` for block table tensors #24795

Uh oh!

njhill commented Sep 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

DarkLight1337 left a comment

Uh oh!

WoosukKwon left a comment

Uh oh!

njhill commented Sep 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[Core] Use CpuGpuBuffer for block table tensors #24795

[Core] Use CpuGpuBuffer for block table tensors #24795

Uh oh!

Conversation

njhill commented Sep 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

WoosukKwon left a comment

Choose a reason for hiding this comment

Uh oh!

njhill commented Sep 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Core] Use `CpuGpuBuffer` for block table tensors #24795

[Core] Use `CpuGpuBuffer` for block table tensors #24795