[Scheduler] 视频理解支持chunk prefill #5107

yangjianfengo1 · 2025-11-18T07:44:24Z

Motivation

对于长视频来说token太多，所占激活显存太多，故推理时支持chunk prefill，并且视频的chunk prefill只能帧与帧切割

Modifications

修改了多模中的视频理解每次推理时给引擎的token数

Usage or Command

请求中带入can_split_idx字段就好，这个数组中的每个值是可以切割的下标，例如token_num=10，can_split_list=[2,5,9]，表述chunk fill时可以分3次推理，每次的token数分别是[3,3,4]

Accuracy Tests

模型单测通过

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot · 2025-11-18T07:44:30Z

Thanks for your contribution!

add video chunk prefill

9f677d7

TBD1 self-requested a review November 18, 2025 07:52

TBD1 previously approved these changes Nov 18, 2025

View reviewed changes

add vit_merge=True for test_tokenizer_client.py

19e4768

yangjianfengo1 dismissed TBD1’s stale review via 19e4768 November 19, 2025 01:33

yangjianfengo1 force-pushed the eb4_video branch from dd4d1fe to 19e4768 Compare November 19, 2025 01:33

TBD1 self-requested a review November 19, 2025 03:36

TBD1 approved these changes Nov 19, 2025

View reviewed changes

yangjianfengo1 changed the title ~~【new feature】视频理解支持chunk prefill~~ [Scheduler] 视频理解支持chunk prefill Nov 20, 2025

yangjianfengo1 and others added 2 commits November 20, 2025 10:33

Merge branch 'develop' into eb4_video

8c7f073

Merge branch 'develop' into eb4_video

7dec3f1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Scheduler] 视频理解支持chunk prefill #5107

[Scheduler] 视频理解支持chunk prefill #5107

yangjianfengo1 commented Nov 18, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Scheduler] 视频理解支持chunk prefill #5107

Are you sure you want to change the base?

[Scheduler] 视频理解支持chunk prefill #5107

Conversation

yangjianfengo1 commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yangjianfengo1 commented Nov 18, 2025 •

edited

Loading