-
-
Notifications
You must be signed in to change notification settings - Fork 11.7k
[Kernel][Performance] Add Triton kernel for Qwen3-VL interleaved MRoPE #25055
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Benchmarkserver client ResultsMain PR |
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
|
Can you also check lm-eval? |
|
@Isotr0py @DarkLight1337 BTW let's not post the actual eval numbers since the model hasn't been released - just make sure the results match |
Oh, I just realized there is no generation tests for Qwen3-VL yet. Let me add ones later today (after my seminar) |
|
Have confirmed Qwen3-VL generation test added by #25185 can still pass with Triton kernel. |
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]> Signed-off-by: charlifu <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]>
vllm-project#25055) Signed-off-by: Isotr0py <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Purpose
Test Plan
Test Result
Test should pass
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.