-
Notifications
You must be signed in to change notification settings - Fork 86
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
refactor: replace decode_seq_range with batch_forward_type.
#451
opened Nov 27, 2025 by
RobbieLeung
Loading…
feat: support qwen2_5_vl/qwen3_vl/qwen3_vl_moe on mlu device.
#450
opened Nov 27, 2025 by
a120092009
Loading…
refactor: optimize unique token count preparation of batch input builder.
#449
opened Nov 27, 2025 by
RobbieLeung
Loading…
refactor: move draft input preparation of decode batch from worker to batch builder.
#448
opened Nov 27, 2025 by
RobbieLeung
Loading…
[WIP] feat: support loading model weights and forward overlap.
#441
opened Nov 26, 2025 by
Clement-Wang26
Loading…
feat: add rec proto,serivce and utils for rec framework[2/6].
#440
opened Nov 26, 2025 by
DragonFive
Loading…
feat: enhance Qwen3-MoE to support TP settings beyond 4.
#427
opened Nov 24, 2025 by
yingxudeng
Loading…
feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.
#399
opened Nov 18, 2025 by
xanecdotex
Loading…
feat: enable torch_npu graph mode for Qwen-3 dense with TP support.
#325
opened Nov 6, 2025 by
yingxudeng
Loading…
ProTip!
What’s not been updated in a month: updated:<2025-10-28.