Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove useless packages
#3253 opened Jun 3, 2025 by yuanwu2017 Loading…
5 tasks
Xccl
#3252 opened Jun 2, 2025 by sywangyi Draft
5 tasks
Qwen3 moe
#3244 opened May 29, 2025 by yuanwu2017 Draft
5 tasks
xpu lora support
#3232 opened May 19, 2025 by sywangyi Loading…
Trtllm backend improvements
#3231 opened May 17, 2025 by leejuyuu Loading…
1 of 5 tasks
Refine logging for Gaudi warmup
#3222 opened May 10, 2025 by regisss Loading…
5 tasks
Fix typos
#3210 opened May 6, 2025 by omahs Loading…
1 of 5 tasks
feat: lock updated kernel versions
#3201 opened Apr 29, 2025 by drbh Loading…
Set uv UV_PYTHON_INSTALL_DIR explicitly
#3197 opened Apr 27, 2025 by sebastianliebscher Loading…
1 of 5 tasks
2
README: minimum Python version is 3.10
#3194 opened Apr 25, 2025 by Frenzie Loading…
1 of 5 tasks
feat: support logit bias in chat request
#3186 opened Apr 22, 2025 by drbh Loading…
Fix flashinfer plan call to use positional arguments for #3165
#3166 opened Apr 11, 2025 by ruckc Loading…
2 of 5 tasks
Update to flashinfer 0.2.5
#3164 opened Apr 11, 2025 by danieldk Draft
5 tasks
Add chunked attn for L4
#3162 opened Apr 10, 2025 by mht-sharma Draft
2 of 7 tasks
Gaudi: add CI
#3160 opened Apr 10, 2025 by baptistecolle Draft
Update links Inferentia refer docs
#3154 opened Apr 9, 2025 by guspan-tanadi Loading…
1 of 5 tasks
feat: align function id with tool call response
#3111 opened Mar 13, 2025 by drbh Loading…
wip: comment out prepend full_text
#3079 opened Mar 7, 2025 by jrc2139 Draft
1 of 5 tasks
Support xccl distributed backend
#3034 opened Feb 18, 2025 by dvrogozh Loading…
[Backend] Introduce vLLM backend
#2976 opened Jan 31, 2025 by mfuntowicz Loading…
ProTip! Filter pull requests by the default branch with base:main.