-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
android: routine maintenance - Dec 2025
android
Issues specific to Android
examples
#18338
opened Dec 24, 2025 by
naco-siren
Loading…
[WIP]ggml-hexagon: improve leftover element calc at changes relating to the ggml tensor library for machine learning
vec_dot_f16_f32
ggml
mimo2: wire RMS eps + MoE bias + converter guard
model
Model specific
python
python script changes
#18333
opened Dec 24, 2025 by
Aaryan-Kapoor
Loading…
vulkan: Use BK=32 for coopmat2 mul_mat_id
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18332
opened Dec 23, 2025 by
jeffbolznv
Loading…
full modern bert support
python
python script changes
#18330
opened Dec 23, 2025 by
ryan-mangeno
Loading…
vulkan: Support UPSCALE w/antialias
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18327
opened Dec 23, 2025 by
jeffbolznv
Loading…
Support Youtu-VL Model
examples
python
python script changes
#18315
opened Dec 23, 2025 by
f291400
Loading…
Add metal count equal op
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#18314
opened Dec 23, 2025 by
gatbontonpc
Loading…
utils: beging using log.h in tokenize.cpp
examples
#18307
opened Dec 22, 2025 by
syedshazli
Loading…
vulkan: handle rope with large number of rows
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18306
opened Dec 22, 2025 by
jeffbolznv
Loading…
vulkan: fix command buffer corruption in ggml_backend_vk_event_wait
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18302
opened Dec 22, 2025 by
jeffbolznv
Loading…
Webui/prompt processing progress
examples
server
#18300
opened Dec 22, 2025 by
ServeurpersoCom
Loading…
vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18295
opened Dec 22, 2025 by
jeffbolznv
Loading…
eval-callback : add support for saving logits
examples
#18281
opened Dec 22, 2025 by
danbev
Loading…
Vulkan: Tune Flash Attention for MoE on AMD GPUs
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18280
opened Dec 22, 2025 by
0cc4m
Loading…
Cmdline arg -to changes http read timeout from current 600sec default
examples
server
#18279
opened Dec 22, 2025 by
wbtek
Loading…
tools : use common_log_pause to fix fit-params output race
examples
#18276
opened Dec 22, 2025 by
Aadeshveer
Loading…
KYLIN: fix compile error for cuda backend
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18275
opened Dec 22, 2025 by
lizhenneng
Loading…
docs: Fix typos in SYCL documentation
documentation
Improvements or additions to documentation
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18269
opened Dec 21, 2025 by
yoka
Loading…
llama: fix magic number of 999 for GPU layers
#18266
opened Dec 21, 2025 by
JohannesGaessler
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.