-
Notifications
You must be signed in to change notification settings - Fork 13.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Go OCI library integration with go-containerregistry
documentation
Improvements or additions to documentation
#16667
opened Oct 19, 2025 by
ericcurtin
Loading…
metal : adjust .get_alloc_size to be alloc friendly
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
sycl: add ROLL operation support
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16665
opened Oct 19, 2025 by
tamarPal
Loading…
devops: fix binaries release failure for s390x
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
#16664
opened Oct 19, 2025 by
taronaeo
Loading…
ggml: add ggml_can_fuse_subgraph
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
vulkan: Update topk_moe fusion to handle gpt's late softmax
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16656
opened Oct 18, 2025 by
jeffbolznv
Loading…
llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization
ggml
changes relating to the ggml tensor library for machine learning
#16653
opened Oct 18, 2025 by
JohannesGaessler
Loading…
ggml-cpu: optimise rms_norm op
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16650
opened Oct 18, 2025 by
taronaeo
Loading…
CUDA: topk-moe: add optional parameter for gpt-oss
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16649
opened Oct 18, 2025 by
am17an
Loading…
vulkan: Optimize SSM_SCAN
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16645
opened Oct 18, 2025 by
jeffbolznv
Loading…
sycl: use async memory allocation to fix crashes during graph recording
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16644
opened Oct 17, 2025 by
mmichel11
Loading…
CUDA: better error for FA kernel with 0 occupancy
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16643
opened Oct 17, 2025 by
JohannesGaessler
Loading…
Fix: validateNewFunctionWithConstantArguments:305: failed assertion constantValues must not be nil.'
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16639
opened Oct 17, 2025 by
armin976
Loading…
vulkan: Increase BK to 32; use BK/4 for non-CM mul_mm.comp
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16636
opened Oct 17, 2025 by
SavicStefan
Loading…
metal : initial Metal4 tensor API support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
CUDA: fuse gate + up for mmvq, and mmvf
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16630
opened Oct 17, 2025 by
am17an
Loading…
ggml: CUMSUM and TRI (CPU, Metal, CUDA)
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16623
opened Oct 16, 2025 by
gabe-l-hart
Loading…
webui: add OAI-Compat Harmony tool-call streaming visualization and persistence in chat UI
examples
server
#16618
opened Oct 16, 2025 by
ServeurpersoCom
Loading…
llama-batch: fix build fails with
-Werror=missing-braces
#16614
opened Oct 16, 2025 by
otegami
Loading…
SYCL: Add support for FLOOR,CEIL,ROUND and TRUNC unary operators
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16613
opened Oct 16, 2025 by
safranowith
Loading…
Fix non-ASCII path handling in common argument parser on Windows
#16611
opened Oct 16, 2025 by
kuguma
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.