Pull requests: ggml-org/llama.cpp
Pull requests list
convert : fix null head_dim AutoConfig regression
python
python script changes
#14248
opened Jun 17, 2025 by
CISC
2
llama-chat : fix multiple system messages for gemma, orion
#14246
opened Jun 17, 2025 by
ngxson
sycl: add usage of enqueue_functions extension
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14244
opened Jun 17, 2025 by
s-Nick
Add SmolLM3
documentation
Improvements or additions to documentation
python
python script changes
#14240
opened Jun 17, 2025 by
Vaibhavs10
•
Draft
MODEL: Falcon-H1 support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#14238
opened Jun 17, 2025 by
younesbelkada
•
Draft
Mtmd: add a way to select device for vision encoder
examples
#14236
opened Jun 17, 2025 by
stduhpf
ggml: introduce GGML_NUMA_MIGRATE to optimize cross NUMA op computation
examples
ggml
changes relating to the ggml tensor library for machine learning
#14232
opened Jun 17, 2025 by
wenlujon
logit_bias: apply configurable escalating EOG bias at low n_remain
examples
server
testing
Everything test related
#14229
opened Jun 16, 2025 by
graehl
tests : enhance llama-bench with separate timings (pp/gen t/s), added n_threads_batch
examples
#14219
opened Jun 16, 2025 by
thad0ctor
sycl: Cleanup codepaths in Get Rows in sycl backend
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14215
opened Jun 16, 2025 by
ShanoToni
gguf-py: Make sentencepiece optional
python
python script changes
#14200
opened Jun 15, 2025 by
Ahajha
webui: save model name with conversation history (#13570)
examples
server
#14192
opened Jun 15, 2025 by
deepanshu2015
ci: re-enable rocm linux build, reduce the built targets to the ones currently available in rocblas
devops
improvements to build systems and github actions
#14184
opened Jun 14, 2025 by
IMbackK
ggml : implement GLU for split up/gate
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14181
opened Jun 14, 2025 by
CISC
ggml : implement REGLU/GEGLU/SWIGLU ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
help wanted
Extra attention is needed
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14158
opened Jun 12, 2025 by
CISC
models/templates: add mistralai/Mistral-Small-3.1-24B-Instruct-2503 template with tool calling support
#14148
opened Jun 12, 2025 by
bretello
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization
ggml
changes relating to the ggml tensor library for machine learning
#14117
opened Jun 11, 2025 by
Vithulep
scripts: Fix remote option in Windows (#14102)
python
python script changes
#14100
opened Jun 10, 2025 by
pqnet