Pull requests: ggml-org/llama.cpp

Pull requests list

arg : allow using -hf offline
#13202 opened Apr 29, 2025 by ngxson
CUDA: batched+noncont MMQ, refactor bs>1 MoE code
Labels: ggml, Nvidia GPU, testing
#13199 opened Apr 29, 2025 by JohannesGaessler
kv-cache : add SWA support
#13194 opened Apr 29, 2025 by ggerganov Draft
vulkan: use uint array index to avoid glslang bug
Labels: ggml, Vulkan
#13193 opened Apr 29, 2025 by jeffbolznv
vulkan: Handle src1 batch dimension in non-contiguous mat-vec-mul shader
Labels: ggml, testing, Vulkan
#13191 opened Apr 29, 2025 by jeffbolznv
rpc : fix cache directory initialization
Labels: examples
#13188 opened Apr 29, 2025 by hbuxiaofei
test: non-cont. b in test-backend-ops -o MUL_MAT
Labels: testing
#13187 opened Apr 29, 2025 by JohannesGaessler
mtmd : add C public API
Labels: examples, testing
#13184 opened Apr 29, 2025 by ngxson
ggml-cpu: enable z17 compile detection
Labels: ggml
#13182 opened Apr 29, 2025 by taronaeo
Fix for issue #13170
Labels: ggml
#13176 opened Apr 29, 2025 by shalinib-ibm
fix(rpc): validate graph operands
Labels: ggml
#13167 opened Apr 29, 2025 by thevilledev Draft
[CANN] Update CANN model support status
Labels: documentation
#13162 opened Apr 29, 2025 by bachelor-dou Draft
musa: enable MMA
Labels: ggml, Nvidia GPU
#13149 opened Apr 28, 2025 by yeahdongcn Draft
PowerPC: Enable MMA for BF16 in llamafile_sgemm
Labels: ggml
#13148 opened Apr 28, 2025 by shalinib-ibm
CUDA: build archs as virtual for GGML_NATIVE=OFF
Labels: ggml, Nvidia GPU
#13135 opened Apr 27, 2025 by JohannesGaessler
convert : improve model arch handling
Labels: python
#13122 opened Apr 26, 2025 by ngxson
sycl : Implemented reorder Q4_K mmvq
Labels: ggml, SYCL
#13109 opened Apr 25, 2025 by sgeor255
1 task
llama : try loading tensors with pre-computed hashes
Labels: Apple Metal, examples, ggml, Kompute, Nvidia GPU, SYCL, Vulkan
#13106 opened Apr 25, 2025 by rgerganov Draft
[sync #10544] llama/ggml: add LLM training support
Labels: examples, ggml, testing
#13105 opened Apr 25, 2025 by ggerganov Draft
1 task
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
Labels: Ascend NPU, ggml
#13104 opened Apr 25, 2025 by bachelor-dou
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
Labels: ggml
#13079 opened Apr 23, 2025 by SongXiaoXi
Reduce enum sizes some are used in structs, which allowed them to be optimized.
Labels: build, ggml, SYCL, Vulkan
#13071 opened Apr 22, 2025 by GermanAizek