January 2025 Update #1036

Merged: 25 commits, Jan 21, 2025

@martindevans
Member

martindevans commented Jan 4, 2025

Updated llama.cpp binaries to 0827b2c1da299805288abbd556d869318f2b121e.

This introduces new binaries. Previously there were 2 for each platform, e.g. ggml.dll, llama.dll for CUDA. Now there are more: ggml.dll, ggml-base.dll, ggml-cpu.dll, ggml-cuda.dll, llama.dll.

Currently these are handled in the same way as in the old system: each platform has its own set of completely independent binaries. In the future this should be modified to dynamically load backends. See more details here and here.
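As a rough illustration of the new layout, the sketch below checks whether a directory contains the full set of binaries listed above for the CUDA build. It is not part of this PR or of LLamaSharp. The helper names `expected_files` and `missing_binaries` are hypothetical, and the `lib*.so` / `lib*.dylib` file names for Linux and macOS are assumed conventions, since only the Windows `.dll` names are given here.

```python
from pathlib import Path

# Base names taken from the PR description for the CUDA build.
CUDA_LIBS = ["ggml", "ggml-base", "ggml-cpu", "ggml-cuda", "llama"]

# ASSUMPTION: per-platform (prefix, suffix) naming. Only the Windows
# ".dll" form appears in the PR text; the others are common conventions.
NAMING = {
    "windows": ("", ".dll"),
    "linux": ("lib", ".so"),
    "macos": ("lib", ".dylib"),
}


def expected_files(platform, names=CUDA_LIBS):
    """Return the file names expected for the given platform."""
    prefix, suffix = NAMING[platform]
    return [f"{prefix}{name}{suffix}" for name in names]


def missing_binaries(directory, platform, names=CUDA_LIBS):
    """Return the expected file names that are not present in directory."""
    directory = Path(directory)
    return [f for f in expected_files(platform, names)
            if not (directory / f).is_file()]
```

Passing a different `names` list (for example, one without `ggml-cuda`) would adapt the check to another backend, assuming that backend follows the same naming pattern.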

Testing:

  • Windows CPU
  • Windows CUDA
  • Windows Vulkan
  • Linux CPU
  • Linux CUDA
  • Linux Vulkan
  • MacOS CPU
  • MacOS Metal

@LSXAxeller

LSXAxeller commented Jan 10, 2025

Thanks for your hard work. Any chance this can be updated with the latest binaries? Upstream has added QRWKV Hybrid, Phi-3.5-MoE, Deepseek V3, and C4AI Command-R 7B models, plus INT-8 and BF16 implementations.

@martindevans
Member Author

The binaries here are from just one week ago, so hopefully most of those things should be covered.

Currently we need to fix the issues loading the binaries on Linux and MacOS. Once that's figured out, the next update should be easier 🤞

@m0nsky
Contributor

m0nsky commented Jan 20, 2025

Unit tests passed on Windows CUDA & Linux CUDA.

Test application is running fine on:

  • Windows CPU
  • Windows CUDA
  • Windows Vulkan
  • Linux CPU
  • Linux CUDA
  • Linux Vulkan

@Lyrcaxis
Contributor

Lyrcaxis commented Jan 20, 2025

Works on my machine! (osx-arm64)

  • MacOS CPU
  • MacOS Metal

Tested all of [no offload, partial offload, full offload].

@martindevans merged commit 02eedd9 into SciSharp:master on Jan 21, 2025
6 checks passed
5 participants