Skip to content

Commit 1f360a7

Browse files
authored
Merge branch 'master' into feat/stablediffusion-ggml
2 parents bef0d4f + 074b52b commit 1f360a7

File tree

3 files changed

+57
-2
lines changed

3 files changed

+57
-2
lines changed

Makefile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ DETECT_LIBS?=true
88
# llama.cpp versions
99
GOLLAMA_REPO?=https://github.com/go-skynet/go-llama.cpp
1010
GOLLAMA_VERSION?=2b57a8ae43e4699d3dc5d1496a1ccd42922993be
11-
CPPLLAMA_VERSION?=5e1ed95583ca552a98d8528b73e1ff81249c2bf9
11+
CPPLLAMA_VERSION?=8648c521010620c2daccfa1d26015c668ba2c717
1212

1313
# whisper.cpp version
1414
WHISPER_REPO?=https://github.com/ggerganov/whisper.cpp

docs/themes/hugo-theme-relearn

Submodule hugo-theme-relearn updated 86 files

gallery/index.yaml

Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1725,6 +1725,31 @@
17251725
- filename: Teleut-7b.Q4_K_M.gguf
17261726
sha256: 844a633ea01d793c638e99f2e07413606b3812b759e9264fbaf69c8d94eaa093
17271727
uri: huggingface://QuantFactory/Teleut-7b-GGUF/Teleut-7b.Q4_K_M.gguf
1728+
- !!merge <<: *qwen25
1729+
name: "qwen2.5-7b-homercreative-mix"
1730+
urls:
1731+
- https://huggingface.co/ZeroXClem/Qwen2.5-7B-HomerCreative-Mix
1732+
- https://huggingface.co/QuantFactory/Qwen2.5-7B-HomerCreative-Mix-GGUF
1733+
description: |
1734+
ZeroXClem/Qwen2.5-7B-HomerCreative-Mix is an advanced language model meticulously crafted by merging four pre-trained models using the powerful mergekit framework. This fusion leverages the Model Stock merge method to combine the creative prowess of Qandora, the instructive capabilities of Qwen-Instruct-Fusion, the sophisticated blending of HomerSlerp1, and the foundational conversational strengths of Homer-v0.5-Qwen2.5-7B. The resulting model excels in creative text generation, contextual understanding, and dynamic conversational interactions.
1735+
🚀 Merged Models
1736+
1737+
This model merge incorporates the following:
1738+
1739+
bunnycore/Qandora-2.5-7B-Creative: Specializes in creative text generation, enhancing the model's ability to produce imaginative and diverse content.
1740+
1741+
bunnycore/Qwen2.5-7B-Instruct-Fusion: Focuses on instruction-following capabilities, improving the model's performance in understanding and executing user commands.
1742+
1743+
allknowingroger/HomerSlerp1-7B: Utilizes spherical linear interpolation (SLERP) to blend model weights smoothly, ensuring a harmonious integration of different model attributes.
1744+
1745+
newsbang/Homer-v0.5-Qwen2.5-7B: Acts as the foundational conversational model, providing robust language comprehension and generation capabilities.
1746+
overrides:
1747+
parameters:
1748+
model: Qwen2.5-7B-HomerCreative-Mix.Q4_K_M.gguf
1749+
files:
1750+
- filename: Qwen2.5-7B-HomerCreative-Mix.Q4_K_M.gguf
1751+
sha256: fc3fdb41e068646592f89a8ae62d7b330f2bd4e97bf615aef2977930977c8ba5
1752+
uri: huggingface://QuantFactory/Qwen2.5-7B-HomerCreative-Mix-GGUF/Qwen2.5-7B-HomerCreative-Mix.Q4_K_M.gguf
17281753
- &archfunct
17291754
license: apache-2.0
17301755
tags:
@@ -3340,6 +3365,20 @@
33403365
- filename: Skywork-o1-Open-Llama-3.1-8B.Q4_K_M.gguf
33413366
sha256: ef6a203ba585aab14f5d2ec463917a45b3ac571abd89c39e9a96a5e395ea8eea
33423367
uri: huggingface://QuantFactory/Skywork-o1-Open-Llama-3.1-8B-GGUF/Skywork-o1-Open-Llama-3.1-8B.Q4_K_M.gguf
3368+
- !!merge <<: *llama31
3369+
name: "sparse-llama-3.1-8b-2of4"
3370+
urls:
3371+
- https://huggingface.co/QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
3372+
- https://huggingface.co/QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
3373+
description: |
3374+
This is the 2:4 sparse version of Llama-3.1-8B. On the OpenLLM benchmark (version 1), it achieves an average score of 62.16, compared to 63.19 for the dense model—demonstrating a 98.37% accuracy recovery. On the Mosaic Eval Gauntlet benchmark (version v0.3), it achieves an average score of 53.85, versus 55.34 for the dense model—representing a 97.3% accuracy recovery.
3375+
overrides:
3376+
parameters:
3377+
model: Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
3378+
files:
3379+
- filename: Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
3380+
sha256: c481e7089ffaedd5ae8c74dccc7fb45f6509640b661fa086ae979f6fefc3fdba
3381+
uri: huggingface://QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF/Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
33433382
- &deepseek
33443383
## Deepseek
33453384
url: "github:mudler/LocalAI/gallery/deepseek.yaml@master"
@@ -4905,6 +4944,22 @@
49054944
- filename: Volare.i1-Q4_K_M.gguf
49064945
sha256: fa8fb9d4cb19fcb44be8d53561c9e2840f45aed738de545983ebb158ebba461b
49074946
uri: huggingface://mradermacher/Volare-i1-GGUF/Volare.i1-Q4_K_M.gguf
4947+
- !!merge <<: *gemma
4948+
name: "bggpt-gemma-2-2.6b-it-v1.0"
4949+
icon: https://cdn-uploads.huggingface.co/production/uploads/637e1f8cf7e01589cc17bf7e/p6d0YFHjWCQ3S12jWqO1m.png
4950+
urls:
4951+
- https://huggingface.co/QuantFactory/BgGPT-Gemma-2-2.6B-IT-v1.0-GGUF
4952+
- https://huggingface.co/QuantFactory/BgGPT-Gemma-2-2.6B-IT-v1.0-GGUF
4953+
description: |
4954+
INSAIT introduces BgGPT-Gemma-2-2.6B-IT-v1.0, a state-of-the-art Bulgarian language model based on google/gemma-2-2b and google/gemma-2-2b-it. BgGPT-Gemma-2-2.6B-IT-v1.0 is free to use and distributed under the Gemma Terms of Use. This model was created by INSAIT, part of Sofia University St. Kliment Ohridski, in Sofia, Bulgaria.
4955+
The model was built on top of Google’s Gemma 2 2B open models. It was continuously pre-trained on around 100 billion tokens (85 billion in Bulgarian) using the Branch-and-Merge strategy INSAIT presented at EMNLP’24, allowing the model to gain outstanding Bulgarian cultural and linguistic capabilities while retaining its English performance. During the pre-training stage, we use various datasets, including Bulgarian web crawl data, freely available datasets such as Wikipedia, a range of specialized Bulgarian datasets sourced by the INSAIT Institute, and machine translations of popular English datasets. The model was then instruction-fine-tuned on a newly constructed Bulgarian instruction dataset created using real-world conversations. For more information check our blogpost.
4956+
overrides:
4957+
parameters:
4958+
model: BgGPT-Gemma-2-2.6B-IT-v1.0.Q4_K_M.gguf
4959+
files:
4960+
- filename: BgGPT-Gemma-2-2.6B-IT-v1.0.Q4_K_M.gguf
4961+
sha256: 1e92fe80ccad80e97076ee26b002c2280f075dfe2507d534b46a4391a077f319
4962+
uri: huggingface://QuantFactory/BgGPT-Gemma-2-2.6B-IT-v1.0-GGUF/BgGPT-Gemma-2-2.6B-IT-v1.0.Q4_K_M.gguf
49084963
- &llama3
49094964
url: "github:mudler/LocalAI/gallery/llama3-instruct.yaml@master"
49104965
icon: https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png

0 commit comments

Comments
 (0)