* Docker must be installed and running on your system.
* Create a folder to store large models and intermediate files (the examples here use `/llama/models`).
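Both prerequisites can be checked from a shell before going further. The following is a minimal sketch, not something shipped with the project: the function names `docker_ready` and `prepare_models_dir` are hypothetical, and `/llama/models` is only the example path used in this section.

```bash
# Hypothetical helpers, not part of llama.cpp.

# Succeeds only if the docker CLI is on PATH and the daemon responds.
docker_ready() {
    command -v docker >/dev/null 2>&1 && docker info >/dev/null 2>&1
}

# Creates the models folder (and any missing parents), then prints its path.
# $1: folder to create (assumed default: /llama/models)
prepare_models_dir() {
    mkdir -p "${1:-/llama/models}" && printf '%s\n' "${1:-/llama/models}"
}

# Example usage:
#   docker_ready || echo "Docker is not installed or not running"
#   prepare_models_dir /llama/models
```

Creating a folder directly under `/` may require elevated permissions on some systems; any other writable path works as long as the same path is used in the `-v` option later.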
#### Images
Two Docker images are available for this project:

1. `ghcr.io/ggerganov/llama.cpp:full`: This image includes both the main executable and the tools to convert LLaMA models to ggml and quantize them to 4 bits.
2. `ghcr.io/ggerganov/llama.cpp:light`: This image includes only the main executable.
#### Usage
The easiest way to download the models, convert them to ggml, and optimize them is the `--all-in-one` command, which is included in the full Docker image.
```bash
docker run -v /llama/models:/models ghcr.io/ggerganov/llama.cpp:full --all-in-one "/models/" 7B
```
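In the command above, `-v /llama/models:/models` bind-mounts the host folder at `/models` inside the container, which is why the path passed to `--all-in-one` is `"/models/"` rather than the host path. As a rough sketch of how the host folder could be swapped out, the hypothetical helper below (`all_in_one_cmd` is not part of the project) only prints the command so it can be reviewed before running it.

```bash
# Hypothetical wrapper: prints the all-in-one command instead of running it.
# $1: host folder to mount at /models (assumed default: /llama/models)
# $2: model size argument, as in the example above (assumed default: 7B)
all_in_one_cmd() {
    printf 'docker run -v %s:/models ghcr.io/ggerganov/llama.cpp:full --all-in-one "/models/" %s\n' \
        "${1:-/llama/models}" "${2:-7B}"
}

# Prints the same command as the example above.
all_in_one_cmd /llama/models 7B
```

This sketch does no quoting of the host path, so a folder name containing spaces would need extra care.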
Once it completes, you are ready to play!
```bash
docker run -v /llama/models:/models ghcr.io/ggerganov/llama.cpp:full --run -m /models/7B/ggml-model-q4_0.bin -p "Building a website can be done in 10 simple steps:" -t 8 -n 512
```
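In the command above, `-m` names the model file as seen from inside the container, `-p` is the prompt, `-t` is the number of threads, and `-n` is the number of tokens to predict. The sketch below shows one way those values could be parameterized. `run_cmd` is a hypothetical helper, not part of the project, and it prints the command rather than running it.

```bash
# Hypothetical wrapper around the run command shown above.
# $1: prompt text (-p)
# $2: thread count (-t), assumed default 8 as in the example
# $3: tokens to predict (-n), assumed default 512 as in the example
run_cmd() {
    printf 'docker run -v /llama/models:/models ghcr.io/ggerganov/llama.cpp:full --run -m /models/7B/ggml-model-q4_0.bin -p "%s" -t %s -n %s\n' \
        "$1" "${2:-8}" "${3:-512}"
}

# Prints the same command as the example above.
run_cmd "Building a website can be done in 10 simple steps:"
```

A prompt that itself contains double quotes would need escaping before being pasted into a shell; the sketch does not handle that case.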
Or with the light image:
```bash
docker run -v /llama/models:/models ghcr.io/ggerganov/llama.cpp:light -m /models/7B/ggml-model-q4_0.bin -p "Building a website can be done in 10 simple steps:" -t 8 -n 512
```