Skip to content

Commit e8e05bb

Browse files
authored
Finish implementing min_p sampling
Closes abetlen#911 Implement min_p sampling as described in ggml-org/llama.cpp#3841 Most of the actual work was already done, I just added the parameters to Llama.sample, Llama.generate, Llama.create_completion, Llama.create_completion, and Llama.create_chat_completion. Tested and working as expected, as far as I can tell.
1 parent 96a3776 commit e8e05bb

File tree

1 file changed

+2182
-0
lines changed

1 file changed

+2182
-0
lines changed

0 commit comments

Comments
 (0)