Commit e8e05bb

Finish implementing min_p sampling

Closes abetlen#911

Implement min_p sampling as described in ggml-org/llama.cpp#3841.

Most of the actual work was already done; I just added the parameters to Llama.sample, Llama.generate, Llama.create_completion, and Llama.create_chat_completion. Tested and working as expected, as far as I can tell.
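For readers unfamiliar with the technique, here is a minimal sketch of min_p filtering in plain Python. It assumes the commonly described rule: a token is kept only if its probability is at least `min_p` times the probability of the most likely token. The function name `min_p_filter` and its signature are hypothetical and are not taken from llama.py or llama.cpp.

```python
import math


def min_p_filter(logits, min_p=0.05):
    """Illustrative min_p filter (hypothetical helper, not the library's code).

    Returns renormalized probabilities in which every token whose probability
    is below min_p * max_probability has been zeroed out.
    """
    # Numerically stable softmax: subtract the largest logit before exp().
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # The cutoff scales with the most likely token's probability.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]

    # Renormalize the surviving tokens so they sum to 1 again.
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


if __name__ == "__main__":
    example_logits = [math.log(p) for p in (0.6, 0.3, 0.09, 0.01)]
    print(min_p_filter(example_logits, min_p=0.1))
```

Because the cutoff is relative to the top token, the filter prunes aggressively when the model is confident and keeps more candidates when the distribution is flat. With the parameters this commit adds, a call would presumably pass the value through as a keyword argument to the methods listed above, though the exact parameter name and default are assumptions here.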

1 parent 96a3776 commit e8e05bb

1 file changed, +2182 -0 lines changed

llama.py (+2182 -0)

Comments (0)
