Closes abetlen#911
Implement min_p sampling as described in ggml-org/llama.cpp#3841
Most of the actual work was already done; I just added the min_p parameter to Llama.sample, Llama.generate, Llama.create_completion, and Llama.create_chat_completion. Tested and working as expected, as far as I can tell.
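For reviewers unfamiliar with the technique, here is a rough standalone sketch of what min_p filtering does to a token distribution. It is not the code from this PR or from llama.cpp; the function name `min_p_filter` and the default value of 0.05 are made up for illustration. The idea is that a token is kept only if its probability is at least min_p times the probability of the most likely token, and the survivors are then renormalized.

```python
import math


def min_p_filter(logits, min_p=0.05):
    """Illustrative min_p filter (hypothetical helper, not part of llama-cpp-python).

    Returns a dict mapping kept token index -> renormalized probability.
    """
    # Softmax with max-subtraction for numerical stability.
    max_logit = max(logits)
    exps = [math.exp(x - max_logit) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep tokens whose probability is at least min_p * (top probability).
    threshold = min_p * max(probs)
    kept = {i: p for i, p in enumerate(probs) if p >= threshold}

    # Renormalize the surviving probabilities so they sum to 1.
    norm = sum(kept.values())
    return {i: p / norm for i, p in kept.items()}


# Example: probabilities 0.5, 0.3, 0.15, 0.05 with min_p = 0.2.
# The cutoff is 0.2 * 0.5 = 0.1, so only the last token is dropped.
example_logits = [math.log(p) for p in (0.5, 0.3, 0.15, 0.05)]
filtered = min_p_filter(example_logits, min_p=0.2)
```

Because the cutoff scales with the top probability, the filter is strict when the model is confident and permissive when the distribution is flat. Assuming the new keyword argument is named min_p, a call through the high-level API would look something like Llama.create_completion(prompt, min_p=0.05).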