
Custom Sampling Pipelines #348


Merged

Conversation

@martindevans (Member) commented Dec 8, 2023

Introduced an entirely new way to sample models.

The current sampling mechanism is hardcoded (see here) and can only be configured by tweaking values in `InferenceParams`.

This means a couple of things are not possible:

  • Re-ordering sampling stages (for example, another PR suggests applying temperature twice)
  • Developing entirely new sampling mechanisms (e.g. MinP could not have been developed using LLamaSharp).

This PR introduces:

  • A low-level interface for entirely customised sampling: `ISamplingPipeline`. If the `SamplingPipeline` property in the inference params is non-null, this pipeline will be used.
  • `BaseSamplingPipeline` is an abstract implementation of `ISamplingPipeline` which makes it a little easier to implement.
  • `DefaultSamplingPipeline` is a demo implementation that performs standard sampling.

You can see an example of it in use here: https://github.com/SciSharp/LLamaSharp/pull/348/files#diff-16c496b3d63b9606d57c5d2a5592059223a9e4bc7ab29416ea775d5356888df7R37
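For illustration, a minimal sketch of opting in to the new system (the namespaces below are assumptions; see the linked diff for the real usage):

```csharp
using LLama.Common;
using LLama.Sampling;  // namespace is an assumption; see the linked diff

// When SamplingPipeline is non-null, the executors sample through it
// instead of the hardcoded path configured by the other parameters.
var inferenceParams = new InferenceParams
{
    SamplingPipeline = new DefaultSamplingPipeline()
};
```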

Commit notes from this PR:

 - Added a `Sample` method to `LLamaContext` which uses a custom pipeline
 - Modified all executors to use the custom pipeline if it exists
 - Added `BaseSamplingPipeline`, which provides a base implementation of `ISamplingPipeline`
 - Added `DefaultSamplingPipeline`, which mimics normal llama.cpp sampling
@AsakusaRinne (Collaborator) left a comment

Great work! Greedy sampling is also a common sampling method, and it would be good to provide it in LLamaSharp. It's not required in this PR, just a note. :)
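For illustration, greedy sampling could be expressed as a custom pipeline roughly like this (a sketch only: the `Sample`/`Reset` member shape below is an assumption about the `ISamplingPipeline` interface introduced by this PR; see the linked diff for the real signature):

```csharp
using System;
using LLama.Native;
using LLama.Sampling;

// Hypothetical sketch of greedy sampling as a custom pipeline.
// The method signature is an assumption about the ISamplingPipeline
// shape introduced in this PR.
public sealed class GreedySamplingPipeline : ISamplingPipeline
{
    public int Sample(SafeLLamaContextHandle ctx, Span<float> logits, ReadOnlySpan<int> lastTokens)
    {
        // Greedy sampling: always pick the single highest-logit token.
        var best = 0;
        for (var i = 1; i < logits.Length; i++)
        {
            if (logits[i] > logits[best])
                best = i;
        }
        return best;
    }

    public void Reset() { }

    public void Dispose() { }
}
```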

@martindevans (Member, Author) replied

That reminds me of something related that I've been thinking about and might come back to in a future PR. At the moment the LLamaContext basically has several different sampling modes, depending on which parameters you pass:

  • Pure Greedy (Temp <= 0)
  • MiroStat
  • MiroStat2
  • TopK + TailFree + LocallyTypical + TopP + MinP + Temperature + Sampling

So using this new pipeline system we could explicitly split it into four separate pipelines, construct them inside LLamaContext, and use them as necessary. That would split some of the code out of LLamaContext into a more reusable form.
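A hedged sketch of that four-way split (only `DefaultSamplingPipeline` exists in this PR; `GreedySamplingPipeline` is the sketch from the earlier comment, and the Mirostat pipeline names are invented here for illustration):

```csharp
using LLama.Sampling;

// Hypothetical mode selection: one dedicated pipeline per sampling mode,
// constructed inside LLamaContext instead of one hardcoded code path.
// Only DefaultSamplingPipeline exists in this PR; the other class names
// are illustrative assumptions.
static ISamplingPipeline SelectPipeline(float temperature, int mirostat)
{
    if (temperature <= 0)
        return new GreedySamplingPipeline();     // pure greedy (Temp <= 0)
    if (mirostat == 1)
        return new MirostatSamplingPipeline();   // hypothetical
    if (mirostat == 2)
        return new Mirostat2SamplingPipeline();  // hypothetical
    return new DefaultSamplingPipeline();        // TopK + TailFree + ... + Temperature
}
```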

@martindevans martindevans merged commit d87d654 into SciSharp:master Dec 11, 2023
@martindevans martindevans deleted the new_object_based_sampling_pipeline branch December 11, 2023 21:40