-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Description
Hi, this is a very vague feature request :D
There are some parts of the code (in the main branch, I haven't looked at the upcoming branches that were mentioned in #395 ) that depend on some private/internal functionality of Transformers types. Mostly around the cache and logits history.
The problem with this is, it's difficult to create wrappers for other model providers, like GPTQ, AQW, llamacpp etc etc because you have to replicate some pretty specific private code from the Transformer types that Guidance expects to see.
So my request is this - is it possible to define a smaller interface/surface, not specific to Transformers (or at least not relying on anything internal to Transformers), that can be used to host models from other sources?
No idea how difficult this is or if it's realistic, just something that I would personally love to see.