-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Labels
documentationImprovements or additions to documentationImprovements or additions to documentationenhancementNew feature or requestNew feature or request
Description
- Migrate project to scikit-build-core #489
- Simplify dev / local setup using pyproject.toml and makefile #490
- Use numpy arrays for LogitsProcessors and StopCriteria to avoid copies / allocations #491
- Llama2 #488
- Configurable chat templates #492
- Add support for OpenAI-style functions #494
- Expose
scores
andinput_ids
inLlama
model #493 - Add batched inference #771
- Speculative sampling #675
Huge, vvsotnikov, rangehow, teleprint-me, romansky and 5 morepabl-o-ce and egeres
Metadata
Metadata
Assignees
Labels
documentationImprovements or additions to documentationImprovements or additions to documentationenhancementNew feature or requestNew feature or request