-
-
Notifications
You must be signed in to change notification settings - Fork 10.2k
Closed
Labels
releaseRelated to new version releaseRelated to new version release
Description
ETA: Oct. 15th (Sun) Oct 16th (Mon).
Major changes
TBD
PRs to be merged before the release
- PagedAttention V2 Implement PagedAttention V2 #1348
- Support
echo
Implement prompt logprobs & Batched topk for computing logprobs #1328 Supporting log probabilities of prompt tokens in both engine and OpenAI API server (akaecho
) #959 - Fix
TORCH_CUDA_ARCH_LIST
err msg Fix error message onTORCH_CUDA_ARCH_LIST
#1239 Support YaRN YaRN support implementation #1264 YaRN tests #1161(Deferred)Add(Deferred)repetition_penalty
sampling parameter Add repetition_penalty aligned with huggingface #866
Metadata
Metadata
Assignees
Labels
releaseRelated to new version releaseRelated to new version release