-
-
Notifications
You must be signed in to change notification settings - Fork 10.5k
Closed
Labels
feature requestNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Description
🚀 The feature, motivation and pitch
vLLM's benchmark serving script is expected to be a standalone inference client that only requires minimum dependencies. Currently, it still imports vllm
conditionally.
The task is as follows:
- Clearly define a requirements txt for benchmark serving client
numpy
pandas
Pillow
tqdm
transformers
datasets
-
Add a CI test that create a new uv environment and execute the script. Ensure there is no vLLM present. This can be part of existing tests for benchmark scripts. https://github.com/vllm-project/vllm/blob/main/.buildkite/run-benchmarks.sh
-
Make sure the existing usage of vLLM is moved to inlining whatever utility method is required.
Alternatives
No response
Additional context
See #14879 for discussion, cc @houseroad @ywang96
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Type
Projects
Status
Done