Commit ee249fb

Merge pull request #28 from liu-cong/manifest
Update vllm deployment example to use 1 GPU as tensor parallelism is 1
2 parents d122a6c + 948dbdc commit ee249fb

1 file changed: +2 -2 lines changed

examples/poc/manifests/vllm/vllm-lora-deployment.yaml

Lines changed: 2 additions & 2 deletions
@@ -78,9 +78,9 @@ spec:
             timeoutSeconds: 1
           resources:
             limits:
-              nvidia.com/gpu: 2
+              nvidia.com/gpu: 1
             requests:
-              nvidia.com/gpu: 2
+              nvidia.com/gpu: 1
           volumeMounts:
           - mountPath: /data
             name: data
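
For context, the commit message ties the GPU count to the tensor parallelism setting. The fragment below is a minimal sketch of that relationship, not the contents of the changed file: the container name, image tag, and args list are assumptions made for illustration. It assumes the server is started with vLLM's `--tensor-parallel-size` flag and that no other form of parallelism is in use, in which case the number of GPUs requested would be expected to match that value.

```yaml
# Hypothetical container fragment. The name, image, and args are assumptions
# for illustration and are not copied from vllm-lora-deployment.yaml.
containers:
- name: vllm
  image: vllm/vllm-openai:latest
  args:
  - "--tensor-parallel-size"
  - "1"                     # the model is sharded across this many GPUs
  resources:
    limits:
      nvidia.com/gpu: 1     # assumed to match --tensor-parallel-size
    requests:
      nvidia.com/gpu: 1     # extended resources need requests equal to limits
```

Requesting 2 GPUs with a tensor parallel size of 1 would likely leave the second GPU reserved by the pod but idle, which appears to be the motivation for this change.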
