
[Installation]: NVIDIA runtime issue? On new vLLM 0.7.0 #12505

@Playerrrrr

Description


Your current environment

The output of `python collect_env.py`

docker run --runtime nvidia --gpus all \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  -p 8000:8000 --ipc=host \
  -e VLLM_ENABLE_PREFIX_CACHING=true \
  --name qwen2.5_20250128 \
  vllm/vllm-openai:v0.7.0 \
  --model Qwen/Qwen2.5-72B-Instruct \
  --tensor-parallel-size=4 \
  --gpu-memory-utilization=0.90 \
  --enforce-eager \
  --rope-scaling '{"type": "yarn","factor": 4,"original_max_position_embeddings": 32768}'

Error:

/usr/bin/ld: cannot find -lcuda: No such file or directory
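A way to narrow this down (a diagnostic sketch, not part of the original report): `/usr/bin/ld: cannot find -lcuda` means the linker inside the container cannot see the CUDA driver library (`libcuda.so`), which the NVIDIA container runtime is normally responsible for injecting. The checks below assume a host with the NVIDIA driver and Container Toolkit installed; the CUDA base image tag is only an example.

```bash
# 1. Confirm the NVIDIA runtime can expose the GPU at all
#    (any CUDA base image works; this tag is an example).
docker run --rm --runtime nvidia --gpus all \
  nvidia/cuda:12.1.0-base-ubuntu22.04 nvidia-smi

# 2. Check whether libcuda.so is visible to the linker inside
#    the vLLM container itself.
docker run --rm --runtime nvidia --gpus all \
  --entrypoint /bin/bash vllm/vllm-openai:v0.7.0 \
  -c "ldconfig -p | grep libcuda"
```

If step 1 fails, the host-side NVIDIA Container Toolkit setup is the likely problem; if step 1 works but step 2 prints nothing, the driver library is not being mounted into the container's linker path.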

How you are installing vllm

Docker
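For reference (an assumption about a common cause, not something confirmed in this report): on Docker installs, NVIDIA's Container Toolkit documentation has the runtime registered with the Docker daemon before GPU containers are run. The commands below are that documented registration step; they require the `nvidia-container-toolkit` package on the host.

```bash
# Register the NVIDIA runtime in /etc/docker/daemon.json and
# restart the daemon so "--runtime nvidia" resolves correctly.
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
```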

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
