[User] Building with CUDA 11.0 #3131

# Prerequisites

- [X] I am running the latest code.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md).
- [X] I [searched using keywords relevant to my issue](https://docs.github.com/en/issues/tracking-your-work-with-issues/filtering-and-searching-issues-and-pull-requests) to make sure that I am creating a new issue that is not already open (or closed).
- [X] I reviewed the [Discussions](https://github.com/ggerganov/llama.cpp/discussions), and have a new bug or useful enhancement to share.

# Expected Behavior

I am trying to build llama.cpp with CUDA 11.0 using the following command:

```
$ make LLAMA_CUBLAS=ON CUDA_DOCKER_ARCH=sm_80 -j8
```

# Current Behavior

The build currently fails with CUDA 11.0 for several reasons:

1. `nvcc` fails to forward `-march=native -mtune=native` to the host compiler, which results in `nvcc fatal : 'arch=native': expected a number`.

2. In CUDA 11.0, `cudaStreamWaitEvent(...)` requires all three arguments. Since CUDA 11.1, the third parameter defaults to 0.

3. CUDA 11.0 does not support `__builtin_assume()`. According to the docs, it is only available since CUDA 11.1.
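
For items 2 and 3, a compatibility shim along the following lines might be enough. This is only a sketch under my own assumptions, not the project's actual fix: the names `compat_stream_wait_event`, `COMPAT_ASSUME`, `COMPAT_HOST_DEVICE`, and `clamp_index` are hypothetical, and the version threshold simply follows the 11.1 cutoff described above (`CUDART_VERSION` encodes 11.1 as 11010).

```cpp
// Hypothetical compatibility shim for CUDA 11.0 (a sketch, not llama.cpp's actual fix).
// All COMPAT_* / compat_* names are made up for illustration.
#include <cassert>

#ifdef __CUDACC__
#include <cuda_runtime.h>  // defines CUDART_VERSION (major*1000 + minor*10)

// Item 2: always pass the flags argument explicitly, so the call compiles on
// CUDA 11.0 as well as on releases where the argument defaults to 0.
static inline cudaError_t compat_stream_wait_event(cudaStream_t stream, cudaEvent_t event) {
    return cudaStreamWaitEvent(stream, event, 0);
}
#endif

// Item 3: expand to __builtin_assume only in the device pass of a toolkit that
// is new enough. Restricting it to device code is a conservative choice, since
// the host compiler may not provide __builtin_assume.
#if defined(__CUDA_ARCH__) && defined(CUDART_VERSION) && CUDART_VERSION >= 11010
#define COMPAT_ASSUME(x) __builtin_assume(x)
#else
#define COMPAT_ASSUME(x) ((void)0)  // no-op: the hint is dropped, behavior is unchanged
#endif

// Lets the example below compile both with nvcc and with a plain host compiler.
#ifdef __CUDACC__
#define COMPAT_HOST_DEVICE __host__ __device__
#else
#define COMPAT_HOST_DEVICE
#endif

// Example use: clamp an index into [0, n-1], hinting that n is positive.
COMPAT_HOST_DEVICE static inline int clamp_index(int i, int n) {
    assert(n > 0);         // debug-build check of the precondition
    COMPAT_ASSUME(n > 0);  // optimizer hint where supported
    return i < 0 ? 0 : (i >= n ? n - 1 : i);
}
```

With a guard like this, the optimizer hint is simply lost on CUDA 11.0 while newer toolkits keep it, so results should not differ between versions.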

# Environment and Context

I am building in the `nvidia/cuda:11.0.3-devel-ubuntu20.04` Docker image with only `build-essential` installed.
The host driver is 536.23.
The host OS is WSL.

