Add Ascend NPU as a new backend #6034

Closed
4 tasks done
hipudding opened this issue Mar 13, 2024 · 4 comments
Labels
enhancement New feature or request stale

Comments

@hipudding
Collaborator

hipudding commented Mar 13, 2024

Prerequisites

Please answer the following questions for yourself before submitting an issue.

  • I am running the latest code. Development is very rapid so there are no tagged versions as of now.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new bug or useful enhancement to share.

Feature Description

Add Ascend NPU as a new backend.

Motivation

Ascend is a full-stack AI computing infrastructure for industry applications and services, based on Huawei Ascend processors and software. For more information about Ascend, see the Ascend Community.

CANN (Compute Architecture for Neural Networks), developed by Huawei, is a heterogeneous computing architecture for AI.

PyTorch has officially announced support for Ascend NPU (through the PrivateUse1 dispatch key); see the PrivateUse1 tutorial here.

The goal is to provide new backend support for llama.cpp, allowing users with Ascend NPUs to run model inference through llama.cpp.

Possible Implementation

Currently, the community provides a convenient backend integration mechanism. The Ascend NPU is a CUDA-like device, so I plan to use the CUDA backend implementation as a reference when building the Ascend NPU backend.

Due to the large workload, I plan to complete this feature in multiple stages. First, I will focus on compilation, backend registration, and device runtime functionality. I will also add a new test file to validate backend registration, memory allocation, tensor operations, and other functionality.

Next, I will proceed to implement tensor operators and validate them.

Afterward, I will work on performance optimization, including split-tensor support.

See also: very first commit #6035.

@hipudding hipudding added the enhancement New feature or request label Mar 13, 2024
@hipudding
Collaborator Author

hipudding commented Mar 13, 2024

@ggerganov Thanks for this awesome project, I'm looking forward to your suggestions.

@ggerganov
Member

Interesting work! Looks like a good start. Follow @slaren's advice for any insights with the implementation

@hipudding
Collaborator Author

Interesting work! Looks like a good start. Follow @slaren's advice for any insights with the implementation

Thanks. I will.

@github-actions github-actions bot added the stale label Apr 14, 2024
Contributor

This issue was closed because it has been inactive for 14 days since being marked as stale.
