-
Notifications
You must be signed in to change notification settings - Fork 12k
Bug: running failure on Adreno devices using Vulkan for large batch size #8743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@MeeCreeps Sir,can you please share more detailed steps for building with Vulkan&GPU backend?I can't build if I enabled Vulkan for Android,typically if I just build with Vulkan enabled(not for Android),it can work. Build for Android with Vulkan backend enabled,error logs:
my build configuration:
|
you can first build the target vulkan-shaders-gen without android compile flags and set it in the environment |
can you tell me what's the variable should I set in environment for |
In file "ggml/src/CMakeLists.txt", it has |
@MeeCreeps thank you sir! I have followed your instruction to set PATH which will include
it shows that the build system didn't find the header
And I noticed that
it seems that the header I also tried to rename so where is |
I found |
update:it seems that NDK‘s Vulkan headers are outdated,so I update Vulkan related headers in NDK,like these sub dir:
now it can build successfully, |
@MeeCreeps Hello sir,sorry to bother you again.when I use these libraries in my Android APK,I still got these errors,here is the log:
it seems that these shaders can't be linked,any idea for this problem? the GPU device is |
How to update Vulkan headers? upgrade NDK or other methods? I found there are lots of redefinition in 'sources/third_party/vulkan/src/include/vulkan/vulkan.hpp' . |
You can find more details in my BLOG |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
What happened?
I tried to run the tinyllama-1.1b model on a OnePlus CPH2573 (with Adreno™ 750). It works fine when I set --batch-size to less than 32, but a failure (vk::DeviceLostError) occurs when I set --batch-size to 33.
In issue #5186, it was mentioned that Adreno devices have a maximum allocated memory size of 1GB, but this doesn't seem to fully explain the behavior I'm experiencing. I also tried submitting the operator one by one (not in a whole command buffer, but submitting them individually), and it succeeded. Does Vulkan on Adreno devices have other constraints (like a maximum size for command buffers) that could explain the failure I'm encountering?
Name and Version
build = 3400 (97bdd26)
main: built with Android (11349228, +pgo, +bolt, +lto, -mlgo, based on r487747e) clang version 17.0.2 (https://android.googlesource.com/toolchain/llvm-project d9f89f4d16663d5012e5c09495f3b30ece3d2362) for x86_64-linux-gnu
What operating system are you seeing the problem on?
No response
Relevant log output
The text was updated successfully, but these errors were encountered: