Vulkan generated targets and shader organization #5356

bandoti · 2024-02-06T02:30:51Z

The generated header ggml-vulkan-shaders.hpp is 3 MB of generated binary from the packed Vulkan shaders. These should ideally be generated as a make or CMake target at build time instead of being placed under source control.

In addition, I would like to propose splitting the actual shaders out from ggml_vk_generate_shaders.py. It will likely be easier to reason about the shaders (even though there will be many of them) if they are placed within a separate folder (perhaps $LLAMA_ROOT/vulkan) and then collected/assembled by the python script, instead of having them inline.

That way it will be clearer which part of Vulkan is affected by a given change—and also commit conflicts will be lessened if multiple people are working on separate shaders. In addition, if new shaders need to be added they can simply be dropped in the folder, and the python script may glob them before packing into ggml-vulkan-shaders.hpp.

The hard work to get Vulkan going is greatly appreciated—I look forward to exploring this back-end further. Thank you!

The text was updated successfully, but these errors were encountered:

0cc4m · 2024-02-06T16:51:31Z

The idea behind bundling it like this was to allow templating and to keep the number of files in the repo low, which I know @ggerganov prefers. But I see the disadvantages. I'm open to a discussion about what we could replace it with that has as few disadvantages as possible.

There's a few points to consider:

GLSL requires a file for each shader
GLSL has no direct way for reusing shader code between them (but the Kompute backend has a workaround using the preprocessor)
Some custom code is needed to embed the SPIR-V shaders in a header to avoid having to load them on runtime (which Kompute has done in CMake, I think

Also pinging @cebtenzzre who has raised this point before. Let me know what you think.

ggerganov · 2024-02-06T17:32:37Z

Moving the shaders to a vulkan-shaders folder similar to kompute-shaders would be fine if that can help to improve the process

bandoti · 2024-02-07T19:49:28Z

So here are some basic ideas on this after some consideration.

We could keep ggml-vulkan-shaders.hpp as a facade header which includes one header for each shader. This file itself should be autogenerated and populated with the list of shaders.

In CMake a custom target (using add_custom_command) will be added to the if (LLAMA_VULKAN) ... block which wraps a script invocation—invoking a spirc.py—which does preprocessing and delegates to glslc. Anything with the appropriate shader extensions will subsequently be tracked by this—and will automatically compile these in a *.spirv.hpp file then placed within the build directory, where they can be picked up by ggml-vulkan-shaders.hpp. Similarly, a custom target can be added for ggml-vulkan-shaders.hpp which does something like the following example.

On the make side, logic needs to be added to locate the glslc command. However, the workflow is a bit clearer and essentially something like:

VULKAN_DIR              = ...
VULKAN_COMPILE_CMD      = "$(srcdir)/spirc.py --glslc $(VULKAN_DIR)/glslc"
VULKAN_SHADERS          = $(srcdir)/vulkan-shaders/*.frag ...
VULKAN_SHADERS_COMPILED = $(for s in $(VULKAN_SHADERS); \
        do compiled_name="$(echo "$s" | awk -F/ '{print $NF}' | sed 's/\.frag/\.spirv\.hpp/'))"; \
             echo "$(builddir)/$compiled_name"; done)

$(srcdir)/ggml-vulkan-shaders.hpp: $(VULKAN_SHADERS_COMPILED)
	for c in $(VULKAN_SHADERS_COMPILED); \
		do echo "\#include \"$c\"" >> $@; done

%.spirv.hpp: %.frag
	$(VULKAN_COMPILE_CMD) --input $< --output $@

In this example a separate header extension is used to distinguish these from regular header files which allows the custom rule to be invoked. Either the user is required to specify VULKAN_DIR or some automated mechanism can be added.

I haven't tested any of this code but the flow should at least capture the concept.

cebtenzzre · 2024-02-07T19:58:52Z

We could keep ggml-vulkan-shaders.hpp as a facade header which includes one header for each shader. This file itself should be autogenerated and populated with the list of shaders.

For the sake of build times, I would prefer if we built the shaders as a separate translation unit from ggml-vulkan.cpp - at the very least use a precompiled header; at best, have each shader in its own translation unit. The machine I have my AMD GPU in has an aging 16-thread Xeon, and often when developing the Kompute backend I have watched 15 threads sit idle as it recompiles all of the shader headers serially (as part of ggml-kompute.cpp) every time I touch a single line of C++ code.

pure-water · 2024-02-09T04:19:09Z

So it appears that I can only have ggm-vulkan-shaders.hpp as an input for now anyway?

bandoti · 2024-02-23T14:31:00Z

@0cc4m Let me know if you'd like support on the build scripts. I'm not so much familiar with the Vulkan particulars (e.g. for splitting out the embedded shaders into a compile script) but I'd be happy to help integrate things with Make/CMake. If there's a particular branch let me know and I'll take a look.

0cc4m · 2024-02-25T18:48:28Z

Eventually, but I'm busy trying to get MoE models to work and I don't really have the capacity to rework the shaders at the same time.

pure-water · 2024-02-26T08:22:51Z

is there the stand-alone vulkan shader sub directory now?

bandoti · 2024-02-27T14:08:45Z

is there the stand-alone vulkan shader sub directory now?

Not yet. :)

pure-water · 2024-02-28T00:38:49Z

Sorry, just made myself right. I would be able to see the shader code as the kcompute-shader one, right?

bandoti · 2024-03-04T15:45:46Z

Sorry, just made myself right. I would be able to see the shader code as the kcompute-shader one, right?

That is the goal being discussed here. However, if you are interested in seeing the shaders right now they are packed within ggml_vk_generate_shaders.py. This script is currently used to create the C++ header file ggml-vulkan-shaders.hpp.

pure-water · 2024-03-12T00:21:58Z

OK, thanks. I will update this post after i am done trying to update some shaders to improve some performance on particular network

github-actions · 2024-04-25T01:12:49Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

0cc4m · 2024-06-13T05:06:05Z

@bandoti Sorry this took so long, I'm working on extracting the shader code from the Python file. I'll make a PR that adds those files and adapts the Python file to read them. You could work on replacing the Python file with CMake/Make afterwards.

bandoti · 2024-06-24T13:38:32Z

I am working on integrating with the build system now. Will post an update once that's finished.

0cc4m · 2024-07-30T08:13:46Z

@bandoti I think we can close this issue, right?

bandoti added the bug-unconfirmed label Feb 6, 2024

0cc4m added enhancement New feature or request Vulkan Issues specific to the Vulkan backend and removed bug-unconfirmed labels Feb 6, 2024

ggerganov mentioned this issue Feb 12, 2024

Add Vulkan to cmake build ggml-org/ggml#730

Merged

github-actions bot added the stale label Apr 11, 2024

github-actions bot closed this as completed Apr 25, 2024

0cc4m reopened this Jun 5, 2024

github-actions bot removed the stale label Jun 6, 2024

0cc4m mentioned this issue Jun 15, 2024

Vulkan Shader Refactor, Memory Debugging Option #7947

Merged

4 tasks

sohzm mentioned this issue Jun 18, 2024

Add vulkan backend leejet/stable-diffusion.cpp#291

Merged

bandoti mentioned this issue Jun 25, 2024

vulkan : cmake integration #8119

Merged

4 tasks

github-actions bot added the stale label Jul 25, 2024

bandoti closed this as completed Jul 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vulkan generated targets and shader organization #5356

Vulkan generated targets and shader organization #5356

bandoti commented Feb 6, 2024 •

edited

Loading

0cc4m commented Feb 6, 2024

Uh oh!

ggerganov commented Feb 6, 2024

Uh oh!

bandoti commented Feb 7, 2024 •

edited

Loading

Uh oh!

cebtenzzre commented Feb 7, 2024

Uh oh!

pure-water commented Feb 9, 2024

Uh oh!

bandoti commented Feb 23, 2024 •

edited

Loading

Uh oh!

0cc4m commented Feb 25, 2024

Uh oh!

pure-water commented Feb 26, 2024

Uh oh!

bandoti commented Feb 27, 2024

Uh oh!

pure-water commented Feb 28, 2024

Uh oh!

bandoti commented Mar 4, 2024

Uh oh!

pure-water commented Mar 12, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Apr 25, 2024

Uh oh!

0cc4m commented Jun 13, 2024

Uh oh!

bandoti commented Jun 24, 2024

Uh oh!

0cc4m commented Jul 30, 2024

Uh oh!

Vulkan generated targets and shader organization #5356

Vulkan generated targets and shader organization #5356

Comments

bandoti commented Feb 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

0cc4m commented Feb 6, 2024

Uh oh!

ggerganov commented Feb 6, 2024

Uh oh!

bandoti commented Feb 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cebtenzzre commented Feb 7, 2024

Uh oh!

pure-water commented Feb 9, 2024

Uh oh!

bandoti commented Feb 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

0cc4m commented Feb 25, 2024

Uh oh!

pure-water commented Feb 26, 2024

Uh oh!

bandoti commented Feb 27, 2024

Uh oh!

pure-water commented Feb 28, 2024

Uh oh!

bandoti commented Mar 4, 2024

Uh oh!

pure-water commented Mar 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 25, 2024

Uh oh!

0cc4m commented Jun 13, 2024

Uh oh!

bandoti commented Jun 24, 2024

Uh oh!

0cc4m commented Jul 30, 2024

Uh oh!

bandoti commented Feb 6, 2024 •

edited

Loading

bandoti commented Feb 7, 2024 •

edited

Loading

bandoti commented Feb 23, 2024 •

edited

Loading

pure-water commented Mar 12, 2024 •

edited

Loading