@pxl-th
Member

@pxl-th pxl-th commented Oct 12, 2024

  • Remove @Const from arguments that are not arrays (e.g. operators, functions, integers).
  • Remove @inbounds from _mapreduce_block! and _reduce_block!, since their kernels already specify inbounds=true at the kernel level.

Also unrelated to this, but are you planning on registering this package?
I wanted to use your sorting kernels for AMDGPU instead of ours, since they are ~4x faster: JuliaGPU/AMDGPU.jl#688

@anicusan
Member

Thank you for going through the codebase to remove the redundant macros - really appreciate it!

Yes, I wanted a bit of time to discuss the interface with the community, add AK to the BuildKite CI, and include some proper Documenter.jl docs.

I'm happy with the ideas from the Discourse and more confident in the library now that I know you tested it, looked around the codebase and felt it's useful - thanks! I'll register the package as soon as I can.

@anicusan anicusan merged commit ea03d16 into JuliaGPU:main Oct 13, 2024
@pxl-th pxl-th deleted the pxl-th/cleanup branch October 13, 2024 02:12
@anicusan
Member

I started the registration :)

@pxl-th
Member Author

pxl-th commented Oct 13, 2024

Thanks!
