-
Notifications
You must be signed in to change notification settings - Fork 29
Fix reduction contig impl offset handling #1458
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix reduction contig impl offset handling #1458
Conversation
View rendered docs @ https://intelpython.github.io/dpctl/pulls/1458/index.html |
Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_68 ran successfully. |
Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_69 ran successfully. |
The optimization should not use max-work-group-size, to allow RT some of the SLM memory.
Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_70 ran successfully. |
This PR modifies computation of typed pointer to take offsets into account in reduction implementations for contiguous inputs.
It also adds tests which would fail before this change