Skip to content

V5.0.x OSC/UCX: Adding the following optimizations (nonblocking accumulate and reusing resources) #11025

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

MamziB
Copy link
Contributor

@MamziB MamziB commented Nov 2, 2022

v5.0.x OSC/UCX: Adding the following optimizations: 1) Reuse the same worker/eps in
single threaded applications, this is helpful if an application
creates many windows, therefore, we avoid the unnecessary overheads  and 2) adding the truly nonblocking
MPI_Accumulate/Get_Accumulate.

Signed-off-by: Mamzi Bayatpour [email protected]
Co-authored-by: Tomislav Janjusic [email protected]
Co-authored-by: Joseph Schuchart [email protected]>
(cherry-picked from commit 1ea6fb9)

…eps in

single threaded applications, this is helpful if an application
creates many windows, therefore, we avoid the unnecessary overheads  and 2) adding the truely nonblocking
MPI_Accumulate/Get_Accumulate.

Signed-off-by: Mamzi Bayatpour  <[email protected]>
Co-authored-by: Tomislav Janjusic <[email protected]>
Co-authored-by: Joseph Schuchart <[email protected]>>
(cherry picked from commit 1ea6fb9)
@awlauria awlauria merged commit a33ec38 into open-mpi:v5.0.x Nov 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants