-
Notifications
You must be signed in to change notification settings - Fork 897
RFC: ompi_mpi_params.c: set mpi_add_procs_cutoff default to 0 #1340
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: ompi_mpi_params.c: set mpi_add_procs_cutoff default to 0 #1340
Conversation
Test FAILed. |
1 similar comment
Test FAILed. |
I agree. Setting to 1024 by default is a recipe for not getting tested often. Von meinem iPhone gesendet
|
Test FAILed. |
(void) mca_base_var_register ("ompi", "mpi", NULL, "add_procs_cutoff", | ||
"Maximum world size for pre-allocating resources for all " | ||
"remote processes. Increasing this limit may improve " | ||
"communication performance at the cost of memory usage " | ||
"(default: 1024)", MCA_BASE_VAR_TYPE_UNSIGNED_INT, NULL, | ||
"communication performance at the cost of memory usage" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jsquyres you missed a ,
at the end of this line
@jsquyres Only quibble I have is stating that it upping the value might impact communication performance. I don't think that is true - it might impact the latency of the first message, but it will have no impact on performance. |
ac450cf
to
cd5bad8
Compare
@hjelmn Per our IM discussion yesterday, can you share your performance testing results of setting |
btw, what matters more to get the best performance
|
Launch performance is more impacted when used in conjunction with the direct modex param. Memory footprint is the primary target for this param when used by itself. So if we want to show a major improvement in launch performance, we'd need to turn both params "on". Not sure if the direct modex one has actually been tested enough for that step. Maybe we should turn it "on" by default in master for awhile? |
can you remind me what direct modex is ?
|
Sorry - yes, it is the async_modex parameter (which results in the PMIx On Thu, Feb 4, 2016 at 3:59 AM, Gilles Gouaillardet <
|
@@ -263,12 +263,11 @@ int ompi_mpi_register_params(void) | |||
ompi_rte_abort(1, NULL); | |||
} | |||
|
|||
ompi_add_procs_cutoff = 1024; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@hjelmn I'm not sure why you say that this extra assignment is necessary (in jsquyres@cd5bad8#commitcomment-15887486). Even if we come through registration a 2nd time, we don't want to reset the default if the user already set a different value. What am I missing here?
Regardless, we should either be doing this extra assignment for all MCA params, or no MCA params -- if there's a reason to do the extra assignment for this MCA param, that same reason should apply to all MCA params, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should set the variable back to the default. That is the way I have done it pretty much everywhere throughout the code base. See the other registrations in this same function.
If we come through registration twice it means we finalized the project or component. I don't think we should use a possibly stale value.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I should note this is the default behavior of dynamically loaded components. If the component is unloaded all its variables will automatically be reset on the next dlopen. I was trying to be consistent.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@hjelmn and I chatted on the phone -- he filled me in on what I was missing: there's a possible inconsistency in behavior with a program that does something like this:
MPI_T_Init(...);
MPI_T_Cvar_read(..., &value_pre);
MPI_T_Cvar_write(...);
MPI_T_Finalize(...);
MPI_Init(...)
MPI_T_Init(...);
MPI_T_Cvar_read(..., &value_post);
assert(value_pre == value_post);
When the cvar is part of a dynamically-loaded component, the assert
will be true. If we remove the additional assignment, when the cvar is part of a statically-loaded component, the assert
will be false. Nathan's second assignment ensures that the assert
will be true in both cases.
This is actually a larger question for the MPI Tools Working Group in the MPI Forum: what consistency guarantees -- if any -- are provided by the MPI_T API when MPI_T and/or MPI is finalized?
For this PR, I'll remove my deletion of the 2nd assignment, and we'll leave OMPI's behavior in this area as it was. Nathan and I will bring up this what-does-MPI_T-guarantee issue with the Tools WG separately.
cd5bad8
to
ae254c1
Compare
Decrease the default value of the "mpi_add_procs_cutoff" MCA param from 1024 to 0.
ae254c1
to
902b477
Compare
Per Feb 9 2016 webex, we decided to set the value to 0 on the rationale that dynamic add procs stuff is not used for processes that are on the same server -- so therefore 0 is not a whole lot different than 32. So this PR is now updated to set the default add procs cutoff to 0. |
RFC: ompi_mpi_params.c: set mpi_add_procs_cutoff default to 0
Correct the binding algorithm to decouple it from oversubscribe.
Decrease the default value of the "mpi_add_procs_cutoff" MCA param from 1024 to 32.
@hppritcha Setting the milestone to v2.0.0 because the "partial add procs" behavior was supposed to be a key feature of v2.0.0. Setting that to only happen at 1024 procs seems a little high. Do you agree?
RFC for the rest of the community: does anyone have a concern about setting this value to 32?