OSC shared memory fence segfault #5262
Comments
That looks like a new failure mode. Will take a look tomorrow.
@hjelmn FWIW, the file that fails to open in shmem_mmap_module.c:495 is
@hjelmn This is still appearing on MTT. Any luck tracking down the issue?
Not yet. I haven't been able to recreate the issue. Plan to try again this week.
Per 2018-06-26 webex, please re-create with a debug build and run with
Note: this is happening across the board on master, v3.0, v3.1. 😢
This commit fixes a typo where a bcast is used instead of the intended collective (barrier). References open-mpi#5262 Signed-off-by: Nathan Hjelm <[email protected]>
This commit fixes a typo where a bcast is used instead of the intended collective (barrier). References open-mpi#5262 Signed-off-by: Nathan Hjelm <[email protected]> (cherry picked from commit 4c23068) Signed-off-by: Nathan Hjelm <[email protected]>
When running the onesided/c_strided_getacc_indexed_shared test, an open syscall fails while initializing shared memory with MPI_Win_allocate_shared. Execution continues until the uninitialized shared memory is accessed by a call to MPI_Win_fence, at which point the process segfaults.

The segfault occurs at line 103 of osc_sm_active_target.c, because module->global_state is NULL.

The failing open syscall occurs at line 495 of shmem_mmap_module.c.
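
To illustrate the failure mode described above (an open() failure that goes unreported, leaving a NULL pointer that a later call dereferences), here is a minimal C sketch using plain POSIX calls. It is not Open MPI's code: sketch_module_t, sketch_segment_attach, sketch_fence, and sketch_segment_detach are hypothetical names invented for this example, and the real osc/sm and shmem/mmap components are structured differently. The sketch only shows the general pattern of returning an error from the setup step and checking the pointer before use, so that a failed open surfaces as an error code rather than a segfault.

```c
#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical stand-in for a shared-memory window module (not Open MPI's type). */
typedef struct {
    int   *global_state;  /* NULL until the backing segment is mapped */
    size_t size;          /* size of the mapping in bytes */
} sketch_module_t;

/* Create and map a backing file. Returns 0 on success, -1 on any failure.
 * On failure global_state is left NULL and the caller is told about it. */
int sketch_segment_attach(sketch_module_t *module, const char *path, size_t size)
{
    assert(module != NULL);
    module->global_state = NULL;
    module->size = 0;

    int fd = open(path, O_CREAT | O_RDWR, 0600);
    if (fd < 0) {
        return -1;  /* propagate the open() failure instead of continuing */
    }
    if (ftruncate(fd, (off_t)size) != 0) {
        close(fd);
        return -1;
    }

    void *addr = mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    close(fd);  /* the mapping stays valid after the descriptor is closed */
    if (addr == MAP_FAILED) {
        return -1;
    }

    module->global_state = addr;
    module->size = size;
    return 0;
}

/* Hypothetical fence-like operation that touches the shared state.
 * Returns -1 instead of dereferencing a NULL pointer. */
int sketch_fence(sketch_module_t *module)
{
    assert(module != NULL);
    if (module->global_state == NULL) {
        return -1;  /* setup never succeeded; report it rather than crash */
    }
    module->global_state[0] += 1;
    return 0;
}

/* Unmap the segment, if one was mapped. */
void sketch_segment_detach(sketch_module_t *module)
{
    assert(module != NULL);
    if (module->global_state != NULL) {
        munmap(module->global_state, module->size);
        module->global_state = NULL;
        module->size = 0;
    }
}
```

A caller would check the return value of sketch_segment_attach and skip or abort the window setup on -1. With that check in place, an unwritable or missing backing-file directory produces an error at allocation time instead of a crash in the first synchronization call.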
The stdout/stderr can be seen here:
https://mtt.open-mpi.org/index.php?do_redir=2633
@hjelmn I believe this is up your alley