In order to work around a bug in older gcc versions on x86_64,
__atomic_thread_fence (__ATOMIC_SEQ_CST)
was replaced with
__atomic_thread_fence (__ATOMIC_ACQUIRE)
based on the assumption that this did not introduce performance regressions.
It was recently found that this did introduce a performance regression,
mainly at scale on fat nodes.
So simply use an asm memory clobber to both work around the older gcc bug
and fix the performance regression.
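
For illustration, a minimal sketch of what such a barrier can look like.
This is not the code from this commit: the helper names
read_barrier_sketch and load_after_barrier are hypothetical, and the
__x86_64__ check is an assumption about how the x86_64 case is selected.
The empty asm statement emits no instruction; its "memory" clobber only
tells the compiler not to reorder or cache memory accesses across it.

```c
#include <assert.h>

/* Hypothetical helper name, not taken from the commit. */
static inline void read_barrier_sketch(void)
{
#if defined(__x86_64__)
    /* Compiler-only barrier: no instruction is emitted, but the
     * "memory" clobber forbids the compiler from moving or caching
     * memory accesses across this point. */
    __asm__ __volatile__("" : : : "memory");
#else
    /* Assumed fallback for other architectures. */
    __atomic_thread_fence(__ATOMIC_ACQUIRE);
#endif
}

/* Hypothetical consumer: read the flag, then the barrier, then the data.
 * Returns -1 when the flag is not set yet. */
static int load_after_barrier(const int *flag, const int *data)
{
    if (*flag == 0) {
        return -1;
    }
    read_barrier_sketch();
    return *data;
}
```

The sketch only demonstrates the barrier's placement between the flag
read and the data read; it does not reproduce the gcc bug or measure
the regression.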
Thanks S. Biplab Raut for bringing this issue to our attention.
Refs. #8603
Signed-off-by: Gilles Gouaillardet <[email protected]>