I convert the CUDA simpleP2P example to HIP. However, the HIP program does not pass the verification. Could you reproduce the issue ? Thanks.
The HIP example is here: https://github.com/zjin-lcf/HeCBench/blob/master/p2p-hip/main.cu
Environment: ROCM 4.5.2 and MI100