Round the epilogue offset to positive values in cunn_SoftMaxForward #2210

xinyazhang · 2025-05-29T17:27:05Z

This fixes OOB memory access for followng code

import torch
qk = torch.randn((9,1017), dtype=torch.float64, device='cuda')
smqk = torch.softmax(qk, dim=-1)

Correctness can be confirmed with:

import torch
import numpy as np
from scipy.special import softmax

qk = torch.randn((9,1017), dtype=torch.float64, device='cuda')
nqk = qk.cpu().numpy()
smqk = torch.softmax(qk, dim=-1)
nsmqk = smqk.cpu().numpy()
smnqk = softmax(nqk, axis=-1)

print(f'{np.allclose(smnqk, nsmqk)}')

This is ported from upstream PR pytorch#154634

pruthvistony · 2025-06-06T05:33:19Z

@xinyazhang ,
Can this PR be closed?

xinyazhang · 2025-06-06T12:31:13Z

Duplicated with #2247

round the epilogue offset to positive values

f471d8d

xinyazhang closed this Jun 6, 2025

xinyazhang deleted the xinyazhang/fixsoftmax-size_9_1017 branch June 6, 2025 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Round the epilogue offset to positive values in cunn_SoftMaxForward #2210

Round the epilogue offset to positive values in cunn_SoftMaxForward #2210

Uh oh!

xinyazhang commented May 29, 2025

Uh oh!

pruthvistony commented Jun 6, 2025

Uh oh!

xinyazhang commented Jun 6, 2025

Uh oh!

Uh oh!

Round the epilogue offset to positive values in cunn_SoftMaxForward #2210

Round the epilogue offset to positive values in cunn_SoftMaxForward #2210

Uh oh!

Conversation

xinyazhang commented May 29, 2025

Uh oh!

pruthvistony commented Jun 6, 2025

Uh oh!

xinyazhang commented Jun 6, 2025

Uh oh!

Uh oh!