Commit 79d177d
Fix workspace allocation for f8f8bf16_rowwise_batched (#5098)
Summary:
Pull Request resolved: #5098
X-link: https://github.com/facebookresearch/FBGEMM/pull/2105
X-link: https://github.com/meta-pytorch/MSLK/pull/6
This diff updates the workspace allocation for f8f8bf16_rowwise_batched to make sure its on the proper device. Previously, it could default to using device 0 despite other inputs being on a different gpu.
Reviewed By: q10
Differential Revision: D86439655
fbshipit-source-id: c5652c4791b5075103876c8ae76bd65213d6a9cb1 parent 6d51557 commit 79d177d
File tree
1 file changed
+3
-2
lines changed- fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched
1 file changed
+3
-2
lines changedLines changed: 3 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
274 | 274 | | |
275 | 275 | | |
276 | 276 | | |
277 | | - | |
| 277 | + | |
| 278 | + | |
278 | 279 | | |
279 | 280 | | |
280 | 281 | | |
| |||
283 | 284 | | |
284 | 285 | | |
285 | 286 | | |
286 | | - | |
| 287 | + | |
287 | 288 | | |
288 | 289 | | |
289 | 290 | | |
| |||
0 commit comments