Skip to content

Pull requests: pytorch/helion

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix no libdw.so issue on AMD CI CLA Signed This label is managed by the Meta Open Source bot.
#1107 opened Nov 8, 2025 by yf225 Loading…
Add distributed CI job (4xH100) and example unit tests CLA Signed This label is managed by the Meta Open Source bot.
#1106 opened Nov 8, 2025 by yf225 Loading…
Use CPU machine for triton-cpu CLA Signed This label is managed by the Meta Open Source bot.
#1105 opened Nov 8, 2025 by oulgen Loading…
Fixes in helion puzzles CLA Signed This label is managed by the Meta Open Source bot.
#1104 opened Nov 8, 2025 by Athe-kunal Loading…
Add DE-Surrogate hybrid autotuner algorithm + early stopping option for DE and DE-Surrogate CLA Signed This label is managed by the Meta Open Source bot.
#1096 opened Nov 7, 2025 by FranciscoThiesen Loading…
[Autotuner] Use cudagraph for time measurement on Nvidia hardware CLA Signed This label is managed by the Meta Open Source bot.
#1089 opened Nov 6, 2025 by yf225 Loading…
Add OpenEvolve-based Autotuner for Helion GPU Kernels CLA Signed This label is managed by the Meta Open Source bot.
#1082 opened Nov 4, 2025 by mycpuorg Draft
Added dynamic-shape 0/1 bucketing: "zero_nonzero" env var CLA Signed This label is managed by the Meta Open Source bot.
#1053 opened Oct 30, 2025 by Itssshikhar Loading…
Use matmul fwd direclty in autograd for performance CLA Signed This label is managed by the Meta Open Source bot.
#1045 opened Oct 28, 2025 by tianrengao Loading…
Disallow both interpret modes active CLA Signed This label is managed by the Meta Open Source bot.
#1023 opened Oct 25, 2025 by yf225 Draft
blackwell attn with acc and stock triton support CLA Signed This label is managed by the Meta Open Source bot.
#996 opened Oct 20, 2025 by v0i0 Draft
Add simplified se_block kernel CLA Signed This label is managed by the Meta Open Source bot. fb-exported meta-exported
#989 opened Oct 20, 2025 by mengluy0125 Loading…
scripts for blackwell attn measurement CLA Signed This label is managed by the Meta Open Source bot.
#977 opened Oct 16, 2025 by v0i0 Loading…
Add epilogue subtiling CLA Signed This label is managed by the Meta Open Source bot.
#948 opened Oct 15, 2025 by PaulZhang12 Loading…
Validate user provided settings CLA Signed This label is managed by the Meta Open Source bot.
#947 opened Oct 15, 2025 by choijon5 Loading…
Use helion.cdiv CLA Signed This label is managed by the Meta Open Source bot.
#852 opened Oct 8, 2025 by oulgen Loading…
Upgrade benchmarks to cuda13 CLA Signed This label is managed by the Meta Open Source bot.
#826 opened Oct 7, 2025 by oulgen Loading…
Upgrade to cuda 13 CLA Signed This label is managed by the Meta Open Source bot.
#825 opened Oct 7, 2025 by oulgen Loading…
[Benchmark] template attention kernel and test CLA Signed This label is managed by the Meta Open Source bot.
#824 opened Oct 7, 2025 by Sibylau Loading…
add KL divergence backward helion kernel [attempt 2] CLA Signed This label is managed by the Meta Open Source bot.
#805 opened Oct 3, 2025 by williamwen42 Loading…
Add advanced compiler configurations CLA Signed This label is managed by the Meta Open Source bot.
#793 opened Oct 3, 2025 by jansel Draft
[example] flex attention CLA Signed This label is managed by the Meta Open Source bot.
#764 opened Oct 1, 2025 by v0i0 Loading…
[RFC] Add support for device for loop indexing CLA Signed This label is managed by the Meta Open Source bot.
#673 opened Sep 24, 2025 by oulgen Draft
[WIP] fp8_gemm_rowwise_grouped kernel CLA Signed This label is managed by the Meta Open Source bot.
#627 opened Sep 18, 2025 by yf225 Draft
[DO NOT MERGE]Make Puzzles Executable CLA Signed This label is managed by the Meta Open Source bot.
#605 opened Sep 17, 2025 by sekyondaMeta Draft
ProTip! Follow long discussions with comments:>50.