
Conversation

oulgen (Contributor) commented on Jul 19, 2025

Stacked PRs:


[RFC] Implement basic on disk caching

oulgen added a commit that referenced this pull request on Jul 19, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/25 branch from 0c8b535 to bc3438b on July 19, 2025 at 23:40
oulgen force-pushed the oulgen/stack/26 branch from 6895b25 to 544375a on July 19, 2025 at 23:40
oulgen mentioned this pull request on Jul 19, 2025
meta-cla bot added the "CLA Signed" label on Jul 19, 2025
oulgen requested review from drisspg, jansel and yf225 on July 19, 2025 at 23:40
oulgen changed the base branch from oulgen/stack/25 to main on July 19, 2025 at 23:50
oulgen added a commit that referenced this pull request on Jul 19, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 544375a to 37aad8b on July 19, 2025 at 23:50
oulgen changed the base branch from main to oulgen/stack/25 on July 19, 2025 at 23:50
oulgen changed the base branch from oulgen/stack/25 to main on July 20, 2025 at 00:08
oulgen added a commit that referenced this pull request on Jul 20, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 37aad8b to 7bd22d1 on July 20, 2025 at 00:08
oulgen changed the base branch from main to oulgen/stack/25 on July 20, 2025 at 00:08
Comment on lines 34 to 37
helion_key: Hash of source code of Helion
torch_key: Hash of source code of PyTorch
system_hash: Hash of system information,
including Triton, current device, cuda/rocm arch version
Contributor:

Do we want to include these by default?

I was thinking for this use case, where autotuning is very expensive and caching is off by default, it may make sense to default to a more minimal key.

oulgen (Author):

My take is that correctness is of utmost importance, and by default we should have as strict a key as possible: if we are writing to the file system and people later update their Helion/Triton/PyTorch version, we shouldn't give them stale autotuning results.

Having said this, on L26 one of the TODOs is giving users customization over the cache key. With that customization, we can let users choose to omit one or more of these strict, by-default-included components.

What do you think?

Contributor:

I think the baseline here is users copying and pasting the config into their code. This makes the config a BC surface: we need to make sure old configs keep working with new versions of Helion/PyTorch.

I'm a lot more worried about surprise re-autotuning.
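
For concreteness, a minimal sketch of the strict key being discussed (field names follow the docstring quoted above; the composition itself is illustrative, not the PR's exact code):

import dataclasses
import hashlib


@dataclasses.dataclass(frozen=True)
class StrictAutotuneCacheKey:
    """Illustrative: if any field changes, the cache entry is invalidated."""

    specialization_key: str  # hash of input dtypes/shapes
    helion_key: str          # hash of Helion source code
    torch_key: str           # hash of PyTorch source code
    system_hash: str         # Triton version, current device, cuda/rocm arch

    def stable_hash(self) -> str:
        # repr() of a frozen dataclass of strings is deterministic,
        # so its sha256 makes a stable on-disk cache file name.
        return hashlib.sha256(repr(self).encode("utf-8")).hexdigest()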

return hashlib.sha256(repr(self).encode("utf-8")).hexdigest()


class AutotuneCache:
Contributor:

docstring
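
For example, something along these lines (wording is only a suggestion, not the PR's final text):

class AutotuneCache:
    """On-disk cache for autotuned configs.

    Maps a cache key derived from the kernel's source, input
    specialization, and Helion/PyTorch/system hashes to the best
    Config found by the autotuner, so later runs can skip autotuning.
    """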

Comment on lines 69 to 72
for idx, a in enumerate(args):
if isinstance(a, torch.Tensor):
input_dtypes.append((idx, a.dtype))
input_shapes.append((idx, a.shape))
Contributor:

Maybe we should mimic/reuse some of the key generation infra in kernel.py (for the in-memory cache). IMO the memory key should be a subset of the disk key. I think right now the memory key includes some extra stuff.
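
One possible shape for that sharing (hypothetical helper; the real key types in kernel.py may differ):

import torch


def specialization_key(args: tuple[object, ...]) -> tuple[object, ...]:
    """Collect the per-input properties both caches should key on."""
    parts = []
    for idx, a in enumerate(args):
        if isinstance(a, torch.Tensor):
            parts.append((idx, a.dtype, tuple(a.shape)))
    return tuple(parts)


# The in-memory key could be exactly this tuple, while the disk key
# hashes it together with helion_key/torch_key/system_hash, keeping the
# memory key a strict subset of the disk key.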

cache_dir, # pyright: ignore[reportPrivateImportUsage]
)

return Path(cache_dir()) / "helion" / f"{self._get_cache_key()}.best_config"
Contributor:

It might be useful to let users specify a filename for the cache so they can more easily distribute it to production environments.

oulgen (Author):

The TODO on L27 is about importing and exporting a file with all the autotune settings. Do you think that would be good enough, or should I add an env option to control the cache directory?

FWIW we can just do both
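
A sketch of what "both" could look like, with a hypothetical HELION_CACHE_DIR environment variable layered over the default cache location:

import os
from pathlib import Path


def resolve_cache_path(default_base: Path, cache_key: str) -> Path:
    # Hypothetical override: pin the cache location so a pre-tuned
    # cache can be shipped to production machines unchanged.
    base = Path(os.environ.get("HELION_CACHE_DIR", default_base))
    return base / "helion" / f"{cache_key}.best_config"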


def put(self, config: Config) -> None:
path = self._get_local_cache_path()
config.save(path)
Contributor:

Should we do the save-to-a-temp-file-and-rename trick here so the change is atomic? I'm thinking about multi-process races.
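
For reference, the standard pattern the comment is asking about, as a sketch (write to a temp file in the same directory, then os.replace, which is atomic within a single filesystem):

import os
import tempfile
from pathlib import Path


def atomic_write_text(path: Path, data: str) -> None:
    path.parent.mkdir(parents=True, exist_ok=True)
    # The temp file must live in the destination directory so the final
    # os.replace() does not cross filesystems and stays atomic.
    fd, tmp = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(data)
        os.replace(tmp, path)  # readers see either the old or the new file
    except BaseException:
        os.unlink(tmp)
        raise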



class TestCache(TestCase):
@mock.patch("helion.autotuner.DifferentialEvolutionSearch", new=BasicSearch)
Contributor:

Should we add a cleaner way for people to set the autotuner to use?

oulgen (Author):

Yeah, let's define an API that people can set. Out of scope for this PR, but I can look into it. Filed #337.
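
Purely illustrative of what such an API might look like (the name is invented; this is what #337 tracks, not something this PR adds):

from helion.autotuner import BasicSearch

import helion

# Hypothetical setting, instead of
# mock.patch("helion.autotuner.DifferentialEvolutionSearch", new=BasicSearch):
helion.set_default_autotuner(BasicSearch)  # invented name, not a real API yet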

oulgen changed the base branch from oulgen/stack/25 to main on July 25, 2025 at 20:50
oulgen added a commit that referenced this pull request on Jul 25, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 7bd22d1 to 536d1f8 on July 25, 2025 at 20:50
oulgen requested a review from jansel on July 25, 2025 at 20:50
oulgen added a commit that referenced this pull request on Jul 25, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 536d1f8 to 86226a2 on July 25, 2025 at 21:29
oulgen added a commit that referenced this pull request on Jul 25, 2025
stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 86226a2 to 6cc5ef0 on July 25, 2025 at 21:36

@functools.cache
def helion_key() -> str:
from torch._inductor.codecache import build_code_hash
Contributor:

Inductor imports should go at the top of the file, throughout the PR.

def _get_bound_kernel_cache_key(
self, args: tuple[object, ...], signature: tuple[Hashable, ...]
) -> BoundKernelInMemoryCacheKey | None:
from ..autotuner.base_cache import BoundKernelInMemoryCacheKey
Contributor:

import at top of file

oulgen (Author):

This one needs to stay here, similar to DifferentialEvolutionSearch, since importing it in autotuner/__init__.py results in partial initialization.
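
That is, the usual deferred-import workaround for an import cycle; a generic illustration (not Helion's actual modules):

# pkg/kernel.py -- pkg/base_cache.py imports this module too, so the
# import is moved into the function body and resolved at call time,
# after both modules have finished initializing.
def cache_key(args: tuple[object, ...]) -> object:
    from pkg.base_cache import InMemoryCacheKey  # deferred to break the cycle

    return InMemoryCacheKey(args)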

args: tuple[object, ...],
signature: tuple[Hashable, ...],
) -> BoundKernelInMemoryCacheKey:
from ..autotuner.base_cache import BoundKernelInMemoryCacheKey
Contributor:

import at top of file

stack-info: PR: #336, branch: oulgen/stack/26
oulgen force-pushed the oulgen/stack/26 branch from 6cc5ef0 to cf7cc89 on July 26, 2025 at 16:38
oulgen merged commit 9064eda into main on Jul 26, 2025 (7 checks passed)