
Conversation

@nico-martin
Collaborator

This adds caching of the wasm binary.
I also added an env.cacheKey so developers can modify the cache key. By default it will be transformers-cache, but they are free to set something related to their app.

Note: this will only cache the wasm file, so there will still be a request to https://cdn.jsdelivr.net/ for the mjs file. In my opinion, this should be cached via the service worker if the application requires full offline support.
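The flow described above could be sketched as follows. This is a hedged illustration, not the PR's actual code: `fetchWasm`, `openCache`, and the `MemoryCache` stand-in are hypothetical names, and a Map-backed cache replaces the browser's `caches.open` so the sketch runs outside a browser.

```javascript
const env = { cacheKey: 'transformers-cache' }; // apps may override this

// Minimal stand-in for a browser Cache object (match/put only).
// Note: with real Response objects you would `put(url, response.clone())`.
class MemoryCache {
    #store = new Map();
    async match(url) { return this.#store.get(url); } // undefined on a miss
    async put(url, response) { this.#store.set(url, response); }
}

const openedCaches = new Map();
async function openCache(key) {
    // In a browser this would be `await caches.open(key)`.
    if (!openedCaches.has(key)) openedCaches.set(key, new MemoryCache());
    return openedCaches.get(key);
}

// Fetch the wasm binary once, store it under env.cacheKey, and serve it
// from the cache on subsequent loads. Cache failures only warn; the
// network request still goes through.
async function fetchWasm(url, fetchImpl) {
    let cache;
    try {
        cache = await openCache(env.cacheKey);
        const hit = await cache.match(url);
        if (hit !== undefined) return hit; // only short-circuit on a real hit
    } catch (e) {
        console.warn(`Error reading ${url} from cache:`, e);
    }
    const response = await fetchImpl(url);
    try {
        await cache?.put(url, response);
    } catch (e) {
        console.warn(`Error writing ${url} to cache:`, e);
    }
    return response;
}
```

With this shape, a second call for the same URL returns the cached response without touching the network.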

@nico-martin nico-martin requested a review from xenova December 1, 2025 14:30
@xenova
Collaborator

xenova commented Dec 1, 2025

So there will still be a request to https://cdn.jsdelivr.net/ for the mjs file. In my opinion, this should be cached via the service worker if the application requires full offline support.

Indeed, caching the mjs file would be necessary to ensure full offline support, and imo is essential before adding a caching feature like this PR proposes. Any ideas for how we could do this? Perhaps bundling ort-wasm-simd-threaded.jsep.mjs with the transformers.js file?

@nico-martin nico-martin changed the title from "added wasm cache" to "[v4] added wasm cache" Dec 2, 2025
Collaborator

@xenova xenova left a comment


Very nice + clean PR! Now we just need to figure out how to completely remove the ort-wasm-simd-threaded.jsep.mjs dependency.

@nico-martin
Copy link
Collaborator Author

So after another deep-dive into onnxruntime I figured out that it's actually no problem at all to load the wasm factory (.mjs) as a blob, which allows us to load it from cache.
The only problem is that inside the factory there are URLs that try to resolve relative to "import.meta.url". If we just replace this with the actual baseURL it works just fine.
https://github.com/huggingface/transformers.js/blob/v4-cache-wasm-file/src/backends/utils/cacheWasm.js#L76
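The patching step could look roughly like this. `patchModuleSource` is a hypothetical helper for illustration, not the actual code in cacheWasm.js: it rewrites every `import.meta.url` in the fetched module source to a string literal, since a module imported via a blob: URL can no longer resolve relative paths from its own `import.meta.url`.

```javascript
// Replace every occurrence of `import.meta.url` in the module source with
// a string literal holding the real base URL. JSON.stringify adds the
// quotes and escapes any special characters in the URL.
function patchModuleSource(source, baseURL) {
    return source.replaceAll('import.meta.url', JSON.stringify(baseURL));
}

// In a browser, the patched source could then be imported like this:
// const blob = new Blob([patched], { type: 'text/javascript' });
// const factory = await import(URL.createObjectURL(blob));
```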

On top of that I also did some refactoring of the hub.js. My goal is to keep large files that only have a handful of exported methods as clean as possible by extracting some helper functions and constants into separate files.

I also wanted to improve the caching (which is now used not only in the hub.js but also in the backends/onnx.js), so I created a helper function and an interface "CacheInterface" that any given cache has to implement.
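A minimal sketch of what such an interface shape could look like, assuming the method names mirror the browser Cache API (so that `await caches.open(key)` satisfies it directly); the exact shape in the PR may differ:

```javascript
/**
 * Hypothetical CacheInterface shape: any cache backend just needs
 * asynchronous `match` and `put`.
 * @typedef {Object} CacheInterface
 * @property {(key: string) => Promise<any>} match  resolves to undefined on a miss
 * @property {(key: string, value: any) => Promise<void>} put
 */

/** An in-memory backend implementing CacheInterface, e.g. for tests. */
class InMemoryCache {
    #store = new Map();
    async match(key) { return this.#store.get(key); }
    async put(key, value) { this.#store.set(key, value); }
}
```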

Collaborator

@xenova xenova left a comment


Very worthwhile refactor -- thanks!


I wonder if you think the following feature could be useful: design some form of CacheRegistry class which a user could import from the library... like

import { cache } from '@huggingface/transformers';

// check if model is cached
// cache.match('org/model') or something -- API should be well-designed, returning a list/map of files that are cached for this model maybe?
// cache.delete('org/model') -- remove all files cached for this model

I think we can draw inspiration from the hf cache CLI tool:

hf cache --help
Usage: hf cache [OPTIONS] COMMAND [ARGS]...

  Manage local cache directory.

Options:
  --help  Show this message and exit.

Commands:
  ls      List cached repositories or revisions.
  prune   Remove detached revisions from the cache.
  rm      Remove cached repositories or revisions.
  verify  Verify checksums for a single repo revision from cache or a...

may not need to be added in this PR, but maybe something to discuss here.
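The registry idea above could be sketched like this. Everything here is an assumption for discussion, not an API this PR adds: names (`CacheRegistry`, `add`) and behavior (`match` returning the cached file paths for a model, `delete` removing them all) are hypothetical, backed by a plain Map keyed by file path.

```javascript
class CacheRegistry {
    #store = new Map(); // file path -> cached payload

    add(path, payload) { this.#store.set(path, payload); }

    /** List cached file paths belonging to a model id like 'org/model'. */
    match(modelId) {
        return [...this.#store.keys()].filter((p) => p.startsWith(modelId + '/'));
    }

    /** Remove every file cached for a model; returns how many were removed. */
    delete(modelId) {
        const files = this.match(modelId);
        for (const f of files) this.#store.delete(f);
        return files.length;
    }
}
```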

@nico-martin
Copy link
Collaborator Author

I like the CacheRegistry! The only problem is that we normally don't know upfront all the files a model will load/expect (although I think it would be great to add that as well).
I think that opens up a completely new topic. I will keep this on my radar but not implement it here in this PR.

@nico-martin nico-martin requested a review from xenova December 15, 2025 09:13
Comment on lines 14 to 18
if (cache) {
try {
return await cache.match(url);
} catch (e) {
console.warn(`Error reading ${fileName} from cache:`, e);
Collaborator


If await cache.match(url) returns undefined (i.e., the file is not in the cache), then we return undefined from this function... and the fetch below is never called (meaning the file is never cached).

Suggested change
-    if (cache) {
-        try {
-            return await cache.match(url);
-        } catch (e) {
-            console.warn(`Error reading ${fileName} from cache:`, e);
+    if (cache) {
+        try {
+            const result = await cache.match(url);
+            if (result) {
+                return result;
+            }
+        } catch (e) {
+            console.warn(`Error reading ${fileName} from cache:`, e);

seems to fix it

Collaborator


made this change.

xenova and others added 3 commits December 16, 2025 00:21
Don't throw an error if we can't open the cache or load the file from the cache, as long as we are still able to make the request.
Copy link
Collaborator

@xenova xenova left a comment


Thanks! 🚀 Had time today to finish the review, and it works well! I had to make one small adjustment (only return when await cache.match(url) matches... not when undefined), and it's good to go :)

I also like the refactor out of the monolithic hub.js file.

I tested it on the recent chatterbox webgpu demo, and it now runs fully offline thanks to this PR! 🔥

@xenova xenova merged commit 0082c20 into v4 Dec 16, 2025
1 check failed
@xenova xenova deleted the v4-cache-wasm-file branch December 16, 2025 05:38