Separated context and state for easier parallelization. #494


Closed

Conversation


@sandrohanea sandrohanea commented Feb 12, 2023

#475

  • Separated the state of the transformation into its own struct, with tests.
  • Fixed examples (stream, command, talk, main): since these do not use the callbacks when parsing the results, they need to call whisper_full_with_state instead of whisper_full so they can read the results from the state after the transformation finishes.
  • Fixed the rest of the examples.
  • Fixed the bindings and their examples.

Note:
For the bindings to use the same context with multiple different transformations in parallel (one state per transformation), some adjustments are still needed, but the default cases already work.

std::vector<float> probs;
std::vector<float> logits;
std::vector<float> logprobs;
std::vector<float> probs{};
@sandrohanea (Contributor Author) commented:

Fought with a read-access-denied exception for almost 2 hours, only to find out that it's a bug in the compiler (debug builds only): microsoft/STL#1934

The workaround is just to initialize the struct (otherwise, it's pretty hard to debug).

@sandrohanea sandrohanea changed the title [WIP] Separated context and state for easier parallelization. Separated context and state for easier parallelization. Feb 13, 2023
sandrohanea commented Feb 13, 2023

@ggerganov please take a look when you have some time.

It will be pretty hard to keep this branch up to date and resolve conflicts if other changes land.

@@ -10,20 +10,20 @@

constexpr int N_THREAD = 8;

// TODO: get rid of this vector of contexts - bad idea in the first place
@sandrohanea (Contributor Author) commented Feb 13, 2023:
Fixed all these TODOs: only one context is needed now, plus a vector of states.

@RndyP

RndyP commented Feb 13, 2023

I've been following your work here. Not sure if this is related, but I profiled Whisper under the Visual C++ profiling tool, and there is a section of code in ggml.c using a spinlock that takes 10% of CPU time. Shouldn't this be using a mutex?
[profiler screenshot: spin-wait loop in ggml.c]

@sandrohanea

Hello @RndyP,
Indeed, it looks like that code can be optimized, but it's not in the scope of this PR. This PR is meant to split whisper_context into whisper_context + whisper_state so that whisper_context can be reused and is thread-safe.

About that spinlock, I agree it would be nice to change it. Could you maybe create another issue about your findings?

@RndyP

RndyP commented Feb 13, 2023

I reported this in issue #300, which has fallen down the list. I don't understand the code enough to suggest a fix myself.
In MSVC, atomic_load() looks like this:

static LONG atomic_load(atomic_int * ptr) {
    return InterlockedCompareExchange(ptr, 0, 0);
}

I believe InterlockedCompareExchange() is simply locking the data values, and the while() is simply spinning and waiting for the value to change.

@ggerganov (Member) commented:

@sandrohanea
Thanks for this nice work! I will need some time to think about this change before merging. I kind of want to keep the existing API the same if possible and add extra functions for your use case, but not sure if this is better compared to what you have proposed here.

Don't worry about keeping up-to-date - I will be able to do that if necessary.

@sandrohanea

Thanks a lot for taking the time, it totally makes sense.

Also, thanks again for creating the whole library in the first place, really titanic work to port all tensor operations and everything.

I was also thinking about keeping a "default state" in the context and using it when no other state is provided; that way the change wouldn't break existing functionality.

On the other hand, a context that is thread-safe only "sometimes" is really easy to misuse.

It's your call on this one. If you have any idea how this could be done better, please, let me know and I'll be happy to help.

@sandrohanea

Closing this, as the alternative with an opt-in state was merged.

@sandrohanea sandrohanea closed this Mar 6, 2023
@sandrohanea sandrohanea deleted the feature/separateCtxAndState branch March 6, 2023 09:32