Computation class #5001

fjetter · 2021-06-30T15:10:56Z

This supercedes #4972 and closes #4613

Notable changes to #4972

Source code attached to the computation object is now inferred by walking back the stack. All calls originating from dask and distributed modules are ignored. First frame of a non-ignored module is interpreted as user code and we'll extract that source. The ignore list is currently only dask and distributed but we can add other common libs like xarray, prefect as suggested in #XYZ. I tested this in jupyter notebooks and unit tests and it worked as I think it should. I ended up needing to write special treatment for list/dict comprehension and wouldn't be surprised to see more special cases popping up. we'll see
The max history and ignore modules can now be configured
The module ignore thing uses regular expressions. This may be overkill and I'm not entirely sure if this poses a security issue. Implementation actually felt even simpler compared to a "split / str compare". I figured that people might want to use glob patterns or something more sophisticated than hard coded paths. regex was the "simplest" choice. If that faces resistance, I'll go for a simple str compare (startswith, is in, etc.)
I added a poor mans HTML repr (don't be too critical, I just copied stuff around) simply to get started. I hope that somebody with a better sense for style will iterate on this
I chose to go for a dedicated UUID nevertheless (see Add initial draft of large scale computation class #4972 (comment)) mostly because I believe we need a UUID, especially when connecting to an external service and I couldn't find the UUID @mrocklin was referring to.

distributed/distributed.yaml

fjetter · 2021-06-30T16:13:57Z

Test failures are a lot #4859 and distributed/diagnostics/tests/test_progress.py::test_AllProgress of which I'm pretty sure that it is a known flaky test. No idea why this clusters so strongly around this change.

There is also distributed/tests/test_client_executor.py::test_cancellation which is already marked with a pytest.rerun marker but it still fails.

:(

mrocklin · 2021-07-20T15:01:20Z

Checking in here @fjetter. I know that you've been busy recently with stability issues. This seems like it might be close though?

fjetter · 2021-07-20T15:02:16Z

This seems like it might be close though?

I was hoping this is done. I'll rebase and see what CI thinks about it

Fixes dask#4613

fjetter · 2021-07-20T15:16:02Z

Locally tests look good. If there are no complaints and CI turns out to be green I'll go ahead and merge.

mrocklin · 2021-07-20T15:23:16Z

Thanks @fjetter

fjetter · 2021-07-20T15:27:46Z

ok, py3.7 crashes hard. some cythonization issue but shouldn't be hard to track down

fjetter · 2021-07-20T16:02:56Z

The cythonization problem turns out to be a bit awkward. The Computation._recent as a global class attribute doesn't work. afaik, there is no such thing as a class attribute in cython. Instead, I will probably simply track this in the scheduler itself, after all we will probably not want to track recent computations across multiple schedulers in the same python process.
I might be wrong, distributed users continue to surprise me with creative ways to break stuff :)

fjetter · 2021-07-22T09:45:13Z

Failures appear to be unrelated. I'll go ahead and merge this now

fjetter force-pushed the computation_class branch from f53b309 to dc19a85 Compare June 30, 2021 15:11

mrocklin reviewed Jun 30, 2021

View reviewed changes

distributed/distributed.yaml Show resolved Hide resolved

Add initial draft of large scale computation class

af2545f

Fixes dask#4613

fjetter force-pushed the computation_class branch from 2616001 to af2545f Compare July 20, 2021 15:05

Cythonization fixes

741a648

fjetter force-pushed the computation_class branch from 5a4d418 to 741a648 Compare July 21, 2021 12:14

fjetter merged commit 1275af3 into dask:main Jul 22, 2021

fjetter mentioned this pull request Jul 22, 2021

Add initial draft of large scale computation class #4972

Closed

fjetter deleted the computation_class branch August 10, 2021 09:57

gforsyth mentioned this pull request Aug 17, 2021

KeyError when using distributed scheduler with __array_function__ #5224

Closed

ian-r-rose mentioned this pull request Dec 8, 2021

xarray.DataArray map_blocks failed to deserialize #5504

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Computation class #5001

Computation class #5001

Uh oh!

fjetter commented Jun 30, 2021

Uh oh!

Uh oh!

fjetter commented Jun 30, 2021

Uh oh!

mrocklin commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

mrocklin commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Computation class #5001

Computation class #5001

Uh oh!

Conversation

fjetter commented Jun 30, 2021

Uh oh!

Uh oh!

fjetter commented Jun 30, 2021

Uh oh!

mrocklin commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

mrocklin commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 20, 2021

Uh oh!

fjetter commented Jul 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants