What's the best way of diagnosing scheduler memory issues?

I don't have a repro unfortunately, but my question is limited to how to investigate problems.

I have a job that frequently will cause the scheduler to start accumulating memory, eventually to 400GB of memory and then failing (the scheduler machine has that much memory).

I think I've read the docs thoroughly, but can't find any reason the scheduler should do this beyond having large values in tasks. I'm fairly confident that the client is not submitting any large values in tasks — it starts with `bag.from_sequence(date_list)` and has a series of `.map` calls. 

Are there other reasons? How could I see what the scheduler is holding?

Thanks in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

What's the best way of diagnosing scheduler memory issues? #4998

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

What's the best way of diagnosing scheduler memory issues? #4998

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions