-
-
Notifications
You must be signed in to change notification settings - Fork 741
Closed
Description
I don't have a repro unfortunately, but my question is limited to how to investigate problems.
I have a job that frequently will cause the scheduler to start accumulating memory, eventually to 400GB of memory and then failing (the scheduler machine has that much memory).
I think I've read the docs thoroughly, but can't find any reason the scheduler should do this beyond having large values in tasks. I'm fairly confident that the client is not submitting any large values in tasks — it starts with bag.from_sequence(date_list)
and has a series of .map
calls.
Are there other reasons? How could I see what the scheduler is holding?
Thanks in advance!
Metadata
Metadata
Assignees
Labels
No labels