Process deadlock while waiting for s3 file transfer #545

@ferrouswheel

Despite recently updating our aws-sdk-cpp libs to make sure we had the fixes from #511 and #520, we are still seeing infrequent deadlocks when transferring to S3. Presumably it is still related to TransferHandle::WaitUntilFinished() being called and m_waitUntilFinishedSignal never being signaled, but I am unable to see how.
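
To make the suspected failure mode concrete, here is a minimal sketch using only the C++ standard library. It is not the SDK's implementation: `DemoHandle`, `Finish`, `WaitFor`, and `RunDemo` are hypothetical names invented for illustration. It shows a wait guarded by both a predicate and a timeout, so that a completion signal which never arrives surfaces as a timeout the caller can log, rather than as a thread blocked forever.

```cpp
#include <chrono>
#include <condition_variable>
#include <iostream>
#include <mutex>

// Hypothetical stand-in for a transfer handle; NOT the aws-sdk-cpp class.
class DemoHandle {
public:
    // Marks the transfer as finished and wakes any waiting thread.
    void Finish() {
        {
            std::lock_guard<std::mutex> lock(m_mutex);
            m_finished = true;
        }
        m_signal.notify_all();
    }

    // Waits up to `timeout`. Returns true if finished, false on timeout.
    // The predicate re-checks the flag under the lock, so a notification
    // sent before the wait started is not lost.
    bool WaitFor(std::chrono::milliseconds timeout) {
        std::unique_lock<std::mutex> lock(m_mutex);
        return m_signal.wait_for(lock, timeout, [this] { return m_finished; });
    }

private:
    std::mutex m_mutex;
    std::condition_variable m_signal;
    bool m_finished = false;
};

// Hypothetical demo: one handle that completes, one that is never signaled.
void RunDemo() {
    DemoHandle done;
    done.Finish();  // completion happens before anyone waits
    std::cout << (done.WaitFor(std::chrono::milliseconds(50)) ? "finished" : "timed out") << "\n";

    DemoHandle stuck;  // nothing ever calls Finish() on this one
    std::cout << (stuck.WaitFor(std::chrono::milliseconds(50)) ? "finished" : "timed out") << "\n";
}
```

Calling `RunDemo()` from `main` reports the first handle as finished and the second as timed out. In a real application, the timeout branch would be a natural place to log the handle's state before deciding whether to keep waiting, which may help narrow down where the signal goes missing.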

We only have a single thread interacting with the AWS libraries, so we are reasonably confident it's not anything specific to our implementation.

It is difficult to debug as we only see this on EC2 instances (whether this is because we only run things at scale on EC2 or because of the reduced latency, I don't know). Debugging requires some setup to remote attach and catch the process before we terminate it. If we encounter it again, I will try to update this issue with the exact location where it blocks.

For now I just wanted to document the issue in case others have seen it or have ideas where the trouble might be occurring.

Labels: bug
