Description
Despite recently updating our aws-sdk-cpp libraries to ensure we had the fixes from #511 and #520, we are still seeing infrequent deadlocks when transferring to S3. Presumably it is still related to `TransferHandle::WaitUntilFinished()` being called and `m_waitUntilFinishedSignal` never receiving a signal, but I am unable to see how.
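To illustrate the failure mode, here is a minimal, self-contained sketch of a bounded wait on a condition variable, which would turn a missed signal into a detectable timeout rather than an indefinite hang. `FinishedSignal`, `NotifyFinished`, and `WaitFor` are hypothetical names used only for this example; this is not the SDK's implementation, and I am only assuming the real wait follows a similar condition-variable pattern.

```cpp
#include <chrono>
#include <condition_variable>
#include <mutex>

// Hypothetical stand-in for a "transfer finished" signal.
// This is NOT the aws-sdk-cpp class; it only illustrates the pattern.
class FinishedSignal {
public:
    // Record completion under the lock, then wake every waiter.
    void NotifyFinished() {
        {
            std::lock_guard<std::mutex> lock(m_mutex);
            m_finished = true;
        }
        m_cv.notify_all();
    }

    // Wait until completion is recorded or the timeout elapses.
    // Returns true if finished, false if the wait timed out.
    bool WaitFor(std::chrono::milliseconds timeout) {
        std::unique_lock<std::mutex> lock(m_mutex);
        // The predicate guards against spurious wakeups and against a
        // notification that was sent before the wait began.
        return m_cv.wait_for(lock, timeout, [this] { return m_finished; });
    }

private:
    std::mutex m_mutex;
    std::condition_variable m_cv;
    bool m_finished = false;
};
```

With a wait like this, a caller could log the transfer state whenever `WaitFor` returns false, which might help narrow down where the signal is lost without needing to attach a debugger to a hung process.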
We only have a single thread interacting with the AWS libraries, so we are reasonably confident it's not anything specific to our implementation.
It is difficult to debug as we only see this on EC2 instances (I don't know whether this is because we only run things at scale on EC2 or because of the reduced latency). Debugging requires some setup to attach remotely and catch the hang before we terminate the process. If we encounter it again I will try to update this issue with the exact location where it blocks.
For now I just wanted to document the issue in case others have seen it or have ideas where the trouble might be occurring.