-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Description
Description
Starting around 2021-09-13 11:30 am PST, the rust-lang project has noticed a large failure rate on our Windows runners. We are experiencing sporadic errors reading, copying, and creating executables with various errors related to file locks. Some examples are:
- OS error 32 (ERROR_SHARING_VIOLATION) copying an executable.
- "LNK1104: cannot open file" running msvc linker.
- "rm: cannot remove '...some.exe ': Device or resource busy"
We are also getting reports from other projects experiencing similar errors.
I wanted to check if there were perhaps any unannounced changes to windows-latest-xl
, or if perhaps there are any new scanning features running (windows defender, indexing service, etc.).
More information may be found at rust-lang/rust#88924 with links to failed runs, or most of the runs at https://github.com/rust-lang-ci/rust/actions are currently failing due to this error.
Virtual environments affected
- Ubuntu 16.04
- Ubuntu 18.04
- Ubuntu 20.04
- macOS 10.15
- macOS 11
- Windows Server 2016
- Windows Server 2019
- Windows Server 2022
Image version and build link
Links to failed builds:
https://github.com/rust-lang-ci/rust/runs/3591374923
https://github.com/rust-lang-ci/rust/runs/3591788935
https://github.com/rust-lang-ci/rust/runs/3593656760
https://github.com/rust-lang-ci/rust/runs/3594233900
https://github.com/rust-lang-ci/rust/runs/3594557656
https://github.com/rust-lang-ci/rust/runs/3592301669
https://github.com/rust-lang-ci/rust/runs/3600286433
https://github.com/Lokathor/wide/runs/3592322856
https://github.com/PyO3/pyo3/runs/3601829130
https://github.com/PyO3/pyo3/runs/3590459443
Is it regression?
No response
Expected behavior
Windows runners shouldn't have any services or issues locking files during a build.
Actual behavior
Windows runners are experiencing a high error rate with file locks related to executables being created or copied or removed.
Repro steps
Reproduction may be difficult since it is part of a large build system, and the errors are happening somewhat randomly, and it is not known what is causing them. But, roughly, most of the errors seem related to creating a new executable, and then immediately trying to copy it to a new location. Or, we are also seeing errors with link.exe failing to read files.