Skip to content

SCT fails with OSError: [Errno 98] Address already in use  #6345

@ilya-rarov

Description

@ilya-rarov

Issue description

  • This issue is a regression.
  • It is unknown if this issue is a regression.

The artifacts-ubuntu2204-arm-test job failed with the error:

14:36:24  < t:2023-07-09 11:36:24,227 f:test_config.py  l:296  c:sdcm.test_config     p:INFO  > Initializing Argus connection...
14:36:24  < t:2023-07-09 11:36:24,703 f:tester.py       l:390  c:ArtifactsTest        p:INFO  > test_id 707e495b-0757-47c6-966e-4a613fa62c4f already exists in Argus with status: created
14:36:25  < t:2023-07-09 11:36:24,849 f:tester.py       l:405  c:ArtifactsTest        p:INFO  > sct_runner info in Argus TestRun is updated
14:36:26  Process SyncManager-3:
14:36:26  Traceback (most recent call last):
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/process.py", line 315, in _bootstrap
14:36:26      self.run()
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/process.py", line 108, in run
14:36:26      self._target(*self._args, **self._kwargs)
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/managers.py", line 591, in _run_server
14:36:26      server = cls._Server(registry, address, authkey, serializer)
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/managers.py", line 156, in __init__
14:36:26      self.listener = Listener(address=address, backlog=16)
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/connection.py", line 453, in __init__
14:36:26      self._listener = SocketListener(address, family, backlog)
14:36:26    File "/usr/local/lib/python3.10/multiprocessing/connection.py", line 596, in __init__
14:36:26      self._socket.bind(address)
14:36:26  OSError: [Errno 98] Address already in use
14:36:26  
14:36:26  Aborted!

The error happened at the very beginning of the execution of Run SCT Test () stage of the pipeline, when ClusterTester was being initialized - the test didn't really start.

How frequently does it reproduce?

It reproduced in many other artifacts jobs
scylla-master:

scylla-enterprise:

and more

Installation details

Cluster size: 1 nodes (im4gn.xlarge)

Scylla Nodes used in this run:
No resources left at the end of the run

OS / Image: ami-022c8ce295ce9ac4c (aws: undefined_region)

Test: artifacts-ubuntu2204-arm-test
Test id: 707e495b-0757-47c6-966e-4a613fa62c4f
Test name: scylla-master/artifacts/artifacts-ubuntu2204-arm-test
Test config file(s):

Logs and commands
  • Restore Monitor Stack command: $ hydra investigate show-monitor 707e495b-0757-47c6-966e-4a613fa62c4f
  • Restore monitor on AWS instance using Jenkins job
  • Show all stored logs command: $ hydra investigate show-logs 707e495b-0757-47c6-966e-4a613fa62c4f

Logs:

Jenkins job URL
Argus

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions