Skip to content

Commit 2ab27b6

Browse files
committed
fix: ray head wait-for-workers initContainer should retry if wait fails
1 parent cc25b95 commit 2ab27b6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

guidebooks/ml/ray/start/kubernetes/chart/templates/_head-deployment.tpl

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -91,7 +91,7 @@ spec:
9191
image: bitnami/kubectl
9292
command: ["/bin/sh", "-c", "--"]
9393
args:
94-
- {{ print "echo 'Waiting for workers'; kubectl wait pod -l " (.Values.podTypes.rayWorkerType.selector) " --for=condition=Ready --timeout=-1s" }}
94+
- {{ print "echo 'Waiting for workers'; while true; do kubectl wait pod -l " (.Values.podTypes.rayWorkerType.selector) " --for=condition=Ready --timeout=-1s && break; sleep 1; done" }}
9595
{{- end }}
9696

9797
containers:

0 commit comments

Comments
 (0)