Skip to content

Azure DevOps: Source controller getting stuck  #402

@mfamador

Description

@mfamador

Hello.

We have 3 AKS clusters, all running the exact same versions of flux (0.16.1) in two different Azure regions (North Europe and East US).

The source-controller version is 0.15.3.

❯ k describe deploy source-controller -n flux-system --context aks-stag-eun | grep -i image
    Image:       ghcr.io/fluxcd/source-controller:v0.15.3
❯ k describe deploy source-controller -n flux-system --context aks-stag-ue | grep -i image
    Image:       ghcr.io/fluxcd/source-controller:v0.15.3

Both clusters are synching with the same Azure DevOps git repositories (gitImplementation: libgit2).

Everything is working great on East US clusters but in North Europe source-controller gets stuck multiple times a day and only killing it seems to make the sources to reconcile again (we've created a cronjob to restart source-controllerevery half a hour).

Even restarting every half a hour we're still getting a lot of gaps where there's no source reconciliation.

Screenshot 2021-07-20 at 09 46 51

In this state, any manual reconciliation also gets stuck and never finishes:

>  flux reconcile source git core -n core --context aks-stag-eun

► annotating GitRepository core in core namespace
✔ GitRepository annotated
◎ waiting for GitRepository reconciliation

There's no logs on source-controller when it's in this lock state ...

I'm pretty sure it's a connectivity problem to Azure DevOps or something not directly related to source-controller, but maybe it should recover or timeout from whatever it's trying to do (?)

I've also increased concurrent from the default 2 to 6 but it seems to not be doing anything differently.

Thanks!

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

Status

Done

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions