@Despire Despire commented Jan 26, 2026

Aims to resolve the following issues:

Initial work on #150

Closes #1627
Closes #1617
Closes #1316

This PR introduces a refactor that gets rid of the Builder service and adds NATS as a worker queue to distribute messages between the workers (terraformer, ansibler, kube-eleven, kuber).

Further, this PR introduces the initial work on the reconciliation loop which works as follows:

Once a change is detected, the diff is worked out and applied. If applying it fails, claudie rolls back to the original state and then diffs against the desired state again. This is an endless reconciliation loop that always tries to build the desired state, even after failures.

This contrasts with how claudie in master works: after a failure is encountered, it logs the error and does nothing until the manifest is updated again. Usually a retry was enough to resolve such errors, so some users were manually editing nodepool labels to trigger the reconciliation. That workaround is now completely obsolete, as claudie will work it out on its own; it is enough to update the manifest with the desired state.
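The reconciliation loop described above can be sketched roughly as follows. This is only an illustration in Python, not claudie's actual code: the `State` model (nodepool name mapped to node count), the `diff`, `reconcile` and `make_flaky_apply` helpers, and the `max_attempts` bound are all hypothetical. The real loop is described as endless; the bound exists only so the sketch terminates.

```python
from typing import Callable, Dict, Tuple

# Hypothetical state model: nodepool name -> node count.
State = Dict[str, int]


def diff(current: State, desired: State) -> State:
    """Return the nodepools whose count differs, mapped to the desired count (0 means remove)."""
    pools = current.keys() | desired.keys()
    return {
        pool: desired.get(pool, 0)
        for pool in pools
        if current.get(pool, 0) != desired.get(pool, 0)
    }


def reconcile(
    current: State,
    desired: State,
    apply: Callable[[State, State], State],
    max_attempts: int,
) -> Tuple[State, int]:
    """Diff, apply, roll back on failure, then diff again until the states match."""
    attempts = 0
    while attempts < max_attempts:
        changes = diff(current, desired)
        if not changes:
            break  # desired state reached, nothing left to do
        attempts += 1
        snapshot = dict(current)  # remember the state to roll back to
        try:
            current = apply(dict(current), changes)
        except Exception:
            current = snapshot  # roll back; the next iteration diffs again
    return current, attempts


def make_flaky_apply(failures: int) -> Callable[[State, State], State]:
    """Build an apply function that fails `failures` times before succeeding (for demonstration)."""
    remaining = failures

    def apply(state: State, changes: State) -> State:
        nonlocal remaining
        if remaining > 0:
            remaining -= 1
            raise RuntimeError("simulated worker failure")
        for pool, count in changes.items():
            if count == 0:
                state.pop(pool, None)
            else:
                state[pool] = count
        return state

    return apply


if __name__ == "__main__":
    final, attempts = reconcile(
        {"compute": 2},
        {"compute": 3, "control": 1},
        make_flaky_apply(failures=2),
        max_attempts=10,
    )
    print(final, attempts)
```

In this sketch a failed apply never requires a manifest edit to be retried: the loop rolls back to the snapshot and recomputes the diff on the next iteration, which is the behaviour the description contrasts with master.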

@Despire Despire changed the title from "[WIP]: Feat/reconciliation loop" to "Feat/reconciliation loop" Feb 12, 2026
@Despire Despire requested review from JKBGIT1, bernardhalas, m-brando and samuelstolicny and removed request for JKBGIT1 and m-brando February 12, 2026 17:26
Development

Successfully merging this pull request may close these issues.

Bug: Autoscaler stuck
Feature: Retrigger Manifest on error
Feature: Move away from builder TTL