fix(daemon): exit process when cluster completes #237

EivMeyer · 2026-01-30T12:36:52Z

Problem

Zombie zeroshot processes remained after heroshot shipped PRs, blocking new agent spawns.

Root Cause

When CLUSTER_COMPLETE triggers orchestrator.stop(), the cluster state changes to 'stopped' but no SIGTERM is sent. The cleanup handlers registered via process.on('SIGTERM') are never triggered.

Fix

Added polling in setupDaemonCleanup() to detect when cluster state changes to 'stopped' or 'completed', then exit the process.

Testing

The fix was triggered by a heroshot run where 3 zombie processes from already-shipped items (#1159, #1168) were blocking new spawns.

Release dev to main --------- Co-authored-by: Eivind Meyer <eiv.meyer@gmail.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: Eivind Meyer <eivind.meyer@ksat.no> Co-authored-by: Michael Eichelbeck <141341133+mkceichelbeck@users.noreply.github.com> Co-authored-by: Michael Eichelbeck <michael.eichelbeck.ext@wtsde.onmicrosoft.de>

CLUSTER_COMPLETE triggers orchestrator.stop() which sets state to 'stopped', but no SIGTERM was sent to trigger the cleanup handlers. Added polling in setupDaemonCleanup to detect state change and exit. Fixes zombie zeroshot processes in heroshot runs.

tomdps and others added 2 commits January 27, 2026 10:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(daemon): exit process when cluster completes #237

fix(daemon): exit process when cluster completes #237

Uh oh!

EivMeyer commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(daemon): exit process when cluster completes #237

Are you sure you want to change the base?

fix(daemon): exit process when cluster completes #237

Uh oh!

Conversation

EivMeyer commented Jan 30, 2026

Problem

Root Cause

Fix

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants