We call it Claudikins because "Draconian-AI-Supervisor" was taken.
A disciplined workflow engine run by a team of neurotic AI agents.
You asked Claude for a bug fix. He refactored half your codebase.
You asked Claude for a feature. He placed a bunch of stubs that look a little bit real.
You asked Claude if you should drink that coffee you forgot about, now you're feeling sick and threw up on your keyboard, which cost too much just to hear nice click clack noises whilst you code, which actually isn't that great because it wakes your dog up at night, then you have to take him out in the cold to poop whilst you're still sick and well...okay, maybe that one was just me, but the point still stands!
claudikins-kernel applies SRE discipline to AI workflows. It enforces a strict 4-stage pipeline with gates between each step. You literally cannot skip verification. You cannot ship without the Cynic's approval.
Constraint is freedom. By preventing shortcuts, you get code that actually works.
┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐
│ /outline │────▶│ /execute │────▶│ /verify │────▶│ /ship │
└──────────┘ └──────────┘ └──────────┘ └──────────┘
│ │ │ │
▼ ▼ ▼ ▼
taxonomy- babyclaude catastrophiser git-perfectionist
extremist spec-reviewer cynic
code-reviewer
Each arrow is a gate. Try to /ship without /verify passing? Blocked. Try to /execute without a plan? Blocked. The system enforces this - not guidelines, guardrails.
# Prerequisites: jq (JSON processor)
# Windows: winget install jqlang.jq
# Ubuntu/Debian: sudo apt install jq
# macOS: brew install jq
# Add the Claudikins marketplace
/marketplace add elb-pr/claudikins-marketplace
# Install the plugin
/plugin install claudikins-kernelRestart Claude Code. Then:
# Start your first disciplined session
/claudikins-kernel:outline "Add user authentication to the app"These aren't generic "agents". They're your synthetic staff - each with a job and a personality.
| Agent | Role | Personality |
|---|---|---|
| taxonomy-extremist | Researcher | The librarian. Categorises everything. Reads your codebase, external docs, the web - returns structured findings. |
| babyclaude | Implementer | The eager junior. Does exactly what you specify. One task, one branch, fresh context. No scope creep. |
| spec-reviewer | Compliance | The auditor. Did you do what you said you'd do? Mechanical check against acceptance criteria. |
| code-reviewer | Quality | The critic. Is it actually any good? Error handling? Edge cases? Naming? |
| catastrophiser | Verification | The QA lead who assumes everything will break. Runs your code, takes screenshots, curls your endpoints. Sees it working, doesn't trust tests alone. |
| cynic | Simplification | The senior engineer who hates complexity. If it can be done in 5 lines, won't let you use 10. |
| conflict-resolver | Merge Handler | The diplomat. When branches collide, proposes resolutions. |
| git-perfectionist | Documentation | The pedant. README not updated? Changelog wrong? Blocked until it's right. |
Iterative brainstorming until you have a solid plan.
- Brain-jam - Back and forth with Claude. Pick from options, don't type essays.
- Research - taxonomy-extremist agents dig through your codebase in parallel.
- Approaches - 2-3 ways to solve it. Pros, cons, recommendation. You pick.
- Draft - Plan written section by section. You approve each one.
Output: plan.md with a task table that /execute can parse.
Execute the plan task by task with fresh agents and code review.
- Batch checkpoint - "Batch 1/3: [task-1, task-2]. Ready?" You decide.
- Per task - Creates branch, spawns fresh babyclaude, implements, commits.
- Two-stage review - spec-reviewer checks compliance, code-reviewer checks quality.
- Merge decision - You choose: merge all, merge some, or keep branches.
Key feature: Each babyclaude gets fresh context. No pollution between tasks.
Claude must see the code working. Not trust. Verify.
- Automated checks - Tests, lint, type check.
- Output verification - catastrophiser runs your code:
- Web app? Starts server, takes screenshots.
- API? Curls endpoints, checks responses.
- CLI? Runs commands, verifies output.
- Polish pass - cynic looks for unnecessary complexity. Changes one thing at a time, tests after each.
- Human checkpoint - Comprehensive report. You decide: ready to ship?
Output: verify-state.json with unlock_ship: true if approved. Plus file hashes so /ship can detect tampering.
Merge to main with proper docs and PR.
- Gate check - Won't run unless verify passed AND code hasn't changed.
- Commit strategy - Squash or preserve? Message drafted, you approve.
- Documentation - git-perfectionist updates README, CHANGELOG, version. Section by section.
- PR creation - Draft, approve, create via
ghCLI. - Merge - Wait for CI if you want. Merge. Cleanup branches.
Output: Code on main. PR merged. Done properly.
| Protection | What it does |
|---|---|
| Cross-command gates | Can't skip steps. Execute needs plan. Ship needs verify. |
| State files | Each command writes to .claude/. Resume if context dies. |
| File locking | flock prevents race conditions on state writes. |
| Code integrity | SHA256 hashes ensure shipped code = verified code. |
| Session management | Stale session (4+ hours)? Warns you research might be outdated. |
| Human checkpoints | Nothing auto-proceeds. You approve every phase. |
Industrial-grade patterns adapted for AI workflows.
| Distributed Systems Pattern | Claude Code Adaptation |
|---|---|
| Circuit breakers | Stuck agent detection |
| Distributed tracing | Execution spans |
| Load shedding | Batch size limits |
| Coordinated checkpoints | Batch-boundary saves |
| Deadline propagation | Task time budgets |
| Exponential backoff | Retry with jitter |
Same principles, different scale. Reliability through structure - not speed through parallelism.
- jq - Used by hook scripts for JSON processing
# Ubuntu/Debian
sudo apt install jq
# macOS
brew install jq
# Windows
winget install jqlang.jq| Plugin | Purpose |
|---|---|
claudikins-tool-executor |
MCP access for research and verification |
claudikins-automatic-context-manager |
Context monitoring at 60% |
| Plugin | Purpose |
|---|---|
claudikins-klaus |
Escalation when truly stuck |
v1.2.0 - Full Release - Regularly Maintained.
View the marketplace | Changelog
MIT
We call it Claudikins because "Draconian-AI-Supervisor" was taken.
