Operations Runbook
This page is for operators who need reliable day-to-day CCCC execution.
1) Runtime Topology
Default runtime home:
CCCC_HOME=~/.cccc
Key paths:
~/.cccc/registry.json~/.cccc/daemon/ccccd.sock~/.cccc/daemon/ccccd.log~/.cccc/groups/<group_id>/group.yaml~/.cccc/groups/<group_id>/ledger.jsonl
2) Startup and Health Checks
Start
bash
ccccHealth Baseline
bash
cccc doctor
cccc daemon status
cccc groupsExpected:
- daemon reachable
- runtimes detected
- active group list loadable
3) Incident Triage Order
When a group appears stuck:
- Check daemon health.
- Check group state (
active/idle/paused/stopped). - Check actor runtime status.
- Check message obligations (reply-required/attention ack).
- Check automation and delivery throttling.
Useful commands:
bash
cccc daemon status
cccc actor list
cccc inbox --actor-id <actor_id>
cccc tail -n 100 -f4) Fast Recovery Playbook
Actor-level recovery (preferred)
bash
cccc actor restart <actor_id>Use this before group-level restart.
Group-level recovery
bash
cccc group stop
cccc group startDaemon-level recovery (last resort)
bash
cccc daemon stop
cccc daemon start5) Secure Remote Access
Required baseline:
- Set
CCCC_WEB_TOKEN. - Use Cloudflare Access or Tailscale for network boundary.
Do not:
- Expose Web UI directly without an access gateway.
- Store secrets in repo files.
6) Upgrade Playbook (RC-safe)
Before upgrade
- Stop active high-risk sessions.
- Backup
CCCC_HOME. - Record current version and smoke state.
Upgrade
bash
python -m pip install -U cccc-pairAfter upgrade
bash
cccc doctor
cccc daemon status
cccc mcpRun a small end-to-end smoke:
- create/attach group
- add/start actor
- send/reply
- verify ledger and inbox behavior
7) Backup and Restore
Backup (minimal)
Backup CCCC_HOME:
- registry
- daemon logs (optional)
- all groups (
group.yaml, ledger, state)
Restore
- Stop daemon.
- Restore
CCCC_HOMEdirectory. - Start daemon and verify with
cccc doctor.
8) Operational Guardrails
- Keep one source of truth: decisions should be in CCCC messages.
- Use
reply_requiredfor critical asks. - Prefer explicit recipients over broad broadcast when scope is narrow.
- Keep automation focused on objective reminders, not chat noise.
9) Escalation Checklist
If an issue repeats:
- Collect evidence:
- group id
- actor id
- event ids
- recent
cccc tail -n 100
- Capture reproducible sequence.
- Classify severity (
P0/P1/P2). - Register fix or risk in release findings.