1. Agent reliability & AgentOps hardening
The flagship. Reliability designed into the harness, not patched on afterward.
Most agent stacks are built prompt-first and break the first time the real world pushes back. We treat reliability as architecture. The tactics below are the proof.
Idempotency tokens
a retry that lands twice counts once
State checkpoints
before any stateful operation
Connector layer
circuit breakers, retry budgets, read-vs-write permissions
Cost guards
daily budget, soft alert at 70% of budget, hard kill at 95%
Reliability SLIs
success rate, retry rate, p95 latency, cost-per-output
Progressive tool access
read-only first, write earned
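
To make the idempotency-token tactic concrete, here is a minimal sketch in Python. The `IdempotentExecutor` class, the `order-123` token, and the in-memory dictionary are all illustrative assumptions; a production harness would likely keep tokens in a durable store and guard against concurrent callers.

```python
from typing import Any, Callable, Dict


class IdempotentExecutor:
    """Runs a side effect at most once per idempotency token (hypothetical helper)."""

    def __init__(self) -> None:
        # Assumption: in-memory storage for illustration only.
        self._results: Dict[str, Any] = {}

    def run(self, token: str, operation: Callable[[], Any]) -> Any:
        # A retry with a token we have already seen returns the stored result
        # instead of executing the side effect a second time.
        if token in self._results:
            return self._results[token]
        result = operation()
        self._results[token] = result
        return result


charges = []


def charge_card() -> str:
    # Stand-in for a real write operation.
    charges.append(25)
    return "charged"


executor = IdempotentExecutor()
first = executor.run("order-123", charge_card)
retry = executor.run("order-123", charge_card)  # lands twice, counts once
```

After both calls, `charges` holds a single entry, which is the behavior the tactic describes.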
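
The cost-guard thresholds can be sketched in the same spirit. The `CostGuard` and `BudgetStatus` names are hypothetical, and treating the thresholds as inclusive (`>=`) is an assumption; only the 70% and 95% figures come from the list above.

```python
from dataclasses import dataclass
from enum import Enum


class BudgetStatus(Enum):
    OK = "ok"
    SOFT_ALERT = "soft_alert"
    HARD_KILL = "hard_kill"


@dataclass
class CostGuard:
    """Tracks spend against a daily budget (illustrative sketch)."""

    daily_budget: float
    soft_ratio: float = 0.70  # soft alert threshold
    hard_ratio: float = 0.95  # hard kill threshold
    spent_today: float = 0.0

    def record(self, cost: float) -> BudgetStatus:
        # Add the new cost, then report where the day's spend stands.
        self.spent_today += cost
        return self.status()

    def status(self) -> BudgetStatus:
        used = self.spent_today / self.daily_budget
        if used >= self.hard_ratio:
            return BudgetStatus.HARD_KILL
        if used >= self.soft_ratio:
            return BudgetStatus.SOFT_ALERT
        return BudgetStatus.OK


guard = CostGuard(daily_budget=100.0)
status_after_50 = guard.record(50.0)  # 50% of budget used
status_after_75 = guard.record(25.0)  # 75% of budget used
status_after_96 = guard.record(21.0)  # 96% of budget used
```

A harness would typically check the returned status before each paid call and stop the agent once it sees `HARD_KILL`.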
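
The four reliability SLIs can be computed from per-run records, as in the sketch below. The `RunRecord` fields and the exact definitions are assumptions: retry rate is taken as the share of runs with at least one retry, p95 uses the nearest-rank method, and cost-per-output divides total cost by successful runs.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RunRecord:
    """One agent run (hypothetical record shape)."""

    succeeded: bool
    retries: int
    latency_s: float
    cost_usd: float


def p95(values: List[float]) -> float:
    # Nearest-rank percentile: rank = ceil(0.95 * n), computed in integers.
    ordered = sorted(values)
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def compute_slis(runs: List[RunRecord]) -> Dict[str, float]:
    # Assumes a non-empty list of runs.
    total = len(runs)
    successes = sum(1 for r in runs if r.succeeded)
    total_cost = sum(r.cost_usd for r in runs)
    return {
        "success_rate": successes / total,
        "retry_rate": sum(1 for r in runs if r.retries > 0) / total,
        "p95_latency_s": p95([r.latency_s for r in runs]),
        # Failed runs still cost money, so they raise cost per successful output.
        "cost_per_output": total_cost / successes if successes else float("inf"),
    }
```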
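
Progressive tool access, combined with the connector layer's read-vs-write permissions, might look like the following sketch. How write access is "earned" is not specified above; promoting after a fixed number of successful read calls is purely an assumption for illustration, as are the `ToolGate` and `Access` names.

```python
from enum import IntEnum
from typing import Any, Callable, Dict, Tuple


class Access(IntEnum):
    READ = 1
    WRITE = 2


class ToolGate:
    """Starts an agent read-only and unlocks writes later (illustrative sketch)."""

    def __init__(self, promote_after: int = 3) -> None:
        self.level = Access.READ
        # Assumption: write access is earned after this many successful reads.
        self.promote_after = promote_after
        self.successful_reads = 0
        self._tools: Dict[str, Tuple[Access, Callable[[], Any]]] = {}

    def register(self, name: str, required: Access, fn: Callable[[], Any]) -> None:
        self._tools[name] = (required, fn)

    def call(self, name: str) -> Any:
        required, fn = self._tools[name]
        if required > self.level:
            raise PermissionError(f"{name} needs {required.name} access")
        result = fn()
        # Only reads that completed without raising count toward promotion.
        if required == Access.READ:
            self.successful_reads += 1
            if self.successful_reads >= self.promote_after:
                self.level = Access.WRITE
        return result


gate = ToolGate(promote_after=3)
gate.register("list_tickets", Access.READ, lambda: ["T-1", "T-2"])
gate.register("close_ticket", Access.WRITE, lambda: "closed")
```

With this setup, `gate.call("close_ticket")` raises `PermissionError` until `list_tickets` has succeeded three times.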