Designed reversibility
Three layers of off-switch
- Feature flag — disable the model code path; fall back to rules / baseline. Fastest, smallest blast radius.
- Registry rollback — point the alias at the previous version. Seconds.
- Hard cut-off — stop serving entirely; downstream uses a safe default. Last resort.
Runbook minimum
- Who pages whom.
- How to revert the alias / flip the flag (exact commands).
- The safe baseline and its behaviour.
- The threshold that automatically triggers a freeze on promotions (error budget exhausted).
Test the kill-switch quarterly. An untested switch is worse than no switch — it gives false confidence.