Theory — staged, reversible rollout
Roll out by earning trust, one mode at a time
Never go straight to auto-processing — you have no production accuracy evidence yet. Stage it so each phase is reversible:
- Phase 1 (weeks 1–3) · Shadow mode. Pipeline runs, writes nothing to the CRM. You compare its output against agents' manual claims to build the real accuracy baseline on live traffic.
- Phase 2 (weeks 4–7) · Assisted mode. The pipeline pre-fills fields; the agent confirms before submit. This is where the ontology proves it saves the 6–9 minutes, with a human still in the loop.
- Phase 3 (weeks 8–12) · Gated auto-process. Only the highest-confidence flows auto-create (
stt_confidence ≥ 0.88andentity_coverage ≥ 0.80and no forbidden-uncertainty terms). Everything else falls to assisted/review.
Assisted mode is also your permanent safe fallback: any incident degrades the system back to it rather than offline.