Overview
Data Integration & Interoperability (KA 6)
Batch, micro-batch, streaming, change-data-capture, API — and the contracts between them.
Why it matters
Most enterprise pain lives in integration: the brittle nightly jobs, the ‘mystery 9 a.m. break’, the CDC replication slot that silently falls behind. DMBOK organises this KA around patterns and contracts rather than tools.
Going deeper
Integration patterns and the latency/coupling trade-off:
| Pattern | Latency | Coupling | Failure mode |
|---|---|---|---|
| Nightly batch ETL/ELT | Hours | Loose | Stale data; cron misses; backfill pain |
| Micro-batch (5–15 min) | Minutes | Loose | Hidden lag amplification |
| CDC (log-based) | Seconds | Tight to source | Slot lag; replication storms |
| Event streaming (Kafka) | Sub-second | Loose | Schema-evolution discipline required |
| Synchronous API call | Real-time | Very tight | Cascading failure if upstream blips |
The discipline DMBOK adds: every integration is governed by a contract (schema + SLA + ownership). Without contracts, the choice of pattern is irrelevant; the integration will break either way.
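To make the three parts of a contract concrete, here is a minimal Python sketch. It is an illustration only, not a DMBOK-prescribed format: the `DataContract` class, its field names, the `orders_nightly` feed, the owner name, and the 26-hour staleness limit are all hypothetical. It treats the schema as a map of required fields to types, the SLA as a maximum staleness, and ownership as a named accountable team.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class DataContract:
    """Hypothetical contract: schema + SLA + ownership for one integration."""

    name: str
    owner: str                 # accountable team (assumed naming convention)
    schema: dict[str, type]    # required field name -> expected Python type
    max_staleness: timedelta   # SLA: how old the newest load may be

    def schema_violations(self, record: dict) -> list[str]:
        """Return readable schema problems for one record (empty if none)."""
        problems = []
        for field_name, expected in self.schema.items():
            if field_name not in record:
                problems.append(f"missing field: {field_name}")
            elif not isinstance(record[field_name], expected):
                problems.append(
                    f"{field_name}: expected {expected.__name__}, "
                    f"got {type(record[field_name]).__name__}"
                )
        return problems

    def sla_breached(self, last_loaded_at: datetime, now: datetime) -> bool:
        """True when the newest load is older than the agreed limit."""
        return now - last_loaded_at > self.max_staleness


# Illustrative contract for a nightly batch feed (all values are assumptions).
orders_contract = DataContract(
    name="orders_nightly",
    owner="sales-data-team",
    schema={"order_id": int, "amount": float, "currency": str},
    max_staleness=timedelta(hours=26),
)

good = {"order_id": 1, "amount": 19.99, "currency": "EUR"}
bad = {"order_id": "1", "amount": 19.99}  # wrong type, missing currency

now = datetime(2024, 1, 2, 9, 0, tzinfo=timezone.utc)
print(orders_contract.schema_violations(good))
print(orders_contract.schema_violations(bad))
print(orders_contract.sla_breached(now - timedelta(hours=30), now))
```

The same object works for any pattern in the table: a batch job can check `sla_breached` after each run, while a streaming consumer can call `schema_violations` per message. In practice the schema part is often delegated to a schema registry or a validation library. The point of the sketch is only that all three elements live in one reviewable, owned artefact.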