Batch vs Stream & Latency Budgets

What you choose to call 'real time' is a budget, not a feature.

0/1 done

Define 'real-time' as a number

Latency budgets, not buzzwords

'Real-time' is meaningless without a number. A streaming system is characterised by:

  • End-to-end latency — produce → consume committed (target: ms to seconds).
  • Processing semantics — at-most-once, at-least-once, exactly-once.
  • Backpressure handling — what happens when consumers slow down?
  • Replayability — can a new consumer replay 30 days of history without resharding?

CAP & PACELC remind us: under partition you choose Availability vs Consistency; even in steady state you trade Latency vs Consistency. Kafka leans CP (with acks=all) but lets you opt back into AP by tuning replication and acks.

Reflect

Translate one of your current SLAs into a streaming latency budget.

  • What is the *p99* tolerable produce-to-consume latency?
  • What happens to the business when you double that latency?
  • Which acks / min.insync.replicas setting matches that budget honestly?

Reading in progress · 0 of 1 activity done