Why is *increasing* a topic's partition count a breaking change for some consumers?

It changes the key→partition hash mapping, so per-key ordering is no longer guaranteed across the boundary

It resets all consumer offsets to zero

Capacity Planning & Partition Sizing — Semantic Web Academy

The numbers behind 'how many partitions?'

Partition count is the hardest-to-change decision in Kafka: you can add partitions, but doing so re-shuffles key→partition assignment, breaking per-key ordering for every consumer that depends on it. So size it deliberately.

Throughput floor. Measure (or estimate) per-partition throughput t for your workload (often 10–50 MB/s produce, less for consume with heavy processing). For a target T, you need at least partitions >= T / t. Then take the max of the produce-side and consume-side requirement — consumers can't parallelise beyond the partition count, so a slow handler often dominates.

Consumer parallelism ceiling. A consumer group can have at most one consumer per partition. If you might need 24 parallel consumers at peak, you need >= 24 partitions now. Plan for the 18-month peak, not today's.

Costs of over-partitioning. Each partition is open files + memory + a replication stream + longer leader-election and recovery times. Tens of thousands of partitions per broker is a real ceiling (much higher under KRaft than ZooKeeper, but not infinite). Rule of thumb: right-size, leave 2–3x headroom, and prefer a few well-sized topics over thousands of tiny ones.

Capacity Planning & Partition Sizing

Throughput, parallelism, and the cost of too many

The numbers behind 'how many partitions?'

A partition-sizing calculator (Python)

Reflect