Overview
Data Quality (KA 11) — The Six Dimensions
Accuracy, completeness, consistency, timeliness, uniqueness, validity — measure, target, monitor, escalate.
Why it matters
DQ is the KA most often equated with ‘a dashboard’ and most rarely run as a programme. DMBOK organises it around measure → target → monitor → escalate, the same loop SRE uses for service reliability.
Going deeper
Six DQ dimensions, each with a typical SQL-able rule:
| Dimension | Definition | Sample rule |
|---|---|---|
| Accuracy | Does the value reflect reality? | Cross-check against system of record |
| Completeness | Are required fields present? | COUNT(*) WHERE col IS NULL = 0 |
| Consistency | Same fact agrees across systems? | SUM(a.value) = SUM(b.value) |
| Timeliness | Fresh enough for use? | max(updated_at) > now() - interval '1 hour' |
| Uniqueness | One row per real-world entity? | COUNT(*) = COUNT(DISTINCT key) |
| Validity | Does it satisfy declared format/range? | regex / CHECK constraints |
Pick a target per dimension per dataset (perfection is uneconomic), instrument, and route the alerts to the right human. Without that last step the dashboard is decorative.