Overview
Documents & Content (KA 7)
Unstructured content is a first-class asset — capture, classify, retain, search.
Why it matters
DMBOK's most-ignored KA. Email, PDFs, contracts, support tickets — the unstructured 80% of enterprise data — usually has no owner, no classification, no retention. Then a discovery request lands.
Going deeper
The minimum-viable content programme:
- Capture — a defined system of record per content class (contracts → CLM, tickets → ITSM, customer docs → CCM).
- Classify — sensitivity tag at upload (auto-classify where possible).
- Retain — retention rules driven by class, enforced by the platform.
- Search — full-text + metadata, with access scoped by classification.
- Dispose — active deletion when retention expires (not just ‘aged out’).
The unstructured-data layer is the one where the gap between policy and enforcement is widest — and where regulators have started focusing.