What's the single most-skipped step in enterprise content programmes?

Active disposal when retention expires — content piles up, multiplying discovery cost and regulator exposure

Documents & Content (KA 7) — Semantic Web Academy

Overview

Documents & Content (KA 7)

Unstructured content is a first-class asset — capture, classify, retain, search.

Why it matters

DMBOK's most-ignored KA. Email, PDFs, contracts, support tickets — the unstructured 80% of enterprise data — usually has no owner, no classification, no retention. Then a discovery request lands.

Going deeper

The minimum-viable content programme:

Capture — a defined system of record per content class (contracts → CLM, tickets → ITSM, customer docs → CCM).
Classify — sensitivity tag at upload (auto-classify where possible).
Retain — retention rules driven by class, enforced by the platform.
Search — full-text + metadata, with access scoped by classification.
Dispose — active deletion when retention expires (not just ‘aged out’).

The unstructured-data layer is the one where the gap between policy and enforcement is widest — and where regulators have started focusing.

Analogy

Document & content management is the difference between a library and a filing cabinet in someone's attic.

Both contain documents. Only one is findable, classified, retained on schedule, and accessible to the right reader. An attic is the default state of enterprise unstructured data — until a court order asks for ‘every email about Project X between dates Y and Z’, and the answer is ‘we have a SharePoint somewhere’.

Make it stick

Anchor documents & content (ka 7) to something you actually own.

›Where in your platform does *documents & content (ka 7)* live today — and who owns it?
›What is the smallest version of *documents & content (ka 7)* you could ship next sprint?
›What's the most likely misuse of *documents & content (ka 7)*, and how would you spot it in a design review?

Reading in progress · 0 of 1 activity done