Overview
Data Catalogues
Searchable inventory of datasets with owners, schemas, freshness, trust signals.
Why it matters
A catalog is the library card system for data — without it, every new analyst recreates the same dataset because they can't find the existing one.
Going deeper
A catalog earns trust through four layers, in this order:
- Inventory — every dataset is registered (auto-harvested, not manual).
- Ownership — every dataset has a human or team name attached.
- Operational signals — freshness, schema, row counts, last run state.
- Trust signals — quality scores, certified-by-domain badge, DQ contract link.
Skip layer 1 and the catalog is incomplete. Skip layer 2 and it's a phonebook with no phone numbers. Skip layer 3 and consumers can't tell stale from fresh. Skip layer 4 and every dataset still feels like a coin-flip.