4 · Deployment — Ontop service, audit, and governance

How an OBDA platform ships under regulator scrutiny. Stable mappings, signed releases, query-explain by default.

0/2 done

Theory — deployment & governance

What ships

  1. Ontop as a service (ontop endpoint --ontology=… --mapping=… --properties=…). HTTP SPARQL endpoint, JDBC connection pool to the warehouse, auth in front via the bank's IAM.
  2. The ontology at a stable PURL (https://atlas.example.com/onto/reporting/1.0.0), signed, mirrored to an internal artifact store.
  3. The R2RML mapping versioned alongside.
  4. A regression query suite — every regulator question answered to date becomes a CI assertion that re-runs on every release.

Three governance habits

G1 — Explain by default. Every regulator query stores the rewritten SQL alongside the result. An auditor can always retrace: SPARQL → SQL → row → DWH table → source system. The chain of evidence is the product.

G2 — Mapping reviews are mandatory. Ontology PRs can be reviewed by an ontologist alone. Mapping PRs require both the ontologist AND the data engineer who owns the underlying table. A wrong mapping silently reclassifies real customers.

G3 — Profile gate in CI. ontop validate --profile QL runs on every PR. The day someone adds an equivalentClass axiom that pushes the ontology into DL, the build fails — before a regulator submits a query that explodes into exponential rewriting.

Failure modes worth pre-mortemming

  • Mapping drift — a DBA renames a column, R2RML breaks silently at next query. Mitigation: schema-check step in CI that diffs DWH metadata against the mapping.
  • Over-broad templatestemplate "…/party/{id}" where id is not unique across tables ⇒ IRI collisions. Mitigation: include the table in the template (/party/{table_name}/{id}).
  • Cost explosions — a regulator submits an unconstrained ?s ?p ?o query. Mitigation: SPARQL complexity limits at the gateway.

Reflect

OBDA is the most under-deployed semantic-web pattern in enterprise IT. Most teams haven't realised that 'we have a warehouse' is not a reason to avoid RDF — it's the exact precondition for the simplest possible RDF deployment.

  • Where in your org would a 30-line R2RML expose a virtual semantic API tomorrow?
  • What's your migration path if Ontop one day adds OWL 2 RL support and a use case asks for it?

Reading in progress · 0 of 2 activities done