Theory — the thresholds that justify a graph
The signal that JSON has hit its ceiling
JSON-first is right until it isn't. Graduate from a JSON ontology to a graph-backed one when at least two of these hold:
- Multi-hop questions are now the product. 'Show every open claim whose repair workshop also appears in a fraud ring connected to this policyholder' — customer → policy → claim → workshop → fraud pattern. JSON can store those links; it can't traverse them.
- Cross-system entity resolution is costing real hours. The same policyholder is
cust_4821in claims,P-99213in billing, and a name string in the fraud tool. Reconciling them by hand, repeatedly, is the tax a shared identity (IRIs) removes. - Regulators want relationship evidence, not flat logs. 'Prove this claim was never linked to a flagged workshop' is a path query, awkward over event logs.
- The rule set outgrows maintainable JSON branching and needs semantic reuse / inference (e.g. automatic
hagelschaden ⊑ elementarschadenrollups everywhere).
If none of these bite, stay JSON-first — a graph you don't query multi-hop is pure operational overhead.