Why is text-to-metric over a semantic layer dramatically more reliable than text-to-SQL on raw tables?

The agent is constrained to a small typed catalogue (metrics, dimensions, grains) — it cannot hallucinate columns or joins because only the catalogued objects are valid outputs

Because LLMs are faster at JSON than SQL

The agent is constrained to a small typed catalogue (metrics, dimensions, grains) — it cannot hallucinate columns or joins because only the catalogued objects are valid outputs

Because the semantic layer pre-runs every possible query

It isn't — they're equivalent

Reverse ETL & Text-to-Metric (LLM-Friendly Semantic Layers) — Semantic Web Academy

The semantic layer's two newest customers

BI dashboards used to be the only consumer that mattered. Today there are two more, both of which only work if the semantic layer is a real API:

1. Reverse ETL — operational activation

Tools like Hightouch, Census and Polytomic read from the warehouse / semantic layer and push the rows into operational SaaS tools — Salesforce, HubSpot, Braze, Zendesk, Intercom. The promise: the same customer_health_score that drives the executive dashboard appears as a custom field on every salesperson's Salesforce account, and as a Braze audience for the lifecycle-marketing email.

Without a semantic layer, reverse ETL produces a fork: the warehouse number and the Salesforce number diverge within a month. With a semantic layer, the same definition drives both — and changes propagate uniformly.

2. Text-to-Metric — LLM agents that don't hallucinate

Vanilla 'text-to-SQL' is a hallucination machine: the model confidently invents column names. Text-to-metric constrains the LLM to a small typed catalogue (metrics, dimensions, time_grains, filters) and emits requests against the semantic layer instead of raw SQL.

// Agent input: 'How did revenue per region trend last quarter?'
// Agent output (validated against the catalogue):
{
  "metrics":   ["revenue"],
  "groupBy":   ["region", "week"],
  "timeGrain": "week",
  "filter":    "last_quarter"
}

Hallucination becomes structurally impossible: the agent can only ask for metrics that exist, on dimensions that exist, at grains that exist. The semantic layer is the LLM's schema.

Reverse ETL & Text-to-Metric (LLM-Friendly Semantic Layers)

Overview

The semantic layer's two newest customers

1. Reverse ETL — operational activation

2. Text-to-Metric — LLM agents that don't hallucinate

Map vs improvised directions

Reflect