A model scores 0.95 AUC in training but 0.71 in production. The first hypothesis should be…

Label leakage / non point-in-time-correct feature joins

Wrong colour scheme on the dashboard

Point-in-Time Correctness — Semantic Web Academy

Leaking the future

Suppose you train a churn model with label_date = 2026-04-01 and a feature account_balance from a balance table. A naive SQL join takes the most recent balance — which might be after the label date. The model now 'sees the future' during training and looks artificially accurate. In production it has no future, so it underperforms.

Point-in-time joins restrict each feature lookup to values available at or before the label timestamp. Feature stores do this for you; if you roll your own, the logic is non-trivial — and a frequent source of subtle bugs.

Point-in-Time Correctness

`AS OF` joins

Leaking the future

Analogy