Overview
Subject Access & Erasure Pipelines
SAR / DSAR / 'right to be forgotten' — these are an engineering pipeline, not a legal process.
Why it matters
When the request arrives, you need to find every dataset that contains the subject, extract or delete in N days, prove it, and not break statistical aggregates downstream.
Going deeper
Building a scalable erasure (Right to Be Forgotten) pipeline:
- Identity Resolution: Resolving the requestor's email to their canonical identity (e.g.
user_123). - Orchestration: Firing a webhook to every internal system (CRM, Warehouse, Microservices).
- Nullification vs Deletion: In a warehouse, deleting rows can break historical aggregate reporting (e.g. Total Revenue dips). Instead of a SQL
DELETE, best practice is toUPDATE table SET name=NULL, email=NULL WHERE user_id=123. The financial aggregate remains, but the PII is irretrievably erased. - Receipt Generation: Emitting an audit log (stored as long as the law requires) proving the deletion executed.