Skip to Content
ANALYSISMEMBER

Why erasure requests don't require model retraining: Governance that is defensible

Deleting training data, not retraining models, is emerging as the practical answer to AI erasure requests.

Published
Subscribe to IAPP Newsletters

Contributors:

Divija Kota

CIPP/US, CIPM, FIP

Senior product manager

Microsoft

Once personal data enters a training pipeline, questions about whether the training was lawful matter. But the harder, more practical issue is what erasure looks like once the model exists. 

When someone submits an erasure request, privacy teams delete the source data everywhere it is stored and exclude the requester from future training runs, fine-tuning jobs and feedback loops. They do not retrain models that already learned from that data. 

That is not a dodge; it is the most honest answer based on how these systems work, what regulators have said so far and what good-faith compliance looks like in practice. 

The position has not been tested in enforcement, which is exactly why the scaffolding around it matters more than the position itself. Privacy, legal, machine learning engineering and product teams all have to agree, or it falls apart the first time someone pushes.

Why is retraining usually the wrong answer for an individual request? Once personal data enters a training pipeline, it shapes gradient updates across millions or billions of parameters. The model holds patterns across the corpus but can memorize specific examples, which is why evaluation matters. Two ideas get conflated. 

Machine unlearning is the research attempt to remove a specific record's influence from an already-trained model. Prospective exclusion is the operational practice of dropping that record from every future run. Exclusion can be guaranteed and audited today; unlearning is not yet reliable or verifiable enough to anchor compliance. 

So why not retrain from scratch? When training is lawful, the request comes from a single person and evaluation already shows the model is not memorizing or leaking data, retraining is unlikely to materially improve that individual's privacy. Their influence is already diluted across billions of parameters. If any condition fails, a planned refresh is back on the table.

Contributors:

Divija Kota

CIPP/US, CIPM, FIP

Senior product manager

Microsoft

MEMBER

Unlock this exclusive content and more

Join the IAPPAlready a member? Sign in

Membership opens up a world of resources

In-depth knowledge

From original research reports and daily news coverage to legislative trackers and infographics, we have the information you need to stay ahead of change.

A global network

Make valuable professional connections through more than 160 local IAPP KnowledgeNet chapters in 70 countries.

Access to the experts

Connect with top thinkers in privacy, AI governance and cybersecurity for fresh ideas and insights.

Learn what you get from membership