Why AI Governance Needs Whistleblower Protections

Whistleblower protection tends to be treated as a narrow employment-law matter. In the AI context it is something larger: a partial substitute for the external verification that does not yet exist. Until inspectors can reliably audit what happens inside a frontier lab, the most reliable early warning of reckless development or an alarming capability is an insider willing to raise the alarm. Whether they can do so safely is therefore a governance question of the first order.

Why insiders matter so much for AI

Frontier AI has an unusual transparency problem. The most consequential decisions — how a model is trained, what safety testing it received, what worrying behaviours emerged, whether warnings were overruled — happen inside private organisations, invisible to outsiders. Regulators lack the access and often the expertise to see them independently. In this environment, the researchers and engineers with direct knowledge are the only people positioned to detect and report that something has gone wrong before it becomes public in the worst way.

This concern is not hypothetical. In 2024 a group of current and former employees of leading AI companies published an open letter, 'A Right to Warn about Advanced Artificial Intelligence', arguing that confidentiality agreements were blocking them from raising safety concerns and calling for protections to allow it. That such a letter was necessary is itself evidence of the problem: the people closest to the risk felt legally and professionally unable to talk about it.

What is blocking them

Restrictive agreements. Broad confidentiality and non-disparagement clauses — sometimes tied to vested equity worth millions — can make speaking out financially ruinous.
Legal exposure. Without a clear protected channel, an employee who discloses concerns risks being sued for breach of contract or trade-secret violation.
Gaps in existing law. Traditional whistleblower statutes protect reports of illegal activity, but much of what is dangerous in frontier AI is not yet illegal — leaving disclosures about lawful-but-reckless development unprotected.
Career risk. In a small field, the informal cost of being seen as disloyal can end a career even where formal retaliation is barred.

What effective protection would look like

Protected disclosure channels

Legally guaranteed routes for AI workers to report safety concerns to a regulator or oversight body, shielded from liability, even when the conduct disclosed is not itself illegal.

Limits on silencing contracts

Restrictions on the use of confidentiality and non-disparagement clauses to suppress good-faith safety concerns, so that reporting a danger cannot cost an employee their vested compensation.

Anti-retaliation guarantees

Strong, enforceable protection against dismissal, blacklisting, or other retaliation for protected disclosures, with real remedies.

A body to receive warnings

An oversight authority — a regulator or safety institute — with the mandate and expertise to act on disclosures rather than let them disappear.

Until we can inspect the labs from the outside, the people inside them are our smoke detectors. A law that lets a company silence its own researchers about a catastrophic risk is a law that disables the alarm.
Naoto Nakada, Founder · Nakada Foundation to Save Humanity

Tractable, and a bridge to more

Whistleblower protection has a rare quality in AI governance: it is achievable through ordinary domestic legislation, without a treaty, without international consensus, and without resolving the hard questions of thresholds and verification. Any jurisdiction that hosts frontier labs can enact it unilaterally, and doing so would immediately strengthen the world's ability to detect dangerous development. It is also a bridge to harder governance: the protected disclosures it enables generate exactly the kind of inside information that regulators and, eventually, an international verification regime would rely on. Building external verification will take years. Protecting the insiders who can warn us in the meantime is something legislatures can do now — and it is among the highest-leverage safety measures available before a treaty exists.

QUICK ANSWERS

Common questions.

Why are whistleblower protections important for AI safety?

Because outside verification of frontier AI is still primitive, the people best positioned to detect dangerous development first are the researchers inside the labs. Their most consequential decisions happen invisibly within private organisations. Until inspectors can independently audit those organisations, a protected insider willing to raise the alarm is often the only reliable early warning — making their ability to speak out safely a core governance question, not a narrow employment matter.

What stops AI workers from raising safety concerns?

Broad confidentiality and non-disparagement agreements — sometimes tied to vested equity worth millions — can make speaking out financially ruinous; without a protected channel, disclosure risks lawsuits for breach of contract or trade-secret violation; traditional whistleblower laws protect reports of illegal activity, but much dangerous frontier AI development is not yet illegal; and in a small field, the informal career cost of being seen as disloyal can be severe.

What is the 'Right to Warn' letter?

A 2024 open letter from current and former employees of leading AI companies, titled 'A Right to Warn about Advanced Artificial Intelligence', arguing that confidentiality agreements were preventing them from raising safety concerns and calling for protections that would let them do so. Its existence is evidence of the problem: the people closest to the risk felt legally and professionally unable to speak about it.

What would effective AI whistleblower protection require?

Legally guaranteed protected channels for reporting safety concerns to a regulator, shielded from liability even when the disclosed conduct is not itself illegal; limits on confidentiality and non-disparagement clauses used to silence good-faith concerns; strong anti-retaliation guarantees with real remedies; and an oversight body with the expertise and mandate to act on disclosures. Crucially, all of this can be enacted through ordinary domestic law, without a treaty.

The Case for AI
Whistleblower Protections.

Why insiders matter so much for AI

What is blocking them

What effective protection would look like

Protected disclosure channels

Limits on silencing contracts

Anti-retaliation guarantees

A body to receive warnings

Tractable, and a bridge to more

Common questions.

Go deeper.

The Case for AIWhistleblower Protections.

Why insiders matter so much for AI

What is blocking them

What effective protection would look like

Protected disclosure channels

Limits on silencing contracts

Anti-retaliation guarantees

A body to receive warnings

Tractable, and a bridge to more

Common questions.

Go deeper.

The treaty doesn't exist yet.That's the point.

The Case for AI
Whistleblower Protections.

The treaty doesn't exist yet.
That's the point.