Home/Glossary/AI detector false positive
Definition

What is AI detector false positive?

Definition

An AI detector false positive is human-written text wrongly flagged as machine-generated. Documented rates are highest for non-native English speakers — one Stanford study found detectors flagged 61% of TOEFL essays as AI-written.

False positives are not occasional glitches; they are structural. Detectors flag text that looks statistically predictable, and predictability correlates with things that have nothing to do with AI: writing in a second language, following genre conventions, or simply writing cleanly.

The consequences land asymmetrically. A student flagged by a detector faces an integrity process with no way to disprove the score; a freelancer loses a client. The accused party bears the burden of proving a negative — which finished text cannot do.

This asymmetry is the core argument for certification: evidence collected during writing means the writer never has to argue against a black box.