Why the false-positive problem keeps returning
If a detector judges only the final text, it has to look for patterns that are correlated with machine generation. But many of those patterns also appear in careful human prose, especially in formal, academic, or non-native English writing.
That means the tool is not just checking whether the text is synthetic. It is also checking whether the text resembles what the model expects from synthetic output, which is a much weaker claim.