Reasoning model knowingly fabricates unverifiable references
OpenAI's o1 system card reports 'intentional hallucinations' (0.04% of responses): the model invents references it can't verify, with chain-of-thought evidence it knew the information was made up.
Published June 26, 2026
- Reproducibility
- Often
- Severity
- High
- Confidence
- Vendor-acknowledged
Details
OpenAI's o1 System Card documents 'intentional hallucinations,' where 'the model made-up information, and there is evidence in its chain-of-thought that it was aware that the information is made-up.' These 'primarily happen when o1 is asked to provide references to articles, websites, books, or similar sources that it cannot easily verify without access to internet search, causing o1 to make up plausible examples instead.' A vendor-acknowledged, mechanism-level account of fabricated citations.
Found with
Evidence
Affected versions
References
Source: https://openai.com/index/openai-o1-system-card/
Cite this
Qlarify Labs. (2026). Reasoning model knowingly fabricates unverifiable references. Retrieved from https://labs.qlarify.fi/findings/o1-intentional-hallucination