PaperHigh credibilityarXiv · Lin et al. · September 1, 2021

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Our summary

A benchmark of questions where humans commonly hold misconceptions; models often answer with the same imitative falsehoods learned from training text, and — strikingly — larger models can be less truthful. Separates being informative from being truthful.

Why it matters

Shows fabrication isn't random noise: models reproduce popular human falsehoods, which is exactly when a confident wrong answer is most persuasive.

Cited by these methods

🔬 Factual oracle verification

Related findings (2)

Fabricated citations and referencesHigh
Models invent plausible-looking but non-existent papers, authors, DOIs and URLs.
Fabrication instead of admitting uncertaintyHigh
Asked about something unknown or non-existent, models invent an answer rather than saying 'I don't know'.

Hallucination Evals Benchmarks

Published June 26, 2026

Cite this

Qlarify Labs. (2026). TruthfulQA: Measuring How Models Mimic Human Falsehoods. Retrieved from https://labs.qlarify.fi/references/truthfulqa-imitative-falsehoods-2021