← Reference library
PaperHigh credibilityarXiv · Lin et al. · September 1, 2021

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Our summary

A benchmark of questions where humans commonly hold misconceptions; models often answer with the same imitative falsehoods learned from training text, and — strikingly — larger models can be less truthful. Separates being informative from being truthful.

Why it matters

Shows fabrication isn't random noise: models reproduce popular human falsehoods, which is exactly when a confident wrong answer is most persuasive.

Cited by these methods

Related findings (2)

Published June 26, 2026

Cite this

Qlarify Labs. (2026). TruthfulQA: Measuring How Models Mimic Human Falsehoods. Retrieved from https://labs.qlarify.fi/references/truthfulqa-imitative-falsehoods-2021