← Reference library
PaperHigh credibilityarXiv · Lin et al. · September 1, 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Our summary
A benchmark of questions where humans commonly hold misconceptions; models often answer with the same imitative falsehoods learned from training text, and — strikingly — larger models can be less truthful. Separates being informative from being truthful.
Why it matters
Shows fabrication isn't random noise: models reproduce popular human falsehoods, which is exactly when a confident wrong answer is most persuasive.
Cited by these methods
Related findings (2)
Published June 26, 2026
Cite this
Qlarify Labs. (2026). TruthfulQA: Measuring How Models Mimic Human Falsehoods. Retrieved from https://labs.qlarify.fi/references/truthfulqa-imitative-falsehoods-2021