← Reference library
PaperHigh credibilityTACL 2021 (arXiv:2102.01017) · Yanai Elazar, Nora Kassner, Shauli Ravfogel, et al. · February 1, 2021

Measuring and Improving Consistency in Pretrained Language Models

Our summary

Using paraphrased cloze queries (ParaRel), shows that pretrained models give inconsistent answers to logically equivalent questions — a poorly structured knowledge representation rather than a stable one.

Why it matters

Empirical grounding for logic and consistency testing — asserting the invariants a sound reasoner must satisfy and watching them break across related questions.

Cited by these methods

Published June 26, 2026

Cite this

Qlarify Labs. (2026). Measuring and Improving Consistency in Pretrained Language Models. Retrieved from https://labs.qlarify.fi/references/measuring-consistency-plms-2021