← Reference library
PaperHigh credibilityTACL 2021 (arXiv:2102.01017) · Yanai Elazar, Nora Kassner, Shauli Ravfogel, et al. · February 1, 2021
Measuring and Improving Consistency in Pretrained Language Models
Our summary
Using paraphrased cloze queries (ParaRel), shows that pretrained models give inconsistent answers to logically equivalent questions — a poorly structured knowledge representation rather than a stable one.
Why it matters
Empirical grounding for logic and consistency testing — asserting the invariants a sound reasoner must satisfy and watching them break across related questions.
Cited by these methods
Published June 26, 2026
Cite this
Qlarify Labs. (2026). Measuring and Improving Consistency in Pretrained Language Models. Retrieved from https://labs.qlarify.fi/references/measuring-consistency-plms-2021