PaperHigh credibilityTACL 2021 (arXiv:2102.01017) · Yanai Elazar, Nora Kassner, Shauli Ravfogel, et al. · February 1, 2021

Measuring and Improving Consistency in Pretrained Language Models

Read the original ↗Archived copy

Our summary

Using paraphrased cloze queries (ParaRel), shows that pretrained models give inconsistent answers to logically equivalent questions — a poorly structured knowledge representation rather than a stable one.

Why it matters

Empirical grounding for logic and consistency testing — asserting the invariants a sound reasoner must satisfy and watching them break across related questions.

Cited by these methods

🔬 Logic & consistency testing

Published June 26, 2026

Cite this

Qlarify Labs. (2026). Measuring and Improving Consistency in Pretrained Language Models. Retrieved from https://labs.qlarify.fi/references/measuring-consistency-plms-2021