← AI tech topics

Reasoning & chain-of-thought

Chain-of-thought and dedicated reasoning models trade tokens for accuracy: the model writes out intermediate steps before answering. The gains are real but so are the new failure modes — the stated reasoning is not always the real reasoning (unfaithful chains), performance collapses on problems slightly outside the training distribution, and small irrelevant changes to a question can flip the answer. The linked findings are the largest cluster in the catalog: reasoning is where confident output and actual capability diverge most visibly.

Findings (15)

Methods

References

Cite this

Qlarify Labs. (2026). Reasoning & chain-of-thought. Retrieved from https://labs.qlarify.fi/topics/reasoning-and-chain-of-thought