← Reference library
PaperHigh credibilityarXiv · Xiong et al. · June 1, 2023

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

Our summary

An empirical evaluation of confidence elicitation across LLMs, finding that models verbalize high confidence even when wrong and are generally overconfident — plausibly imitating human patterns of asserting certainty.

Why it matters

Stated certainty is a poor signal of correctness, so any system that gates on a model's self-reported confidence inherits that miscalibration.

Related findings (1)

Published June 26, 2026

Cite this

Qlarify Labs. (2026). Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs. Retrieved from https://labs.qlarify.fi/references/llm-confidence-elicitation-2023