← AI tech topics

Robustness & input perturbations

A robust system gives the same answer to the same question asked slightly differently. LLMs often don't: typos, reordered options, added whitespace or an irrelevant sentence can change the output — which means a single passing test proves little. Robustness testing makes the perturbation systematic (metamorphic relations, paraphrase sets, character-level noise) and measures the flip rate. The linked methods and findings show how small the perturbation can be and still matter.

Findings (4)

Methods

Cite this

Qlarify Labs. (2026). Robustness & input perturbations. Retrieved from https://labs.qlarify.fi/topics/robustness-and-perturbations