OtherEstablished

Unit testing the deterministic scaffold

Test the deterministic code around the model — prompt builders, output parsers, schema validators, tool wrappers — in isolation, with exact assertions, the way you'd test any software.

Published June 26, 2026

Tool use Reliability

How it works

An LLM feature is mostly ordinary software. The prompt-assembly, the JSON validators, the retry and back-off logic, the function that parses the model's reply — none of that is probabilistic, so it deserves ordinary unit tests with exact, deterministic assertions. Unit-testing the scaffold catches the broken APIs, schema mismatches, and off-by-one prompt-assembly bugs that otherwise get misattributed to 'the model being flaky'. It is the cheapest, most reliable layer of an AI test harness precisely because it removes the model from the equation.

When to use it

Around every non-model component: prompt templating, output parsing and validation, tool/function wrappers, token and cost accounting, access-control checks.

Limitations

Says nothing about the model's behaviour — a fully green unit suite can still ship a system that hallucinates. It guards the plumbing, not the judgement; pair it with the probabilistic and robustness methods.

Method yield

Findings: 1
Versions spanned: 4
Yield score: 3

1 Medium

Severity-weighted across the published findings below. Why we measure this →

Findings it surfaces (1)

Documented failures this method catches — the evidence it works.

Format-constraint violations under strict schemasMedium
Asked for strictly-formatted output (e.g. JSON to a schema), models emit invalid or extra content.
How it found it: A schema-validation unit test rejects the malformed output deterministically, before it ever reaches a parser.
Tool use

References & further reading

The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction
Eric Breck, Shanqing Cai, Eric Nielsen, Michael Salib, D. Sculley · IEEE Big Data 2017 · December 11, 2017

Cite this

Qlarify Labs. (2026). Unit testing the deterministic scaffold. Retrieved from https://labs.qlarify.fi/methods/unit-testing