Models miscount letters within a word (e.g. how many 'r's are in a given word) because they reason over tokens, not characters.
Published June 26, 2026
Reproducibility
Often
Severity
Low
Confidence
Reviewer-confirmed
Details
Because text is processed as sub-word tokens, models lack reliable access to individual characters. Asking for letter counts, reversing strings, or character-level edits produces confident but wrong answers. The error is structural to tokenization, not a knowledge gap.