Disallowed content encoded with look-alike Unicode characters or altered spacing can slip past safety filters.
Published June 26, 2026
Reproducibility
Sometimes
Severity
High
Confidence
Reviewer-confirmed
Details
Replacing characters with visually similar Unicode characters (homoglyphs) or inserting zero-width characters can cause input filters and the model to mishandle disallowed requests. This is both a robustness issue and a safety-bypass surface.
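
One possible mitigation is to canonicalize text before it reaches a filter. The sketch below is illustrative only and is not the filter described in this report. It assumes a hypothetical `canonicalize` helper and a small, hand-picked `CONFUSABLES` table. A real deployment would likely need a much fuller mapping, such as the Unicode confusables data.

```python
import unicodedata

# Hypothetical, deliberately incomplete homoglyph table (Cyrillic/Greek -> Latin).
# NFKC normalization does NOT fold these, so they need an explicit mapping.
CONFUSABLES = {
    "\u0430": "a",  # Cyrillic small a
    "\u0435": "e",  # Cyrillic small ie
    "\u043e": "o",  # Cyrillic small o
    "\u0440": "p",  # Cyrillic small er
    "\u0441": "c",  # Cyrillic small es
    "\u0445": "x",  # Cyrillic small ha
    "\u0456": "i",  # Cyrillic small Byelorussian-Ukrainian i
    "\u03bf": "o",  # Greek small omicron
}


def canonicalize(text: str) -> str:
    """Return a filter-friendly form of `text` (assumed helper, not a standard API)."""
    # NFKC folds compatibility forms, e.g. fullwidth letters, into plain ones.
    text = unicodedata.normalize("NFKC", text)
    out = []
    for ch in text:
        # Category "Cf" (format) covers zero-width space, joiners, and the BOM.
        # Dropping it also breaks emoji joiner sequences, which is acceptable
        # for a filter-side copy but not for text shown back to the user.
        if unicodedata.category(ch) == "Cf":
            continue
        out.append(CONFUSABLES.get(ch, ch))
    return "".join(out).casefold()


def is_obfuscated(text: str) -> bool:
    """Flag input whose canonical form differs from its case-folded original."""
    return canonicalize(text) != text.casefold()
```

For example, `canonicalize("b\u200bl\u043e\u0441ked")` yields `"blocked"`, so a keyword filter run on the canonical form sees the same string as the unobfuscated request. This does not address visible spacing tricks or the model's own handling of the raw input.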