PaperHigh credibilityarXiv · Greshake et al. · February 1, 2023

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

Our summary

Demonstrates indirect prompt injection against real LLM-integrated applications: adversarial instructions hidden in web pages, emails, or other retrieved content hijack the model when it later processes them — no access to the prompt required. Catalogs concrete attacks (data theft, manipulation) on tool- and retrieval-connected systems.

Why it matters

The academic foundation for the highest-severity agent risk — once a model ingests untrusted content, that content can issue commands. Complements Willison's practitioner framing.

Cited by these methods

🔬 Prompt-injection & jailbreak testing

Related findings (2)

Prompt injection Safety Agents

Published June 26, 2026

Cite this

Qlarify Labs. (2026). Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection. Retrieved from https://labs.qlarify.fi/references/indirect-prompt-injection-greshake-2023