← Reference library
PaperHigh credibilityarXiv · Greshake et al. · February 1, 2023

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

Our summary

Demonstrates indirect prompt injection against real LLM-integrated applications: adversarial instructions hidden in web pages, emails, or other retrieved content hijack the model when it later processes them — no access to the prompt required. Catalogs concrete attacks (data theft, manipulation) on tool- and retrieval-connected systems.

Why it matters

The academic foundation for the highest-severity agent risk — once a model ingests untrusted content, that content can issue commands. Complements Willison's practitioner framing.

Cited by these methods

Related findings (2)

Published June 26, 2026

Cite this

Qlarify Labs. (2026). Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection. Retrieved from https://labs.qlarify.fi/references/indirect-prompt-injection-greshake-2023