AI agents & tool use
Agents let a model plan and call tools to act in the world. The autonomy that makes them useful also makes them dangerous: untrusted input can hijack instructions (prompt injection), tool arguments can be hallucinated, and errors compound across steps. The linked findings show concrete failure modes to test for before deploying an agent on untrusted data.
Findings (5)
- Data exfiltration through prompt injection in agents (Safety, Critical)
- Format-constraint violations under strict schemas (Tool use, Medium)
- Hallucinated tool/function arguments (Tool use, High)
- Indirect prompt injection via retrieved content (Prompt injection, Critical)
- Reasoning model regresses on tool use versus its base model (Tool use, Medium)
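As an illustration of testing for hallucinated tool arguments and schema violations, the sketch below checks a model-proposed tool call against a declared specification before anything is executed. It is a minimal example under stated assumptions, not a method taken from the findings: `ToolSpec`, `REGISTRY`, `validate_call`, and the `send_email` tool are all hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class ToolSpec:
    """Hypothetical tool description: argument names mapped to expected types."""
    name: str
    params: dict[str, type]


# Hypothetical registry of the tools the agent is allowed to call.
REGISTRY = {
    "send_email": ToolSpec("send_email", {"to": str, "subject": str, "body": str}),
}


def validate_call(tool: str, args: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the call matches the spec."""
    spec = REGISTRY.get(tool)
    if spec is None:
        # The model named a tool that does not exist.
        return [f"unknown tool: {tool}"]

    problems = []
    # Arguments the spec does not declare are treated as hallucinated.
    for key in sorted(args.keys() - spec.params.keys()):
        problems.append(f"unexpected argument: {key}")
    # Every declared argument must be present and have the expected type.
    for key, expected in spec.params.items():
        if key not in args:
            problems.append(f"missing argument: {key}")
        elif not isinstance(args[key], expected):
            problems.append(f"wrong type for {key}: expected {expected.__name__}")
    return problems
```

A caller would run `validate_call` on each proposed call and refuse to execute it when the returned list is non-empty. A check like this only catches structural errors; it does not detect a well-formed call that was induced by injected instructions.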
Cite this
Qlarify Labs. (2026). AI agents & tool use. Retrieved from https://labs.qlarify.fi/topics/ai-agents-and-tool-use