News & Library

Reference library

The best external writing on AI testing, limitations and quality — curated, summarized, and rated. We link out to the source; the value-add is our summary and the findings each piece connects to.

3 references

Paper | High credibility | arXiv
Gorilla: Large Language Model Connected with Massive APIs
Connects an LLM to large API collections and documents the tendency of directly prompted models to hallucinate API calls and arguments; retrieval-aware training reduces but does not eliminate this fabrication.
🐛 1 linked finding | Hallucination | Tool use | Agents
Blog | High credibility | Simon Willison’s Weblog
Prompt injection: what’s the worst that can happen?
An accessible explanation of why prompt injection is hard to fix: once an LLM agent processes untrusted content, that content can hijack its instructions. Walks through concrete exfiltration and abuse scenarios for tool-using assistants.
🐛 2 linked findings | Prompt injection | Safety | Agents
Paper | High credibility | arXiv
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Demonstrates indirect prompt injection against real LLM-integrated applications: adversarial instructions hidden in web pages, emails, or other retrieved content hijack the model when it later processes them — no access to the prompt required. Catalogs concrete attacks (data theft, manipulation) on tool- and retrieval-connected systems.
🐛 2 linked findings | Prompt injection | Safety | Agents
