Research Note

Red teaming RAG systems for poisoned context and over-disclosure

Testing approaches for retrieval abuse, source trust, sensitive context leakage, and model response drift.

Research Note10 min readSecurity Engineering, AI Product Teams

RAG Expands the Attack Surface

Retrieval-augmented generation connects models to enterprise knowledge, but retrieved content can also carry malicious instructions, outdated policy, or over-permissive context. Red teaming must test the retrieval path as carefully as the model response.

Prompt Attacks in Documents

Indirect prompt injection often hides inside tickets, webpages, PDFs, emails, and knowledge articles. A strong test program plants adversarial instructions in realistic sources and verifies whether the application follows retrieval content over system policy.

Over-Disclosure Tests

Teams should test whether the assistant reveals restricted source text, cross-tenant content, secrets, or confidential summaries. Access control drift and weak source filtering can turn a helpful assistant into a data exposure channel.

Remediation Signals

Useful findings identify the source, retrieval path, user role, model response, exploitability, and business impact. Retesting should confirm that source filtering, context inspection, and response controls actually reduced risk.

More Resources

Continue exploring AI security thinking.

Guide

9 min read

The enterprise guide to AI agent runtime security

A practical security model for agents that retrieve context, reason over data, call tools, and act across business systems.

For CISO, Head of AI, Security Engineering

Executive Brief

8 min read

How CISOs can govern workforce AI without slowing adoption

Policy patterns for shadow AI discovery, prompt DLP, app governance, and employee enablement.

For CISO, CIO, Risk and Compliance

Technical Guide

9 min read

Prompt injection defense for production GenAI applications

A layered approach to defending copilots, chatbots, RAG apps, and agents from direct and indirect prompt attacks.

For AI Product Teams, Security Engineering

Request a Demo

Secure the AI your enterprise runs on.

See how Kavalan helps security and AI teams govern workforce AI, protect agentic systems, and continuously validate GenAI risk.

Request a Demo Explore Platform