Band Thresholds
| Band | Retrieval Rate | Severity | Meaning |
|---|---|---|---|
| Critical | ≥ 75% | CRITICAL | Poisoned document reliably retrieved across most queries. Very high likelihood of reaching the LLM context window. |
| High | ≥ 50% | HIGH | Poisoned document retrieved for a majority of queries. Likely to influence LLM behavior in many interactions. |
| Medium | ≥ 25% | MEDIUM | Poisoned document retrieved for some queries. Impact depends on which queries are affected. |
| Low | < 25% | LOW | Poisoned document rarely retrieved. Poisoning is ineffective for this corpus/query combination. |
How Retrieval Rate Is Calculated
For each query in the test set, RXP checks whether the poisoned document appears in the top-k retrieved results. The retrieval rate is:--top-k on the CLI or the web UI form).
Example
A validation run with 5 queries and top-k of 5:Factors Affecting Band Placement
Corpus size — Larger knowledge bases increase competition. A poisoned document that ranks in top-5 with 10 corpus documents may fall to rank 50 with 500 documents, reducing the retrieval rate. Embedding model — Different models produce different similarity scores. Test across models with--model all to see if poisoning effectiveness varies.
Query specificity — Broad queries (“tell me about company policies”) retrieve more diverse documents. Specific queries (“what is the vacation policy?”) may rank a topically-matched poison document higher.
Poison document quality — Documents that are semantically closer to the target queries rank higher. A poison document about “vacation policy guidelines” will outrank one about “general HR information” for vacation-related queries.
Using Bands for Decision-Making
Critical/High — The poisoned document reliably reaches the LLM. If this were a real attack, the injected instructions would likely execute. Review document upload controls, implement content validation, and consider retrieval-time filtering. Medium — Inconsistent retrieval. The attack works for some queries but not others. Lower priority but worth monitoring, especially if the affected queries are high-value. Low — Poisoning is ineffective with this configuration. Standard RAG retrieval adequately dilutes the poisoned document. Acceptable risk in most scenarios.Comparing Across Models
Run validation with--model all to compare bands across every registered embedding model: