Small-Rule Guardrails for Retrieval-Augmented Generation

Public artifact bundle for the preprint:

Small-Rule Guardrails for Retrieval-Augmented Generation: Prompt Injection and Vector Poisoning Checks Mukunda Rao Katta. ORCID: 0009-0007-6071-3896. License: CC BY 4.0.

A compact engineering note on small-rule guardrails sitting between retrieval and prompt construction in RAG pipelines, implemented as two zero-dependency npm packages.

Reference implementations

@mukundakatta/prompt-injection-shield — string-pattern checks for adversarial instructions in retrieved text
@mukundakatta/vector-poison-score — heuristic risk score for retrieved chunks (oversize, secret-exfil patterns, suspicious links)

Files in this repo

File	Purpose
`rag-guardrails-small-rule-preprint.pdf`	Submission-ready manuscript
`paper.md`	Source draft
`paper.bib`	Bibliography
`abstract.txt`	Upload-ready abstract
`keywords.txt`	Suggested keywords
`assets/workflow-figure.svg`	Workflow figure
`submission-metadata.json`	Structured submission metadata
`render_preprint_pdf.py`	Reproduces the PDF locally
`rag-guardrails-figshare-package.zip`	Bundle for Figshare/Zenodo deposit

Abstract

Retrieval-augmented generation systems often treat retrieved text as helpful evidence, but retrieved text can also contain adversarial instructions, suspicious link patterns, oversized chunks, or secret-exfiltration requests. This paper presents a small-rule guardrail approach implemented through two zero-dependency JavaScript packages: prompt-injection-shield and vector-poison-score. The method is deliberately lightweight. It scans retrieved documents and tool outputs before they are inserted into model context, reports explicit risk reasons, and supports filtering or line stripping as a simple containment step. The contribution is not a replacement for full security review or large-scale benchmark evaluation. Instead, it offers an inspectable baseline that developers can place between retrieval and prompt construction while building, testing, and auditing agentic RAG workflows.

Citation

Cite via the Zenodo or Figshare DOI:

Katta, M. R. (2026). Small-Rule Guardrails for Retrieval-Augmented Generation: Prompt Injection and Vector Poisoning Checks (Version v1). Zenodo. https://doi.org/10.5281/zenodo.20057632

Figshare mirror: https://doi.org/10.6084/m9.figshare.32193543

License

CC BY 4.0 for the manuscript and figures. MIT for the reference packages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Small-Rule Guardrails for Retrieval-Augmented Generation

Reference implementations

Files in this repo

Abstract

Citation

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
README.md		README.md
abstract.txt		abstract.txt
keywords.txt		keywords.txt
paper.bib		paper.bib
paper.md		paper.md
rag-guardrails-figshare-package.zip		rag-guardrails-figshare-package.zip
rag-guardrails-small-rule-preprint.pdf		rag-guardrails-small-rule-preprint.pdf
render_preprint_pdf.py		render_preprint_pdf.py
submission-metadata.json		submission-metadata.json

Folders and files

Latest commit

History

Repository files navigation

Small-Rule Guardrails for Retrieval-Augmented Generation

Reference implementations

Files in this repo

Abstract

Citation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages