Hermes Agent + CaMeL

A publishable Hermes fork with CaMeL trust boundaries integrated into the runtime.

This fork is designed for operators who want the Hermes agent loop to distinguish between:

trusted control: system prompt, approved skills, real user turns
untrusted data: tool outputs, retrieved web content, browser content, files, session recall, MCP data

Sensitive tools are authorized against a trusted operator plan instead of instructions embedded in untrusted content.

Research Provenance

This fork is inspired by Google Research's CaMeL paper and reference repository:

This repository does not aim to reproduce the Google research stack exactly, and it does not present itself as a benchmark-equivalent implementation of that repo.

This repository was implemented directly within Hermes and does not vendor Google source code unless explicitly noted in future changes.

Instead, it adapts the core boundary-setting ideas from the paper to Hermes' existing runtime model:

Google's repo is a research artifact aimed at reproducing the paper's results and evaluation setup
this fork is a Hermes-native runtime integration aimed at hardening a production-style agent loop
Google's work includes its own pipeline, interpreter, evaluation assumptions, and benchmark framing
this fork focuses on Hermes-specific trust separation, tool gating, provenance handling, and operator-intent enforcement

In other words, the design is research-inspired, but the implementation and problem framing are specific to Hermes.

What This Fork Changes

This fork adds a runtime security layer centered on agent/camel_guard.py and the Hermes tool loop.

Main additions:

trusted operator plan extraction from real user turns
provenance-aware wrapping for untrusted tool outputs
per-turn security envelope injected into the effective system context
capability gating for sensitive tools
gating for automatic memory flush and synthetic continuation turns
stripping of internal CaMeL metadata before provider API calls

Sensitive capabilities gated by this integration include:

terminal and command execution
file mutation
persistent memory writes
external messaging
scheduled actions
skill mutation
delegation and subagents
browser interaction and selected external side effects

Read-only actions such as send_message(action="list") and cronjob(action="list") remain allowed.

Threat Model

This fork is built to reduce indirect prompt injection risk in the normal Hermes workflow.

The target attack pattern is:

Hermes retrieves untrusted content from the web, a browser session, a file, session recall, or an MCP server.
That content contains hidden or explicit instructions such as "ignore previous instructions", "send a message", or "run this terminal command".
The model attempts to treat that content as control rather than evidence.
Hermes blocks the side effect unless the trusted operator plan explicitly authorizes that capability.

Architecture

1. Trusted operator plan

Hermes derives trusted control from real user turns only. Synthetic system-control turns do not pollute the trusted plan.

2. Untrusted data channel

Tool outputs are treated as untrusted data by default and wrapped with provenance metadata before they re-enter model context.

3. Security envelope

Every turn includes a compact CaMeL security envelope describing the trusted goal, authorized capabilities, and current untrusted source inventory.

4. Capability gating

Side-effecting tools are checked against the trusted operator plan before execution.

5. Provider hygiene

Internal CaMeL metadata is removed before messages are sent to the configured model provider.

Validation

Hermes runtime compatibility

Validated against the Hermes runtime suite:

pytest -q tests/agent/test_camel_guard.py tests/test_run_agent.py

Result:

205 passed

Paper-aligned indirect injection benchmark

A Hermes-specific micro-benchmark aligned to the CaMeL paper/repo important_instructions attack shape was also run.

Observed outcomes:

indirect terminal exfiltration attempt: blocked
indirect external messaging attempt: blocked
indirect persistent-memory write attempt: blocked
indirect browser side-effect attempt: blocked
explicitly authorized terminal use: allowed
safe read-only list action: allowed

Detailed notes:

docs/camel-benchmark.md

Install

Fresh install from this fork

curl -fsSL https://raw.githubusercontent.com/nativ3ai/hermes-agent-camel/main/scripts/install.sh | bash

Then reload your shell and start Hermes:

source ~/.zshrc
hermes

Existing upstream Hermes checkout

Use the camelup installer repo to apply this fork or switch an existing checkout to the CaMeL build.

Runtime Modes

This fork now supports two explicit runtime behaviors from the same checkout:

guarded runtime: CaMeL trust boundaries enforced
legacy runtime: CaMeL disabled for compatibility or comparison

Examples:

hermes --camel-guard on
hermes --camel-guard off
hermes chat --camel-guard monitor -q "Summarize the report"

Mode behavior:

on or enforce: full CaMeL enforcement
monitor: record and surface trust-boundary decisions without enforcing blocks
off or legacy: disable CaMeL and run the legacy runtime behavior

This keeps one codebase and one install path while making it easy to compare guarded and legacy behavior side by side.

Developer setup

git clone https://github.com/nativ3ai/hermes-agent-camel.git
cd hermes-agent-camel
git submodule update --init mini-swe-agent
curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv .venv --python 3.11
source .venv/bin/activate
uv pip install -e ".[all,dev]"
uv pip install -e "./mini-swe-agent"
pytest -q tests/agent/test_camel_guard.py tests/test_run_agent.py

Files Of Interest

agent/camel_guard.py
run_agent.py
hermes_cli/config.py
tests/agent/test_camel_guard.py
tests/test_run_agent.py
docs/camel-benchmark.md

Scope

This fork follows the trust-boundary principles described in the CaMeL paper, but applies them to Hermes' agent runtime rather than to the paper's original evaluation stack.

It is not presented as a full reproduction of the paper's AgentDojo benchmark matrix or as a claim of matching the paper's performance characteristics. The validation here is Hermes-specific and focused on runtime trust boundaries plus paper-aligned indirect injection scenarios.

Upstream Relation

This repository tracks Hermes Agent from Nous Research and carries the CaMeL integration as a focused runtime security extension.

Upstream Hermes: https://github.com/NousResearch/hermes-agent
CaMeL paper: https://arxiv.org/abs/2503.18813
CaMeL repo: https://github.com/google-research/camel-prompt-injection
Upstream PR: NousResearch/hermes-agent#1992
Third-party notices: THIRD_PARTY_NOTICES.md

For the original general-purpose Hermes README, see docs/upstream-readme.md.

Related Add-On

For payment flows that keep the same operator-intent boundary outside the model loop, see:

Hermes PayGuard: https://github.com/nativ3ai/hermes-payguard

Hermes PayGuard is a separate optional plugin, not a core runtime patch. It adds:

safe-by-design USDC transfer intents
Circle user-controlled and developer-controlled flows
Circle CCTP route quoting and attestation tracking
x402 paid fetch flows

That separation is intentional: the payment approval ledger and execution rails belong outside the core CaMeL runtime layer.

Name		Name	Last commit message	Last commit date
Latest commit History 2,305 Commits
.github		.github
.plans		.plans
acp_adapter		acp_adapter
acp_registry		acp_registry
agent		agent
assets		assets
benchmarks/camel_guard/fixtures		benchmarks/camel_guard/fixtures
cron		cron
datagen-config-examples		datagen-config-examples
docs		docs
environments		environments
gateway		gateway
hermes_cli		hermes_cli
honcho_integration		honcho_integration
landingpage		landingpage
mini-swe-agent @ 07aa6a7		mini-swe-agent @ 07aa6a7
optional-skills		optional-skills
scripts		scripts
skills		skills
tests		tests
tinker-atropos @ 65f084e		tinker-atropos @ 65f084e
tools		tools
website		website
.env.example		.env.example
.gitignore		.gitignore
.gitmodules		.gitmodules
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RELEASE_v0.2.0.md		RELEASE_v0.2.0.md
RELEASE_v0.3.0.md		RELEASE_v0.3.0.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
batch_runner.py		batch_runner.py
cli-config.yaml.example		cli-config.yaml.example
cli.py		cli.py
hermes		hermes
hermes_constants.py		hermes_constants.py
hermes_state.py		hermes_state.py
hermes_time.py		hermes_time.py
mini_swe_runner.py		mini_swe_runner.py
minisweagent_path.py		minisweagent_path.py
model_tools.py		model_tools.py
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
rl_cli.py		rl_cli.py
run_agent.py		run_agent.py
setup-hermes.sh		setup-hermes.sh
toolset_distributions.py		toolset_distributions.py
toolsets.py		toolsets.py
trajectory_compressor.py		trajectory_compressor.py
utils.py		utils.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hermes Agent + CaMeL

Research Provenance

What This Fork Changes

Threat Model

Architecture

1. Trusted operator plan

2. Untrusted data channel

3. Security envelope

4. Capability gating

5. Provider hygiene

Validation

Hermes runtime compatibility

Paper-aligned indirect injection benchmark

Install

Fresh install from this fork

Existing upstream Hermes checkout

Runtime Modes

Developer setup

Files Of Interest

Scope

Upstream Relation

Related Add-On

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hermes Agent + CaMeL

Research Provenance

What This Fork Changes

Threat Model

Architecture

1. Trusted operator plan

2. Untrusted data channel

3. Security envelope

4. Capability gating

5. Provider hygiene

Validation

Hermes runtime compatibility

Paper-aligned indirect injection benchmark

Install

Fresh install from this fork

Existing upstream Hermes checkout

Runtime Modes

Developer setup

Files Of Interest

Scope

Upstream Relation

Related Add-On

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages