Named after Pliny the Elder, who authored *Naturalis Historia*, the oldest surviving comprehensive encyclopedic work.
Pliny is an autonomous research agent that fans out queries across multiple AI agents in parallel, then synthesizes the results into a single report. It uses Claude Code and Codex as research backends, authenticating via your existing subscriptions; no API keys needed.
- Bun >= 1.1
- An Anthropic subscription (for Claude Code)
- An OpenAI subscription (for Codex)
- `claude` and `codex` CLIs installed and authenticated
```bash
git clone https://github.com/kevinmichaelchen/pliny.git
cd pliny
bun install
cp pliny.config.example.yaml pliny.config.yaml
```

Edit `pliny.config.yaml` to choose your models:
```yaml
agents:
  claudeModel: claude-sonnet-4-5-20250929   # or claude-opus-4-6
  codexModel: gpt-5.2
  codexReasoningEffort: medium              # minimal | low | medium | high | xhigh
servers: {}                                 # MCP servers are optional; agents use their own tools
```

```bash
# Basic research query
bun run src/cli.ts "What are the tradeoffs of Redis vs Valkey?"

# JSON output
bun run src/cli.ts --format json "History of ice cream"

# Custom config
bun run src/cli.ts --config ./my-config.yaml "your query"
```

Pliny runs a 3-step pipeline:
- **Decompose**: Claude breaks your topic into 3 focused subtopics
- **Fan-out**: Claude researches all subtopics; Codex researches a speed-adjusted subset (see below)
- **Synthesize**: Claude merges all findings into a single markdown report
Each agent uses whatever MCP tools it has configured in its own environment (`~/.mcp.json`, etc.): Perplexity, Exa, web search, and so on.
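
For illustration, here is a minimal sketch of that pipeline in TypeScript. The `Agent` type, the prompts, and the `codexShare` parameter are hypothetical placeholders, not Pliny's actual internals.

```ts
// Minimal sketch of the decompose → fan-out → synthesize flow.
type Agent = (prompt: string) => Promise<string>;

async function research(topic: string, claude: Agent, codex: Agent, codexShare = 1): Promise<string> {
  // 1. Decompose: Claude splits the topic into 3 focused subtopics.
  const raw = await claude(`Split "${topic}" into 3 focused subtopics, one per line.`);
  const subtopics = raw.split("\n").filter(Boolean).slice(0, 3);

  // 2. Fan-out: Claude researches every subtopic in parallel;
  //    Codex covers only a speed-adjusted subset.
  const claudeFindings = await Promise.all(subtopics.map((s) => claude(`Research: ${s}`)));
  const codexFindings = await Promise.all(
    subtopics.slice(0, codexShare).map((s) => codex(`Research: ${s}`)),
  );

  // 3. Synthesize: Claude merges all findings into one markdown report.
  return claude(
    `Write a markdown report on "${topic}" from these findings:\n` +
      [...claudeFindings, ...codexFindings].join("\n---\n"),
  );
}
```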
Pliny knows that different models have different latency characteristics. GPT-5.2 via Codex is roughly 10x slower than Claude Sonnet for end-to-end research tasks. To keep total wall-clock time reasonable, Codex only researches a fraction of subtopics while Claude covers all of them:
> [!NOTE]
> The speed ratios are rough anecdotal observations from end-to-end CLI usage, not formal benchmarks. Raw API latency is actually comparable (~600ms TTFT for GPT-5.2 vs ~800ms for Sonnet), but the Codex CLI adds significant overhead from sandbox startup, reasoning depth, and multi-turn tool loops. Your mileage will vary by query complexity and reasoning effort setting.
| Config | Claude subtopics | Codex subtopics | Why |
|---|---|---|---|
| Sonnet + medium | 3 | 1 | Codex is ~10x slower |
| Sonnet + low | 3 | 1 | Codex is ~5x slower |
| Opus + medium | 3 | 1 | Opus is ~3x, Codex ~10x |
| Sonnet + minimal | 3 | 1 | Codex is ~3x slower |
| Opus + xhigh | 3 | 1 | Both are slow; Codex gets 1 |
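
To make the split concrete, here is an illustrative heuristic (not Pliny's actual code) that reproduces the table: give Codex roughly `total / slowdown` subtopics, flooring at one.

```ts
// Illustrative only: approximate Codex slowdown factors per reasoning effort,
// taken from the anecdotal ratios in the table above.
const slowdown = { minimal: 3, low: 5, medium: 10 } as const;

function codexSubtopicCount(total: number, effort: keyof typeof slowdown): number {
  return Math.max(1, Math.floor(total / slowdown[effort]));
}

// With 3 subtopics and medium effort: max(1, floor(3 / 10)) = 1,
// matching the "Sonnet + medium" row.
console.log(codexSubtopicCount(3, "medium")); // 1
```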
| Setting | Default | Notes |
|---|---|---|
| Subtopics | 3 | Each query is decomposed into 3 subtopics |
| Claude concurrency | 3 | Claude researches all subtopics in parallel |
| Codex concurrency | 1 | Codex researches 1 subtopic (speed-adjusted) |
| Depth | 1 pass | Single decompose-research-synthesize cycle (no recursive loops) |
| Claude maxTurns | 8 | Per-subtopic research turn limit |
| Codex sandboxMode | read-only | Codex runs in a read-only sandbox |
Yes. Ctrl+C kills the process. Because Claude and Codex run as separate
subprocesses via their SDKs, the OS will clean them up. There is no partial
resume; re-run the query from scratch.
Pliny itself doesn't connect to MCP servers by default. Instead, each agent CLI
(Claude Code, Codex) uses its own MCP configuration. Whatever tools you have
set up for `claude` or `codex` will be available during research.
If you want Pliny to connect to MCP servers directly (for the deepagents path
or the MCP server interface), configure them in `pliny.config.yaml`:
```yaml
servers:
  deepwiki:
    url: https://mcp.deepwiki.com/mcp
  perplexity:
    command: npx
    args: ["-y", "@perplexity-ai/mcp-server"]
    env:
      PERPLEXITY_API_KEY: ${PERPLEXITY_API_KEY}
```

See `.specs/mcp-servers.md` for a full list of recommended servers.
| Agent | SDK | Auth | Speed |
|---|---|---|---|
| Claude Code | `@anthropic-ai/claude-agent-sdk` | Anthropic subscription | Fast (Sonnet) to moderate (Opus) |
| Codex | `@openai/codex-sdk` | OpenAI subscription | Varies by `codexReasoningEffort` |
**Speed tip:** For quick demos, use Sonnet + `codexReasoningEffort: medium`.
Opus and `xhigh` produce deeper analysis but take significantly longer.
The CLI writes the final report to stdout (markdown by default, JSON with
`--format json`). Progress logs go to stderr.
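
Because the report goes to stdout and progress goes to stderr, the CLI is easy to drive programmatically. Below is a minimal sketch using Bun's subprocess API; the query and output filename are arbitrary examples.

```ts
// Run the CLI as a subprocess, capture the report from stdout,
// and let progress logs pass through on stderr.
const proc = Bun.spawn(
  ["bun", "run", "src/cli.ts", "--format", "json", "History of ice cream"],
  { stdout: "pipe", stderr: "inherit" },
);

const report = await new Response(proc.stdout).text();
await proc.exited;
await Bun.write("report.json", report);
```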
The markdown report includes:
- Executive summary (2-3 paragraphs)
- Sections for each subtopic with synthesized findings
- Source attribution where available
- Coverage notes
Architecture, deepagents integration, LangGraph primitives, and project
structure are documented in the .specs/ directory.
- **Recursive depth**: Currently the orchestrator runs a single decompose-research-synthesize pass. Support configurable recursion depth where sub-agents can spawn their own sub-agents, with a critic loop that identifies gaps and triggers re-exploration.
- **Configurable branching factor**: Let sub-agents decide how many sub-subtopics to decompose into based on the model they're using and the complexity of the topic, rather than hard-coding 3.
- **Model-aware sub-agent delegation**: Sub-agents should know which models to call and how much to decompose. A fast model (Sonnet) can handle more breadth; a slow model (GPT-5.2 xhigh) should go deep on fewer topics.
- **Shared memory across sub-agents**: Sub-agents currently run in isolation. Implement LangGraph shared memory so findings from one sub-agent inform others in real time (e.g., avoiding duplicate research, building on earlier discoveries).
- **MCP tool hints**: Currently the research prompt just says "use any available tools." Users should be able to specify which MCP servers their agents have (e.g., Perplexity, Exa, DeepWiki) so the orchestrator can encourage agents to use them. Could be configured via `pliny.config.yaml`, CLI flags (`--prefer-tools perplexity,exa`), or both.
- **OpenCode agent backend**: Integrate OpenCode as an additional agent backend (75+ models, daemon mode).
- **Partial resume**: Save intermediate findings so interrupted queries can resume from where they left off.
- **Cost/token tracking**: Track token usage and estimated cost across all agent calls.
- **Streaming output**: Stream findings as sub-agents complete rather than waiting for all to finish.
- **Free model support**: Integrate free model providers so users can run Pliny without paid subscriptions. Candidates: OpenRouter free models, OpenCode Zen, Kilo Code free/budget models, Amp Code free tier.
- **Better model selection**: Expand the model roster beyond Sonnet/Opus and GPT-5.2. Candidates: Haiku, Sonnet (as a lightweight option), GPT-5.2 Low, Kimi K2.5, MiniMax, GigaPotato, GLM-5. This ties into model-aware delegation: the orchestrator should pick models based on cost, speed, and task complexity.
MIT