Codebase Research Agent

An AI agent that answers technical questions about any public GitHub repository by exploring the code itself — using tool-calling, not embeddings or pre-indexing. Every session is persisted to PostgreSQL; the agent reads prior findings before re-exploring the same repo, making the database a real memory layer.

Stack: Python 3.11 · Django 5.0 · Django REST Framework · PostgreSQL 16 · OpenAI / Anthropic / Ollama

Author: Al Amin Ahamed · github.com/mralaminahamed · alaminahamed.com

Quick Start

Prerequisites: Docker + Docker Compose; an API key for your chosen LLM provider. For Ollama (local inference), no API key is needed — just set OLLAMA_BASE_URL in .env.

# 1. Clone
git clone https://github.com/mralaminahamed/codebase-research-agent.git
cd codebase-research-agent

# 2. Configure
cp .env.example .env
# Edit .env — set LLM_PROVIDER and the matching API key (see Configuration below)

# 3. Start services
docker compose up --build -d

# 4. Run migrations
docker compose exec web python manage.py migrate

# 5. Load sample data (optional — creates two pre-seeded sessions)
docker compose exec web python scripts/seed_sample_data.py

# 6. Create an admin superuser
docker compose exec web python manage.py createsuperuser

Interface	URL
Web UI	`http://localhost:8080/research/`
Admin	`http://localhost:8080/admin/`
API base	`http://localhost:8080/api/`

Web UI

Open http://localhost:8080/research/ for an interactive interface — paste a GitHub URL and a question, hit Research, and watch tool calls stream in real time as the agent explores the repository. Results include the final answer plus a table of every tool invocation and finding the agent produced.

The admin at /admin/ exposes all sessions, tool calls, and findings with full filtering and hierarchy views.

Configuration

All settings are via environment variables in .env. Copy .env.example and populate:

Variable	Default	Required	Description
`SECRET_KEY`	—	✓	Django secret key
`DEBUG`	`True`		Django debug mode
`DATABASE_URL`	`postgres://postgres:postgres@db:5432/research`	✓	Postgres connection string
`LLM_PROVIDER`	`openai`	✓	`openai`, `anthropic`, or `ollama`
`OPENAI_API_KEY`	—	if `openai`	OpenAI API key
`OPENAI_MODEL`	`gpt-4o`		OpenAI model
`ANTHROPIC_API_KEY`	—	if `anthropic`	Anthropic API key
`ANTHROPIC_MODEL`	`claude-sonnet-4-6`		Anthropic model
`OLLAMA_BASE_URL`	`http://localhost:11434`	if `ollama`	Use `host.docker.internal` on Mac/Windows inside Docker
`OLLAMA_MODEL`	`llama3.2`	if `ollama`	Ollama model name
`MAX_AGENT_ITERATIONS`	`20`		Tool-call iteration cap per session
`MAX_INPUT_TOKENS`	`40000`		Cumulative input-token budget per session
`MEDIA_ROOT`	`/app/media`		Where repository clones are stored

Only the active provider's settings are required. The inactive providers' keys can be left blank.

API

Three endpoints under /api/.

`POST /api/sessions/` — Start a research session

curl -s -X POST http://localhost:8080/api/sessions/ \
  -H "Content-Type: application/json" \
  -d '{
    "repo_url": "https://github.com/tiangolo/fastapi",
    "question": "How does FastAPI handle dependency injection internally?"
  }' | jq .

This call is synchronous — it blocks until the agent completes (typically 20–90 seconds for a small repo). Returns the full session including tool calls and findings.

Response shape:

{
  "id": "8a4b1c2d-...",
  "repository": {
    "id": 1,
    "url": "https://github.com/tiangolo/fastapi",
    "name": "tiangolo/fastapi",
    "commit_sha": "a1b2c3..."
  },
  "question": "How does FastAPI handle dependency injection internally?",
  "final_answer": "FastAPI implements dependency injection in fastapi/dependencies/utils.py...",
  "status": "COMPLETE",
  "iterations": 6,
  "input_tokens": 18200,
  "output_tokens": 1100,
  "tool_calls": [
    {
      "sequence": 0,
      "tool_name": "get_previous_findings",
      "arguments": {"repo_url": "https://github.com/tiangolo/fastapi"},
      "result_preview": "No prior findings on this repository.",
      "duration_ms": 12
    }
  ],
  "findings": [
    {
      "id": 1,
      "file_path": "fastapi/dependencies/utils.py",
      "line_start": 415,
      "line_end": 480,
      "note": "solve_dependencies recursively resolves the dependency graph..."
    }
  ]
}

`GET /api/sessions/<uuid>/` — Retrieve a session

curl -s http://localhost:8080/api/sessions/8a4b1c2d-.../ | jq .

`GET /api/repositories/<id>/sessions/` — List past sessions for a repo

curl -s 'http://localhost:8080/api/repositories/1/sessions/?page=1' | jq .

Paginated, page size 20.

How the Agent Works

On every new session, the agent:

Clones (or fetches) the target repository locally (shallow, --depth=1).
First tool call is always get_previous_findings(repo_url) — if prior sessions found relevant code locations, the agent builds on them instead of re-exploring from scratch. This makes the database a real memory layer, not a passive log.
Uses list_files, search_code, and read_file to locate relevant code.
Calls save_finding(file_path, note, line_start, line_end) for each location worth remembering.
Stops when it has enough information, or when the iteration cap (default 20) or token budget (default 40,000 input tokens) is reached.
Returns a final answer that cites specific files and line numbers.

Every tool invocation is persisted as a ToolCall row. Every save_finding call creates a Finding row. Both are visible in the Django admin and returned in the API response.

Project Structure

codebase-research-agent/
├── config/              # Django project (settings, urls, wsgi)
├── research/
│   ├── models.py        # Repository, ResearchSession, ToolCall, Finding
│   ├── admin.py
│   ├── serializers.py
│   ├── views.py         # Three DRF endpoints + research UI view
│   ├── urls.py
│   └── services/
│       ├── llm_adapter.py    # LLMAdapter ABC + OpenAIAdapter + AnthropicAdapter + OllamaAdapter + factory
│       ├── repo_service.py   # Clone, validate, path safety
│       ├── code_tools.py     # list_files, read_file, search_code, get_file_summary
│       ├── db_tools.py       # save_finding, get_previous_findings, list_past_sessions
│       ├── tool_registry.py  # Canonical tool specs + dispatcher
│       └── agent.py          # Provider-agnostic tool-use loop
├── scripts/
│   ├── seed_sample_data.py
│   └── run_demo.sh
├── docker-compose.yml
├── Dockerfile
├── pyproject.toml
├── .env.example
└── README.md

Running Tests

docker compose exec web pytest -v

The test suite covers:

Model invariants (Repository.from_url idempotency, ToolCall sequence uniqueness)
Code tool path-safety (path traversal rejected)
get_previous_findings correctly excludes the current session
OpenAIAdapter and AnthropicAdapter format tools and normalise responses correctly
End-to-end agent integration test against a mocked adapter (provider-agnostic)

No real API calls are made during tests.

Switching LLM Provider

Change LLM_PROVIDER in .env and restart the web container. No other changes needed — the agent loop, tool registry, and database schema are entirely provider-agnostic.

Anthropic:

# .env
LLM_PROVIDER=anthropic
ANTHROPIC_API_KEY=sk-ant-...

docker compose restart web

Ollama (local):

# .env
LLM_PROVIDER=ollama
OLLAMA_BASE_URL=http://host.docker.internal:11434  # Mac/Windows inside Docker
OLLAMA_MODEL=llama3.2

docker compose restart web

Note: Ollama model tool-call support varies. Models that output tool calls as plain JSON text rather than structured function calls will not work reliably with the agent loop. llama3.2 and mistral-nemo have been tested.

Limitations

Public HTTPS GitHub URLs only.
Synchronous request handling — one slow session blocks a worker.
No authentication, no rate limiting (out of scope per brief).
read_file caps at 200 lines per call.
No vector retrieval (ripgrep is sufficient at this scope).

License

MIT.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Codebase Research Agent

Quick Start

Web UI

Configuration

API

`POST /api/sessions/` — Start a research session

`GET /api/sessions/<uuid>/` — Retrieve a session

`GET /api/repositories/<id>/sessions/` — List past sessions for a repo

How the Agent Works

Project Structure

Running Tests

Switching LLM Provider

Limitations

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
assets		assets
config		config
research		research
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
DECISIONS.md		DECISIONS.md
Dockerfile		Dockerfile
README.md		README.md
conftest.py		conftest.py
docker-compose.yml		docker-compose.yml
manage.py		manage.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini

Folders and files

Latest commit

History

Repository files navigation

Codebase Research Agent

Quick Start

Web UI

Configuration

API

POST /api/sessions/ — Start a research session

GET /api/sessions/<uuid>/ — Retrieve a session

GET /api/repositories/<id>/sessions/ — List past sessions for a repo

How the Agent Works

Project Structure

Running Tests

Switching LLM Provider

Limitations

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`POST /api/sessions/` — Start a research session

`GET /api/sessions/<uuid>/` — Retrieve a session

`GET /api/repositories/<id>/sessions/` — List past sessions for a repo

Packages