🍯 Honeypot Dashboard

A Cowrie SSH honeypot disguised as a Solana validator node, with a live web dashboard showing real-time attacker sessions, geolocation, and LLM-generated behavior descriptions.

What It Does

This project runs an SSH honeypot that masquerades as a misconfigured Solana validator node. When attackers connect, Cowrie logs every login attempt, shell command, and file download. The dashboard processes these logs and presents:

Live attack map — Geographic visualization of attacker origins using Leaflet.js
Attacker leaderboard — Top attackers ranked by attempts, with nicknames and ISP info
Session replays — What attackers typed after logging in, with command annotations
LLM-generated descriptions — Natural language summaries of attacker behavior (e.g., "Full hardware audit — profiling this box for cryptomining potential")
Credential analytics — Most common username/password combinations tried
Daily breakdowns — Session counts, unique IPs, and success rates over time

Architecture

Internet → Port 22 (iptables NAT) → Cowrie honeypot (sandboxed)
                                          ↓
                                      JSON logs
                                          ↓
                     generate.py (parse + GeoIP + LLM describe + render HTML)
                                          ↓
                                    dashboard.html (self-contained)
                                          ↓
                           serve.py ← nginx reverse proxy (HTTPS)

                     analytics.py (incremental session analysis, geo tracking)

Data Flow

Cowrie captures SSH connections on port 22 (via iptables NAT redirect) and logs events as JSON
generate.py runs every 5 minutes via cron, parses the last 7 days of logs, performs batch GeoIP lookups, generates LLM descriptions for interesting sessions, and renders a self-contained HTML dashboard
serve.py serves the dashboard on localhost, behind nginx with TLS and HTTP basic auth
analytics.py runs every 5 minutes, incrementally processing new log entries and maintaining aggregated statistics with 30-day retention

The Bait

The honeypot is themed as a Solana validator node to attract crypto-targeting attackers:

Fake Solana wallet with seed phrases in .env
Planted credentials in .bash_history
Realistic validator configuration files (keypair, stake account, vote account)
Enticing directory structure that rewards exploration
Fake systemd service for solana-validator

This disguise is effective — many attackers specifically try Solana-related credentials (solana:solana, sol:validator, validator:validator).

Key Features

Session Intelligence

3-layer description system: Dictionary lookup → regex pattern matching → LLM generation (via Ollama)
Attacker nicknames: Country-themed names (e.g., tulip_sol, dragon_root) based on origin and behavior
Command annotations: Inline technical notes on what each command does and why an attacker would run it
Description caching: LLM descriptions are cached to avoid redundant inference

Robustness

Incremental log processing — Byte-offset tracking, doesn't re-read entire log files
Log rotation detection — Handles file size changes gracefully (no line-counting fragility)
Atomic file writes — All JSON caches and the dashboard HTML use temp file + os.rename to prevent corruption
Gzip-aware parsing — Detects rotated .gz logs by magic bytes, not file extension
Session deduplication — Events are deduped by (session, timestamp, eventid)
7-day rolling window — Dashboard shows recent activity, not all-time data
30-day data retention — Analytics pruning prevents unbounded disk growth
Ollama health checks — Gracefully skips LLM descriptions if Ollama is down; uses cached/pattern-matched descriptions instead

Security Hardening

XSS prevention — All attacker-controlled data (usernames, passwords, commands, ISP names) is HTML-escaped before rendering
Directory traversal protection — serve.py only serves dashboard.html; all other paths return 404
HTTPS rate limiting — nginx limit_req on both HTTP and HTTPS endpoints prevents brute-forcing
Localhost binding — serve.py binds to 127.0.0.1 only; nginx handles external access
HTTP basic auth — Dashboard is password-protected
No sensitive data in output — The generated HTML contains only attacker IPs and their activity, not server configuration

Dashboard UI

Dark hacker-aesthetic theme with CRT scanline effect
Interactive Leaflet.js map with pulsing markers
Chart.js visualizations for credentials and attack timeline
Click-to-fly: click any attacker nickname to zoom to their location on the map
Mobile-responsive layout
Auto-refresh every 30 seconds

File Structure

honeypot-dashboard/
├── README.md
├── .gitignore
└── app/
    ├── generate.py              # Log parser + GeoIP + LLM + HTML renderer
    ├── serve.py                 # HTTP server (localhost:9999, behind nginx)
    ├── analytics.py             # Incremental analytics with byte-offset tracking
    ├── dashboard.html           # Generated output (gitignored)
    ├── description_cache.json   # LLM description cache (gitignored)
    ├── geoip_cache.json         # GeoIP lookup cache (gitignored)
    └── analytics.json           # Aggregated analytics data (gitignored)

Setup

Prerequisites

A VPS or server you're comfortable exposing to the internet
Python 3.10+
Cowrie SSH/Telnet honeypot
Ollama with a small model (e.g., qwen3:4b) — optional but recommended
nginx with Let's Encrypt for TLS

1. Install Cowrie

Follow the official Cowrie installation guide. Key steps:

# Create cowrie user
sudo adduser --disabled-password cowrie

# Clone and set up Cowrie
sudo -u cowrie git clone https://github.com/cowrie/cowrie /home/cowrie/cowrie
cd /home/cowrie/cowrie
sudo -u cowrie python3 -m venv cowrie-env
sudo -u cowrie ./cowrie-env/bin/pip install -r requirements.txt

# Configure Cowrie to listen on a high port (e.g., 2223)
# Then redirect port 22 to it via iptables:
sudo iptables -t nat -A PREROUTING -p tcp --dport 22 -j REDIRECT --to-port 2223

# Move your real SSH to a non-standard port first!

2. Install the Dashboard

# Clone this repo
git clone https://github.com/brezgis/honeypot-dashboard.git /home/dashboard

# Install Ollama (optional, for LLM descriptions)
curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen3:4b  # or any small model

# The dashboard reads logs from Cowrie's default location:
#   /home/cowrie/cowrie/var/log/cowrie/cowrie.json
# If your Cowrie logs are elsewhere, edit LOG_PATH in generate.py and analytics.py

3. Configure nginx

Example nginx config for HTTPS with Let's Encrypt:

limit_req_zone $binary_remote_addr zone=dashboard:10m rate=5r/s;

server {
    listen 443 ssl;
    server_name your-domain.example.com;

    ssl_certificate /etc/letsencrypt/live/your-domain.example.com/fullchain.pem;
    ssl_certificate_key /etc/letsencrypt/live/your-domain.example.com/privkey.pem;

    limit_req zone=dashboard burst=10 nodelay;
    auth_basic "Honeypot Dashboard";
    auth_basic_user_file /etc/nginx/.htpasswd;

    location / {
        proxy_pass http://127.0.0.1:9999;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
    }
}

Create the password file:

sudo apt install apache2-utils
sudo htpasswd -c /etc/nginx/.htpasswd your-username

4. Set Up Cron Jobs

# Run as the dashboard user (not root)
crontab -e

# Regenerate dashboard every 5 minutes
*/5 * * * * cd /home/dashboard/app && /usr/bin/python3 generate.py >> /var/log/honeypot-dashboard.log 2>&1

# Run analytics every 5 minutes (stagger by 2 minutes to avoid contention)
2-57/5 * * * * cd /home/dashboard/app && /usr/bin/python3 analytics.py >> /var/log/honeypot-analytics.log 2>&1

5. Start the Server

# Run serve.py as a systemd service or in screen/tmux
cd /home/dashboard/app
python3 serve.py

Or create a systemd service:

[Unit]
Description=Honeypot Dashboard Server
After=network.target

[Service]
User=dashboard
WorkingDirectory=/home/dashboard/app
ExecStart=/usr/bin/python3 serve.py
Restart=always
RestartSec=5

[Install]
WantedBy=multi-user.target

Configuration

Key settings are at the top of each script:

generate.py

Variable	Default	Description
`LOG_PATH`	`/home/cowrie/cowrie/var/log/cowrie/cowrie.json`	Cowrie JSON log location
`LOCAL_TZ`	`America/New_York`	Timezone for dashboard timestamps

serve.py

Variable	Default	Description
`PORT`	`9999`	HTTP server port (localhost only)
`MIN_REGEN_INTERVAL`	`30`	Minimum seconds between regenerations

analytics.py

Variable	Default	Description
`LOG_PATH`	`/home/cowrie/cowrie/var/log/cowrie/cowrie.json`	Cowrie JSON log location
`RETENTION_DAYS`	`30`	Days to keep analytics data before pruning

LLM Model

The LLM model is specified in generate.py's llm_generate() function. Default is qwen3:4b. Any Ollama-compatible model works — smaller models are faster, larger ones produce better descriptions.

Companion: Discord Alert Bot

A separate watcher script (not included in this repo) can monitor Cowrie logs in real-time and send Discord alerts for successful logins, interesting commands, and file downloads. It runs on a separate machine and reads logs via SSH, providing immediate notification of attacker activity.

Tech Stack

Cowrie — SSH/Telnet honeypot framework
Python 3 — Dashboard generation, HTTP serving, analytics
Ollama — Local LLM inference for session descriptions
Leaflet.js — Interactive attack origin map
Chart.js — Credential and timeline visualizations
ip-api.com — Batch GeoIP lookups (free tier)
nginx — Reverse proxy with TLS termination and rate limiting
Let's Encrypt — Free TLS certificates via certbot

How the LLM Descriptions Work

The description system uses a 3-layer approach for efficiency:

Layer 1 — Command annotations (instant): A dictionary maps ~50 common commands to short technical notes (e.g., uname -a → "OS/kernel identification"). Plus ~40 regex patterns for compound commands.
Layer 2 — Pattern matching (instant): Regex-based classification of common attack patterns. Returns varied descriptions (8+ options per category, seeded by IP hash for deterministic output).
Layer 3 — LLM generation (cached): For novel sessions that don't match known patterns, a few-shot prompt sends the session details to a local Ollama model. The response is cached in description_cache.json, so each unique session is only described once.

The prompt uses a raw/few-shot format with real examples to guide the model toward concise, technical, opinionated descriptions. Bad outputs (meta-commentary, refusals, too-short responses) are detected and filtered.

Made by

Anna Brezgis and Claude — brezgis.com

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍯 Honeypot Dashboard

What It Does

Architecture

Data Flow

The Bait

Key Features

Session Intelligence

Robustness

Security Hardening

Dashboard UI

File Structure

Setup

Prerequisites

1. Install Cowrie

2. Install the Dashboard

3. Configure nginx

4. Set Up Cron Jobs

5. Start the Server

Configuration

generate.py

serve.py

analytics.py

LLM Model

Companion: Discord Alert Bot

Tech Stack

How the LLM Descriptions Work

Made by

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
app		app
.gitignore		.gitignore
README.md		README.md

brezgis/honeypot-dashboard

Folders and files

Latest commit

History

Repository files navigation

🍯 Honeypot Dashboard

What It Does

Architecture

Data Flow

The Bait

Key Features

Session Intelligence

Robustness

Security Hardening

Dashboard UI

File Structure

Setup

Prerequisites

1. Install Cowrie

2. Install the Dashboard

3. Configure nginx

4. Set Up Cron Jobs

5. Start the Server

Configuration

generate.py

serve.py

analytics.py

LLM Model

Companion: Discord Alert Bot

Tech Stack

How the LLM Descriptions Work

Made by

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages