Snag

Snag

Beta - This tool is in active development. If you encounter any issues, please report them.

Screenshot-to-text CLI tool powered by vision AI (Google Gemini, OpenRouter, or Z.AI).

Capture any region of your screen and instantly get a markdown description in your clipboard - ready to paste into an LLM, document, or anywhere else.

Features

Region selection - Click and drag to capture any part of your screen
Multi-monitor support - Works across all your displays
Smart transcription - Handles text, code, diagrams, charts, UI elements, and images
Instant clipboard - Results copied automatically, ready to paste
Multiple providers - Google Gemini, OpenRouter, or Z.AI (GLM-4.6V)
Cross-platform - Linux (X11/Wayland), Windows, macOS

Installation

# Install with uv (recommended)
uv tool install git+https://github.com/am-will/snag.git

# Update to latest version
snag --update

Linux Dependencies

X11:

# For clipboard support
sudo apt install xclip  # or xsel

Wayland:

sudo apt install slurp grim wl-clipboard

macOS Permissions

On first run, macOS will prompt for Screen Recording permissions:

Grant permission when prompted
Restart the app (required for permissions to take effect)
On second run, grant "bypass screen capture restrictions" if prompted

You can manage these in: System Settings → Privacy & Security → Screen Recording

Setup

Run snag --setup to configure your API keys and defaults:

$ snag --setup

==================================================
  Current Settings
==================================================

  API Keys:
    Google Gemini:  not configured
    OpenRouter:     not configured
    Z.AI:           not configured

  Defaults:
    Provider: google
    Model:    gemini-2.5-flash

==================================================
  Setup Menu
==================================================

  1. Configure Google Gemini API key
  2. Configure OpenRouter API key
  3. Configure Z.AI API key
  4. Set default provider
  5. Set default model
  6. Exit setup

Get your API keys:

Google Gemini: https://aistudio.google.com/apikey (free)
OpenRouter: https://openrouter.ai/keys
Z.AI: https://open.bigmodel.cn/ (requires GLM Coding Plan)

Usage

# Capture a region (uses defaults from config)
snag

# Use Google Gemini directly
snag --provider google --model gemini-2.5-flash

# Use OpenRouter with any supported model
snag --provider openrouter --model google/gemini-2.5-flash-lite
snag --provider openrouter --model anthropic/claude-3.5-sonnet

# Use Z.AI (GLM-4.6V via MCP) - requires Node.js >= 22
snag --provider zai

# Configure API keys and defaults
snag --setup

# Update to latest version
snag --update

Controls

Action	Result
Left-click + drag	Select region
Release mouse	Capture and process
Right-click	Cancel
Escape or q	Cancel

Keyboard Shortcut (Recommended)

Set up a global keyboard shortcut in your desktop environment to run snag:

GNOME: Settings → Keyboard → Custom Shortcuts → Add:

Name: Snag
Command: /path/to/snag (or just snag if in PATH)
Shortcut: Your choice (e.g., Super+Shift+S)

KDE: System Settings → Shortcuts → Custom Shortcuts → Add

Config Files

Configuration is stored in ~/.config/snag/:

API Keys (.env):

GEMINI_API_KEY="your-gemini-key"
OPENROUTER_API_KEY="your-openrouter-key"
Z_AI_API_KEY="your-zai-key"

Settings (config.toml):

[defaults]
provider = "google"
model = "gemini-2.5-flash"

This location works with keyboard shortcuts (which don't have access to shell environment variables).

Example Output

Capture a code snippet:

**Python Code:**
def hello_world():
    print("Hello, World!")

Capture a diagram:

**Flowchart showing:**
- Start → Process A → Decision
- If yes → Process B → End
- If no → Process C → End

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
snag		snag
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
plan.md		plan.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Snag

Features

Installation

Linux Dependencies

macOS Permissions

Setup

Usage

Controls

Keyboard Shortcut (Recommended)

Config Files

Example Output

License

About

Uh oh!

Releases

Packages

Contributors 2

Languages

License

am-will/snag

Folders and files

Latest commit

History

Repository files navigation

Snag

Features

Installation

Linux Dependencies

macOS Permissions

Setup

Usage

Controls

Keyboard Shortcut (Recommended)

Config Files

Example Output

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages