v3.0.0 โ€ข 1852 tests passing โ€ข MIT Licensed

Universal Data Layer for AI Systems

Transform documentation, GitHub repos, PDFs, and codebases into structured knowledge for any AI system. 16 output formats. One tool for RAG pipelines, AI coding assistants, and Claude skills.

Quick Install
$ pip install skill-seekers
1852 Tests
16 Output Formats
4 Input Sources
20-40 min / skill

Any Source โ†’ Any AI System

One tool unifies all your data preprocessing needs

Input Sources

๐ŸŒ

Documentation

Any doc site, Docusaurus, GitBook, ReadTheDocs

๐Ÿ™

GitHub Repos

Public & private repos, with AST analysis

๐Ÿ“„

PDF Files

Scanned docs, manuals, research papers with OCR

๐Ÿ’ป

Local Codebases

27+ languages, game engines, custom projects

16 Output Formats

RAG / Vectors

LangChain LlamaIndex Chroma FAISS Haystack Qdrant Weaviate

AI Platforms

Claude Gemini OpenAI

AI Coding

Cursor Windsurf Cline Continue.dev

Generic

Markdown JSON YAML
Example: React โ†’ LangChain
# One command: scrape docs
$ skill-seekers scrape --config react.json
# One command: package for any format
$ skill-seekers package output/react --target langchain
# Done! Ready for your RAG pipeline

Universal Preprocessing for AI

Skill Seekers is the universal data layer for AI systems. It transforms documentation websites, GitHub repositories, PDF files, and local codebases into production-ready formats for RAG pipelines, AI coding assistants, Claude skills, and any LLM platform.

The Problem

  • ร— 70% of RAG development time is spent on data preprocessingโ€”scraping, cleaning, chunking
  • ร— AI coding assistants don't know your frameworks without manual context injection
  • ร— Multi-source knowledge (docs + code + PDFs) requires complex integration
  • ร— Different AI systems need different formatsโ€”LangChain, LlamaIndex, Cursor, Claude

The Solution

  • โœ“ One tool for all sources: docs, GitHub repos, PDFs, and local codebases
  • โœ“ Smart chunking preserves context, code blocks, and hierarchical structure
  • โœ“ 16 output formats: RAG pipelines, AI coding assistants, Claude skills, vector DBs
  • โœ“ 15-45 minutes end-to-end: from source to production-ready AI knowledge
โšก

10x Faster Development

Stop copying docs manually. Generate comprehensive skills in minutes, not hours.

๐ŸŽฏ

Framework Expertise

Give AI assistants deep knowledge of any framework with API references and examples.

๐Ÿ”„

Always Up-to-Date

Re-run when docs update. Keep your AI knowledge fresh and accurate.

Universal Intelligence Platform

Transform any documentation into structured knowledge for any AI system

๐ŸŒŠ
NEW - v2.6.0

Three-Stream Analysis

Split GitHub repos into Code (C3.x), Docs, and Insights streams for comprehensive skills

๐ŸŒ
Data Sources

Multi-Source Scraping

Extract from documentation websites, GitHub repositories, and PDF files

โœจ
AI Features

AI Enhancement

Automatically add explanations, examples, and best practices using Claude

๐Ÿค–
NEW - v3.0.0

16 LLM Platforms

Deploy to LangChain, LlamaIndex, Chroma, Claude, Gemini, OpenAI, Cursor, and more

๐Ÿ“ฆ
Configuration

24 Preset Configs

Ready-to-use configs for popular frameworks (React, Vue, Django, etc.)

๐Ÿ”ง
NEW - v3.0.0

26 MCP Tools

AI agents can prepare their own knowledge with 26 MCP tools (v3.0.0)

โœ…
Quality

1852 Tests

Production-ready with 1852 tests across 100 test files (v3.0.0)

โšก
Automation

Zero Manual Work

Fully automated pipeline from source to production-ready skill

โฑ๏ธ
Performance

20-40 Minutes

Complete skill generation in under an hour, including AI enhancement

+16 more features

Ready to transform your documentation?

Get Started Now

Get Started in 3 Steps

From zero to production-ready skill in 20-40 minutes

1

1. Install

Install from PyPI in seconds

pip install skill-seekers
2

2. Scrape Docs

Use preset configs or create your own

skill-seekers scrape --config react
# Or from URL directly
skill-seekers scrape --url https://react.dev --name react
3

3. Package & Upload

Create .zip and upload to Claude

skill-seekers package output/react/
skill-seekers upload react.zip

# Done! Your skill is ready to use

Multiple Installation Options

PyPI (Recommended)

Easiest
pip install skill-seekers

uv (Modern)

Fast
uv tool install skill-seekers

From Source

Dev
git clone && pip install -e .

MCP Integration

5 Agents
./setup_mcp.sh

Who Uses Skill Seekers?

From solo developers to enterprise teams

๐Ÿ‘จโ€๐Ÿ’ป

For Developers

Create skills from documentation + GitHub repos with automatic conflict detection.

"Build a React skill from official docs + GitHub repo, catch API changes before they surprise you."

๐ŸŽฎ

For Game Developers

Generate comprehensive skills for game engines like Godot (handles 40K+ pages!).

"Create complete Godot skill covering all topics with intelligent router/hub pattern."

๐Ÿ‘ฅ

For Teams

Combine internal docs + code repositories into single source of truth.

"Share custom configs via private git repos across 3-500+ team members."

๐Ÿ“š

For Learners

Build comprehensive skills from docs, code examples, and PDF tutorials.

"Combine official docs + GitHub examples + PDF manual into one unified learning resource."

๐Ÿ”

For Open Source

Analyze repos to find documentation gaps and outdated examples automatically.

"Detect discrepancies between documentation and actual code implementation."

Multi-Platform Support

Export your skills to multiple LLM platforms with platform-specific optimizations

๐Ÿค– Claude AI
๐Ÿ’Ž Google Gemini
๐Ÿ”ฎ OpenAI ChatGPT
๐Ÿ“ Generic Markdown

By the Numbers

Trusted metrics from a production-ready tool

โญ
9,114
GitHub Stars
๐Ÿด
899
Forks
๐Ÿ‘ฅ
28
Contributors
โœ…
1852
Tests Passing
๐Ÿค–
16
Output Formats
๐Ÿ”ง
26
MCP Tools

Open source โ€ข MIT Licensed โ€ข Active development