Welcome to Learn Harness Engineering
Learn Harness Engineering is a course dedicated to the engineering of AI coding agents. We have deeply studied and synthesized the most advanced Harness Engineering theories and practices in the industry. Our core references include:
- OpenAI: Harness engineering: leveraging Codex in an agent-first world
- Anthropic: Effective harnesses for long-running agents
- Anthropic: Harness design for long-running application development
- Awesome Harness Engineering
Through systematic environment design, state management, verification, and control systems, this course teaches you how to make agentic coding tools like Codex and Claude Code truly reliable. It helps you build features, fix bugs, and automate development tasks by constraining your AI coding assistant with explicit rules and boundaries.
Get started
Choose your learning path to get started. The course is divided into theoretical lectures, hands-on projects, and a copy-ready resource library.
Lectures
Understand why strong models still fail and learn the theory behind effective harnesses.
Projects
Hands-on practice building a reliable agentic environment from scratch.
Resource Library
Copy-ready templates (AGENTS.md, feature_list.json) to use in your own repositories.
The Core Mechanism of a Harness
A harness doesn't "make the model smarter"; rather, it establishes a closed-loop working system for the model. You can understand its core workflow through this simple diagram:
What you will learn
Here are some of the key concepts you will master:
- Constrain agent behavior with explicit rules and boundaries.
- Maintain context across long-running, multi-session tasks.
- Stop agents from declaring victory too early.
- Verify work using full-pipeline tests and self-reflection.
- Make runtime observable and debuggable.
Next steps
Once you understand the core concepts, these guides help you go deeper:
- Lecture 01: Why Capable Agents Still Fail: Start with the theory behind harness engineering.
- Project 01: Baseline vs Minimal Harness: Walk through your first real task.
- Templates: Grab the minimal harness pack (AGENTS.md, feature_list.json, claude-progress.md) for your own projects.