Agentic Coding Tool Evals

Micro apps to compare and evaluate Agentic Coding Tools in a hands-on way.

Apps

apps/ui_component_eval/* - Which Agentic Coding Tool can adhere to the prompts (apps/ui_component_eval/README.md) and build the best UI component?

Setup

Using Claude Code, run the custom slash command /trees "claude_code,gemini_cli" to create the worktrees.
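
If Claude Code is not available, the worktrees can also be created by hand. The following is a minimal sketch of what that might look like, not the actual implementation of /trees. The function name create_worktrees, the trees/<tool> directories, and the eval/<tool> branch names are all assumptions made for illustration; adjust them to whatever layout the eval apps expect.

```shell
# Hypothetical helper: create one git worktree per tool name.
# Takes a comma-separated list such as "claude_code,gemini_cli".
# The trees/<tool> path and eval/<tool> branch naming are assumptions.
create_worktrees() {
  for tool in $(printf '%s' "$1" | tr ',' ' '); do
    # -b creates a new branch and checks it out in the new worktree
    git worktree add -b "eval/$tool" "trees/$tool" || return 1
  done
}
```

Run from the repository root, create_worktrees "claude_code,gemini_cli" would give each tool its own checkout on its own branch, so the tools can work on the same prompt without touching each other's files. git worktree list shows the result.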

Requirements

Claude Code (or set up git worktrees manually)
Git
bun | npm | yarn | pnpm
.env (copy .env.sample to .env)

Agentic Coding Tools

Tools we want to evaluate

Beware: we run these tools in permissionless mode, so they act without asking for approval.

Claude Code - claude --dangerously-skip-permissions - https://docs.anthropic.com/en/docs/claude-code/overview

Gemini CLI - gemini --yolo - https://github.com/google-gemini/gemini-cli

Codex CLI - codex --dangerously-auto-approve-everything - https://github.com/openai/codex

Master AI Coding

Learn to code with AI with the foundational Principles of AI Coding.

Follow the IndyDevDan YouTube channel for more AI coding tips and tricks.
