disler/agentic-coding-tool-eval

Agentic Coding Tool Evals

Micro apps to compare and evaluate Agentic Coding Tools in a hands-on way.

Apps

apps/ui_component_eval/* - Which Agentic Coding Tool can adhere to the prompts (apps/ui_component_eval/README.md) and build the best UI component?

Setup

Using Claude Code, run the custom slash command /trees "claude_code,gemini_cli" to create the worktrees.

Requirements

  • Claude Code (or set up git worktrees manually)
  • Git
  • bun | npm | yarn | pnpm
  • .env (copy .env.sample to .env)
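
If you go the manual route instead of the /trees command, the sketch below shows one way to create a worktree per tool. The `trees/` directory, the `create_trees` function name, and the branch-per-tool naming are assumptions made for illustration; the /trees command may lay things out differently.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Create one git worktree per tool name.
# Assumption: worktrees live under trees/<tool> on a new branch named <tool>.
create_trees() {
  local tool
  for tool in "$@"; do
    # -b creates the branch and checks it out in the new worktree directory
    git worktree add -b "$tool" "trees/$tool"
  done
}

# Usage, from the repository root (needs at least one existing commit):
#   create_trees claude_code gemini_cli
```

Each worktree is a separate checkout, so every tool can edit files without touching the others. Remember to copy .env.sample to .env wherever the app expects it.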

Agentic Coding Tools

Tools we want to evaluate

Beware: these tools will run in permissionless mode.

Claude Code - claude --dangerously-skip-permissions - https://docs.anthropic.com/en/docs/claude-code/overview

Gemini CLI - gemini --yolo - https://github.com/google-gemini/gemini-cli

Codex CLI - codex --dangerously-auto-approve-everything - https://github.com/openai/codex

Master AI Coding

Learn to code with AI with foundational Principles of AI Coding

Follow the IndyDevDan YouTube channel for more AI coding tips and tricks.

About

Simple ideas to compare Agentic Coding Tools
