LLM agent solving traces, leaderboards, and benchmark results across security CTF and hacking platforms

0ca/BoxPwnr-Traces

BoxPwnr-Traces

Machine solving traces and results generated by BoxPwnr.

Each trace includes the full conversation log showing LLM reasoning, commands executed, and outputs received. Browse leaderboards, replay runs in an interactive web viewer, and read AI-generated reports:

🔬 BoxPwnr Traces & Benchmarks

| Platform | Solved | Completion | Traces |
|----------|--------|------------|--------|
| HTB Labs | 54/514 | 13.0% | 326 |
| HTB Starting Point | 25/25 | 100.0% | 772 |
| PortSwigger Labs | 163/270 | 60.4% | 377 |
| XBOW Validation Benchmarks | 94/104 | 90.4% | 512 |
| Cybench CTF Challenges | 35/40 | 87.5% | 616 |
| picoCTF Challenges | 115/439 | 26.2% | 428 |
| TryHackMe Rooms | 31/459 | 11.2% | 238 |
| HackBench Benchmarks | 3/16 | 18.8% | 3 |
| Neurogrid CTF: The ultimate AI security showdown | 17/36 | 47.2% | 197 |

Last updated: 2026-02-23 22:10:05
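
For readers who want headline totals from the table above, the following is a minimal sketch that sums the per-platform rows. The numbers are copied by hand from the table as of the timestamp shown, and the names `ROWS` and `summarize` are hypothetical, not part of BoxPwnr. It is a plain sum of the listed rows, so the repository may compute its own aggregate figures differently.

```python
# Rows copied from the table above: (platform, solved, total, traces).
# This is a hand-made snapshot, not data loaded from the repository.
ROWS = [
    ("HTB Labs", 54, 514, 326),
    ("HTB Starting Point", 25, 25, 772),
    ("PortSwigger Labs", 163, 270, 377),
    ("XBOW Validation Benchmarks", 94, 104, 512),
    ("Cybench CTF Challenges", 35, 40, 616),
    ("picoCTF Challenges", 115, 439, 428),
    ("TryHackMe Rooms", 31, 459, 238),
    ("HackBench Benchmarks", 3, 16, 3),
    ("Neurogrid CTF: The ultimate AI security showdown", 17, 36, 197),
]


def summarize(rows):
    """Aggregate per-platform rows into four headline counts."""
    return {
        "platforms": len(rows),
        "total_challenges": sum(total for _, _, total, _ in rows),
        "challenges_solved": sum(solved for _, solved, _, _ in rows),
        "total_traces": sum(traces for _, _, _, traces in rows),
    }


if __name__ == "__main__":
    for name, value in summarize(ROWS).items():
        print(f"{name}: {value}")
```

Keeping the rows as plain tuples means the snapshot can be replaced with a newer copy of the table without touching the aggregation logic.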
