The AI coding benchmark · free · runs locally
Everyone says they're a 10x engineer now. Most have no idea if they are.
You have run hundreds of AI sessions and burned real money in tokens. AgentMetrics reads your local session history and shows what you actually shipped, how much of it survived, and how you stack up against other developers.





You ran 5.2× the average developer.
Here is what that spend actually earned, last 8 months.
$7,916
275
18%
$162
Spend vs survived
Your spend is climbing. The share that survives is not.
High on activity, low where it counts. Run the report to confirm your real percentiles.
Code Yield is the share of your AI-authored code that shipped and survived 30 days.
5 tools · 1 reportWhat's in your report
Your usage dashboard shows the bill. This shows how you stack up.
Where you rank
Your percentile against other AI-native developers on output that actually survives, not tokens burned.
Your Code Yield
How much of your AI-authored code shipped, lasted, and mattered. The one number a usage dashboard cannot show you.
Where your spend leaks
Your real cost per change that stuck, broken down by tool and model, so you can see which ones are worth the bill.
Code Yield, Code Half-Life, and cost per realized change are the Return on Code metrics. Read how they are defined in the glossary.
How it works
Three steps. The first one is the only one you do today.
Leave your email
Today. That is the whole ask. We send the rest when the tool opens.
Run one command
Next week you get a single CLI command. It runs locally and reads the session history your AI tools already keep. Nothing uploads.
Open your report
See your numbers instantly. Then opt in to benchmark them against every other developer who ran it.
Local first
The tool runs on your machine and reads the session history your AI tools already keep. Your source code is never read and nothing uploads. You share aggregate numbers only when you choose to join the benchmark.
Benchmark your team, not just yourself.
See which developers and which tools turn AI spend into code that ships and survives, and which ones quietly leak the budget. Govern per-seat spend on yield, not seats.
Questions
The things people ask before they run it.
- Is it free?
- Yes. Your personal AgentMetrics report is free. You run a single command locally and get your numbers back. The benchmark against other developers is also free once you opt in to share your aggregate score.
- Does my code leave my machine?
- No. The tool runs locally and reads the session metadata your AI coding tools already store on your machine. Your source code is never read or uploaded. You only ever share aggregate numbers, and only if you choose to join the benchmark.
- Which tools does it work with?
- It is tool-neutral. It reads history across Claude Code, Codex, Cursor, Copilot, Gemini, and more, so you get one cross-tool picture instead of a separate dashboard per vendor.
- When do I get it?
- We are sending run instructions next week. Leave your email now to be first in line when it opens.
- Can a team lead benchmark the whole team?
- Yes. The team view turns per-seat AI spend into Return on Code at the team and tool level, and surfaces who is shipping value with AI versus who is mostly burning tokens. Choose the team option to join the waitlist and we will reach out as the team product opens.
Find out where you actually stand.
Leave your email. We send your run instructions next week.