Track · Resume Tailoring
The leaderboard.
200 résumé/JD pairs. 10 tools. 20 of those tasks are trap-street probes: we know in advance what the tools should NOT say.
Tasks: 200 · Tools: 10 · Trap probes: 20 · Total fabrications caught: 42 · Last refresh: Apr 25, 2026
| # | Tool | Vendor | Tier | Score | Fabrications | $/task | Latency | Pricing |
|---|---|---|---|---|---|---|---|---|
| 01 | open-resume-tailor | indie / open source | | 84.2 | 0 | $0.008 | 1.4s | Free |
| 02 | Claude (direct prompt) | Anthropic | | 79.1 | 1 | $0.015 | 2.1s | API |
| 03 | Rezi | Rezi.ai | | 71.3 | 6 | $0.042 | 3.8s | $29/mo |
| 04 | Huntr | Huntr.co | | 68.4 | 4 | $0.038 | 2.9s | $39/mo |
| 05 | Kickresume | Kickresume | | 64.0 | 3 | $0.031 | 3.2s | $19/mo |
| 06 | WeekendHack v3.2 | indie | | 64.0 | 0 | $0.004 | 0.9s | Free |
| 07 | Teal AI | Teal HQ | | 62.8 | 2 | $0.022 | 2.4s | $9/mo |
| 08 | Gemini 2.0 (direct) | Google | | 60.2 | 5 | $0.011 | 1.8s | API |
| 09 | GPT-4o (direct) | OpenAI | | 58.9 | 7 | $0.018 | 2.5s | API |
| 10 | 简历优化助手 Pro | (redacted) | | 41.2 | 14 | $0.091 | 5.2s | ¥199/mo |
Reading the table. The tier badge tells you how the score was verified: Bronze = self-reported, Silver = we audited 10–20% of the run, Gold = we ran the entire eval ourselves. The fabrication count includes both LLM-judge flags and trap-street trips.
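As a quick sanity check, the per-tool fabrication counts in the table should add up to the headline total of 42. A minimal Python sketch, with the counts copied from the table; the variable names are illustrative and not part of any published tooling:

```python
# Fabrication counts copied from the leaderboard table, in rank order.
# The dict and variable names are illustrative assumptions.
fabrications = {
    "open-resume-tailor": 0,
    "Claude (direct prompt)": 1,
    "Rezi": 6,
    "Huntr": 4,
    "Kickresume": 3,
    "WeekendHack v3.2": 0,
    "Teal AI": 2,
    "Gemini 2.0 (direct)": 5,
    "GPT-4o (direct)": 7,
    "简历优化助手 Pro": 14,
}

# Sum across all tools; this should match "Total fabrications caught".
total = sum(fabrications.values())

# Tools that finished the run with no fabrications flagged.
clean_tools = [name for name, count in fabrications.items() if count == 0]

print(f"total fabrications: {total}")
print(f"zero-fabrication tools: {clean_tools}")
```

The same pattern works for re-checking the tally after a refresh: update the counts and compare the sum with the headline figure.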