Rank | Model | Pass Rate ↓ | Speed per Case | Cost |
---|---|---|---|---|
1 | MS R1 | | | $0.000 |
2 | qwen3 235B | | | $2.426 |
3 | DeepSeek R1 | | | $6.192 |
4 | Flash 2.5 Thinking | | | $5.000 |
5 | R1T-Chimera | | | $0.000 |
6 | Flash 2.5 Thinking | | | $6.000 |
7 | DeepSeek Chat v3 | | | $1.355 |
8 | Qwen3 30B | | | $1.364 |
9 | GPT-4.1-mini | | | $2.170 |
10 | Grok-3-mini-beta | | | $0.778 |
11 | Grok-3-mini-beta | | | $2.165 |
12 | Grok-3-mini-beta | | | $1.010 |
13 | GLM-4 | | | $0.000 |
14 | Qwen 2.5 Coder 32B | | | $0.894 |