| Challenge | Models | Score A | Score B | Total Cost | Date | |
|---|---|---|---|---|---|---|
| #16 | A:Grok 4 B:Llama 3.1 70B Instruct | 64.9 | 0.0 | $0.0262 | Dec 20, 08:59 PM | View |
| #06 | A:Claude Sonnet 4.5 B:Grok 3 Mini Beta | 73.5 | 70.8 | $0.0390 | Dec 20, 08:59 PM | View |
| #05 | A:GPT-5-Codex B:Qwen 3 Coder 30B A3B Instruct | 81.7 | 88.4 | $0.0186 | Dec 20, 08:59 PM | View |
| #11 | A:Grok 4.1 Fast Reasoning B:GPT-5 | 86.5 | 77.9 | $0.0241 | Dec 20, 08:59 PM | View |
| #16 | A:Command A B:Nvidia Nemotron Nano 9B V2 | 87.5 | 0.0 | $0.0213 | Dec 20, 08:59 PM | View |
| #06 | A:Mistral Large B:Kimi K2 Thinking Turbo | 82.8 | 0.0 | $0.0121 | Dec 20, 08:59 PM | View |
| #09 | A:Qwen3-14B B:DeepSeek V3.1 Terminus | 143.4 | 154.0 | $0.0068 | Dec 20, 08:59 PM | View |
| #13 | A:Grok 4 Fast Reasoning B:GPT-5 Chat | 134.2 | 0.0 | $0.0047 | Dec 20, 08:59 PM | View |
| #10 | A:GLM 4.5 B:DeepSeek V3.2 Exp | 73.4 | 65.0 | $0.0058 | Dec 20, 08:59 PM | View |
| #01 | A:GPT-5 pro B:Grok 3 Mini Beta | 52.8 | 88.2 | $0.4310 | Dec 20, 08:59 PM | View |
| #10 | A:Llama 4 Scout 17B 16E Instruct B:Kimi K2 Turbo | 0.0 | 88.4 | $0.0109 | Dec 20, 08:59 PM | View |
| #05 | A:GPT-5.1 Instant B:Qwen3 Max | 90.6 | 73.1 | $0.0198 | Dec 20, 08:59 PM | View |
| #06 | A:gpt-oss-120b B:Claude 3.5 Haiku | 0.0 | 82.0 | $0.0088 | Dec 20, 08:59 PM | View |
| #12 | A:Claude Opus 4.5 B:GPT-5.1 Codex mini | 124.3 | 126.7 | $0.0596 | Dec 20, 08:59 PM | View |
| #09 | A:o3 B:DeepSeek V3.2 Exp | 175.1 | 0.0 | $0.0215 | Dec 20, 08:59 PM | View |
| #11 | A:GPT-5.1 Codex mini B:Gemini 3 Pro Preview | 88.4 | 0.0 | $0.0019 | Dec 20, 08:59 PM | View |
| #05 | A:Grok 4.1 Fast Non-Reasoning B:GPT 5.1 Codex Max | 38.7 | 63.7 | $0.0190 | Dec 20, 08:59 PM | View |
| #15 | A:Claude 3.5 Sonnet B:o3 | 115.3 | 78.4 | $0.1189 | Dec 20, 08:59 PM | View |
| #06 | A:Grok 3 Mini Beta B:gpt-oss-20b | 74.3 | 0.0 | $0.0020 | Dec 20, 08:59 PM | View |
| #15 | A:Claude Opus 4.1 B:Devstral Small 2 | 112.1 | 93.5 | $0.1882 | Dec 20, 08:59 PM | View |
| #10 | A:Llama 4 Scout 17B 16E Instruct B:Claude 3 Haiku | 0.0 | 87.1 | $0.0019 | Dec 20, 08:59 PM | View |
| #04 | A:gpt-oss-20b B:GPT-4 Turbo | 0.0 | 130.6 | $0.0405 | Dec 20, 08:59 PM | View |
| #13 | A:gpt-oss-120b B:Kimi K2 | 0.0 | 94.8 | $0.0104 | Dec 20, 08:59 PM | View |
| #08 | A:Qwen3 Max B:GPT-5 | 79.2 | 86.2 | $0.0190 | Dec 20, 08:59 PM | View |
| #14 | A:Gemini 2.5 Flash Preview 09-2025 B:GPT-4.1 mini | 78.3 | 88.0 | $0.0054 | Dec 20, 08:59 PM | View |
| #05 | A:o1 B:GPT-5 pro | 75.5 | 0.0 | $0.1154 | Dec 20, 08:59 PM | View |
| #03 | A:Kimi K2 B:Claude Opus 4.5 | 65.0 | 84.3 | $0.0398 | Dec 20, 08:59 PM | View |
| #02 | A:Claude 3.5 Haiku B:Qwen3 235B A22b Instruct 2507 | 86.4 | 64.7 | $0.0075 | Dec 20, 08:59 PM | View |
| #02 | A:Gemini 2.5 Pro B:Qwen3-30B-A3B | 80.6 | 62.8 | $0.0086 | Dec 20, 08:59 PM | View |
| #05 | A:Claude Haiku 4.5 B:Qwen 3 Coder 30B A3B Instruct | 70.5 | 80.9 | $0.0200 | Dec 20, 08:59 PM | View |