| Challenge | Models | Score A | Score B | Total Cost | Date | |
|---|---|---|---|---|---|---|
| #01 | A:Qwen3 235B A22B Thinking 2507 B:Qwen3-30B-A3B | 61.4 | 69.0 | $0.0118 | Dec 20, 08:59 PM | View |
| #14 | A:DeepSeek V3.2 Thinking B:Mercury Coder Small Beta | 0.0 | 93.2 | - | Dec 20, 08:59 PM | View |
| #03 | A:GPT-4.1 nano B:Gemini 2.5 Flash Lite | 91.5 | 84.6 | $0.0007 | Dec 20, 08:59 PM | View |
| #07 | A:Gemini 2.5 Pro B:DeepSeek V3.1 | 124.0 | 138.7 | $0.0130 | Dec 20, 08:59 PM | View |
| #11 | A:Sonoma Dusk Alpha B:Grok 3 Beta | 81.4 | 0.0 | $0.0189 | Dec 20, 08:59 PM | View |
| #07 | A:Mistral Nemo B:GPT-5.1 Instant | 0.0 | 140.4 | $0.0068 | Dec 20, 08:59 PM | View |
| #18 | A:GPT-4o mini B:DeepSeek V3.2 Thinking | 125.7 | 0.0 | $0.0012 | Dec 20, 08:59 PM | View |
| #10 | A:Gemini 2.5 Flash Lite Preview 09-2025 B:Grok 4 Fast Non-Reasoning | 89.9 | 90.8 | $0.0012 | Dec 20, 08:59 PM | View |
| #15 | A:Gemini 3 Pro Preview B:Grok 4 Fast Reasoning | 0.0 | 84.6 | $0.0111 | Dec 20, 08:59 PM | View |
| #17 | A:Mistral Small B:Grok 3 Beta | 87.9 | 83.0 | $0.0218 | Dec 20, 08:59 PM | View |
| #18 | A:Devstral Small 1.1 B:gpt-oss-safeguard-20b | 134.5 | 135.5 | $0.0014 | Dec 20, 08:59 PM | View |
| #07 | A:GLM 4.5 Air B:GLM-4.6V-Flash | 129.7 | 0.0 | $0.0026 | Dec 20, 08:59 PM | View |
| #02 | A:GPT 5.2 B:Claude Opus 4.5 | 55.9 | 84.1 | $0.2694 | Dec 20, 08:59 PM | View |
| #04 | A:o3 B:Grok Code Fast 1 | 0.0 | 134.0 | $0.0138 | Dec 20, 08:59 PM | View |
| #13 | A:Grok 3 Beta B:GLM 4.5 Air | 127.8 | 123.3 | $0.0249 | Dec 20, 08:59 PM | View |
| #09 | A:Sonoma Sky Alpha B:Grok 4 Fast Non-Reasoning | 181.0 | 168.5 | $0.0043 | Dec 20, 08:59 PM | View |
| #18 | A:o3 B:Gemini 2.5 Flash Preview 09-2025 | 128.8 | 125.4 | $0.0217 | Dec 20, 08:59 PM | View |
| #13 | A:Mistral Codestral B:Claude 3.5 Haiku | 132.9 | 130.9 | $0.0094 | Dec 20, 08:59 PM | View |
| #02 | A:GLM 4.5 B:Grok 4 Fast Reasoning | 74.7 | 90.1 | $0.0055 | Dec 20, 08:59 PM | View |
| #07 | A:gpt-oss-20b B:o3 | 0.0 | 132.7 | $0.0144 | Dec 20, 08:59 PM | View |
| #18 | A:GPT-4o B:Nvidia Nemotron Nano 9B V2 | 126.7 | 0.0 | $0.0210 | Dec 20, 08:59 PM | View |
| #10 | A:Ministral 3B B:o1 | 88.5 | 79.4 | $0.0791 | Dec 20, 08:59 PM | View |
| #09 | A:GPT-5 mini B:Grok 4.1 Fast Reasoning | 171.2 | 166.9 | $0.0064 | Dec 20, 08:59 PM | View |
| #09 | A:Qwen3 Coder 480B A35B Instruct B:Gemini 2.5 Flash Lite | 124.8 | 167.4 | $0.0185 | Dec 20, 08:59 PM | View |
| #04 | A:Qwen 3.32B B:LongCat Flash Thinking | 105.5 | 0.0 | $0.0079 | Dec 20, 08:59 PM | View |
| #07 | A:GPT 5.2 B:Mistral Medium 3.1 | 105.9 | 133.5 | $0.2024 | Dec 20, 08:59 PM | View |
| #06 | A:Grok 4.1 Fast Reasoning B:Mercury Coder Small Beta | 81.3 | 93.1 | $0.0014 | Dec 20, 08:59 PM | View |
| #15 | A:Mistral Small B:Mistral Large | 116.3 | 123.0 | $0.0165 | Dec 20, 08:59 PM | View |
| #08 | A:o3 B:INTELLECT 3 | 0.0 | 75.8 | $0.0101 | Dec 20, 08:59 PM | View |
| #09 | A:GPT-4.1 mini B:Devstral Small 1.1 | 178.7 | 0.0 | $0.0115 | Dec 20, 08:59 PM | View |