grube.ai

Benchmarks

AI model performance rankings

Click any two models to compare them side by side

Index ScoresBenchmarksSpeed
#
1
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic
57.352.5N/AN/A91.439.6N/A54.5N/AN/A52
2
GPT-5.4 (xhigh)OpenAI
57.257.3N/AN/A92.041.6N/A56.6N/AN/A32
3
Gemini 3.1 Pro PreviewGooglevia Google AI Studio
57.255.5N/AN/A94.144.7N/A58.9N/AN/A71
4
Claude Opus 4.6 (Adaptive Reasoning, Max Effort)Anthropic
53.048.1N/AN/A89.636.7N/A51.9N/AN/A36
5
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)Anthropic
51.750.9N/AN/A87.530.0N/A46.8N/AN/A42
6
GLM-5.1 (Reasoning)Z AIvia Z.AI
51.443.4N/AN/A86.828.0N/A43.8N/AN/A21
7
Qwen3.6 PlusAlibaba
50.042.9N/AN/A88.225.7N/A40.7N/AN/A44
8
GLM-5 (Reasoning)Z AIvia Z.AI
49.844.2N/AN/A82.027.2N/A46.2N/AN/A32
9
GPT-5.4 mini (xhigh)OpenAI
48.151.5N/AN/A87.526.6N/A49.9N/AN/A83
10
Gemini 3 Flash Preview (Reasoning)Google
46.442.697.089.089.834.790.850.6N/AN/A68
11
Qwen3.5 397B A17B (Reasoning)Alibaba
45.041.3N/AN/A89.327.3N/A42.0N/AN/A53
12
GPT-5.4 nano (xhigh)OpenAI
44.443.9N/AN/A81.726.5N/A46.9N/AN/A65
13
MiMo-V2-Flash (Feb 2026)Xiaomi
41.533.5N/AN/A83.520.0N/A38.3N/AN/A35
14
Grok 4xAI
41.540.592.786.687.723.981.945.799.094.345
15
Gemma 4 31B (Reasoning)Googlevia DeepInfra
39.238.7N/AN/A85.722.7N/A43.4N/AN/A32
16
Grok 4.1 Fast (Reasoning)xAI
38.630.989.385.485.317.682.244.2N/AN/A116
17
Claude 4.5 Haiku (Reasoning)Anthropic
37.132.683.776.067.29.761.543.3N/AN/A66
18
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)NVIDIAvia Nebius
36.031.2N/AN/A80.019.2N/A36.0N/AN/A89
19
Grok 4 Fast (Reasoning)xAI
35.127.489.785.084.717.083.244.2N/AN/A123
20
Gemini 3.1 Flash-Lite PreviewGoogle
33.530.1N/AN/A82.216.2N/A41.9N/AN/A44
21
gpt-oss-120B (high)OpenAIvia Groq
33.328.693.480.878.218.587.838.9N/AN/A349
22
gpt-oss-120B (high)OpenAIvia Google
33.328.693.480.878.218.587.838.9N/AN/A218
23
gpt-oss-120B (high)OpenAIvia Cerebras
33.328.693.480.878.218.587.838.9N/AN/A768
24
GPT-4.1OpenAI
26.321.834.780.666.64.645.738.191.343.744
25
GPT-4.1 miniOpenAI
22.918.546.378.166.44.648.340.492.543.051
26
GPT-4o (Aug '24)OpenAI
18.616.6N/AN/A52.12.931.733.179.511.714
27
DeepSeek R1 Distill Qwen 32BDeepSeekvia NextBit
17.2N/A63.073.961.55.527.037.694.168.724
28
DeepSeek R1 Distill Llama 70BDeepSeekvia DeepInfra
16.011.453.779.540.26.126.631.293.567.042
29
Gemini 2.0 Flash-Lite (Feb '25)Google
14.7N/AN/A72.453.53.618.525.087.327.722
30
Llama 3.3 Instruct 70BMetavia Groq
14.510.77.771.349.84.028.826.077.330.0144
31
Llama 4 ScoutMetavia Groq
13.56.714.075.258.74.329.917.084.428.3100
32
GPT-4.1 nanoOpenAI
13.011.224.065.751.23.932.625.984.823.735
33
GPT-4o miniOpenAI
12.6N/A14.764.842.64.023.422.978.911.730

crafted by bart stefanski

vs...