Text
Vision
Image (LM Arena)
Image (Artificial Analysis)
Overall
Hard prompts
Hard prompts (english)
Longer query
Multiturn
English
Math
Instruction following
Creative writing
Coding
Chinese
French
German
Japanese
Korean
Russian
Spanish
Exclude <5 tok query
Exclude refusal
Style control
Rank
Model
Rating
95% CI
1
gemini-2.5-pro-preview
5/6
new
1385
+9/-10
1
gemini-2.5-pro-exp
3/25
1377
+6/-7
1
o3
4/16
1373
+6/-6
4
chatgpt-4o-latest
3/26
1362
+7/-6
4
gpt-4.5-preview
2/27
1357
+4/-4
6
gemini-2.5-flash-preview
4/17
1338
+6/-7
6
gpt-4.1
4/14
1330
+7/-9
6
grok-3-preview
2/24
1328
+5/-4
6
o4-mini
4/16
1325
+9/-11
10
deepseek-v3
3/24
open
1323
+5/-6
10
o1
12/17 2024
1320
+3/-3
10
chatgpt-4o-latest
11/20 2024
1319
+3/-3
10
deepseek-r1
open
1317
+4/-4
14
claude-3-7-sonnet-thinking-32k
2/19
1313
+4/-5
15
o1-preview
1303
+4/-4
15
claude-3-7-sonnet
2/19
1298
+5/-4
15
gpt-4.1-mini
4/14
1295
+9/-7
18
o3-mini-high
1287
+5/-4
18
claude-3-5-sonnet
10/22 2024
1286
+3/-2
18
qwen3-235b-a22b
new
open
1285
+11/-12
18
gemini-2.0-flash
v1
1285
+3/-4
18
gemma-3-27b-it
open
1282
+5/-5
23
deepseek-v3
open
1277
+4/-4
23
gemini-2.0-flash-lite-preview
2/5
1270
+4/-3
25
o3-mini
1269
+4/-4
25
gemini-1.5-pro
v2
1269
+2/-2
25
hunyuan-turbos
2/26
1266
+10/-12
25
gemma-3-12b-it
new
open
1265
+14/-11
25
command-a
Mar
open
1264
+5/-5
30
gpt-4o
5/13 2024
1264
+2/-2
30
claude-3-5-sonnet
6/20 2024
1260
+2/-2
30
hunyuan-turbo
1/10
1260
+9/-9
30
glm-4-plus
1/11
1257
+7/-6
34
qwq-32b
open
1256
+6/-5
34
o1-mini
1256
+2/-2
34
llama-3.1-405b-instruct-bf16
open
1254
+3/-3
34
llama-3.1-405b-instruct-fp8
open
1253
+2/-2
34
step-2-16k-exp
Dec 2024
1252
+8/-10
34
gpt-4o
8/6 2024
1250
+3/-2
34
grok-2
8/13 2024
1249
+2/-2
34
llama-4-maverick-17b-128e-instruct
open
1249
+7/-6
34
llama-3.3-nemotron-49b-super
v1
open
1248
+10/-11
43
yi-lightning
1247
+3/-3
43
hunyuan-large
2/10
1243
+10/-10
43
gpt-4-turbo
4/9 2024
1242
+2/-2
43
gpt-4.1-nano
4/14
1240
+7/-7
47
yi-lightning-lite
1239
+4/-4
47
claude-3-opus
2/29 2024
1239
+2/-1
47
glm-4-plus
1236
+3/-3
47
llama-3.3-70b-instruct
open
1235
+2/-3
47
gpt-4o-mini
7/18 2024
1235
+2/-2
47
gpt-4-1106-preview
1235
+2/-2
47
claude-3-5-haiku
10/22 2024
1234
+3/-2
54
mistral-large
Jul 2024
open
1232
+3/-2
54
athene-v2-chat
open
1231
+3/-3
54
gpt-4-0125-preview
1231
+2/-2
54
hunyuan-standard
2/10
1230
+9/-8
54
gemini-1.5-flash
v2
1230
+4/-3
59
grok-2-mini
8/13 2024
1225
+2/-2
59
athene-70b
7/25
open
1223
+3/-4
59
qwen2.5-72b-instruct
open
1223
+3/-2
59
gemma-3-4b-it
new
open
1222
+9/-9
59
mistral-large
Nov 2024
open
1222
+3/-3
64
llama-3.1-nemotron-70b-instruct
open
1217
+6/-5
64
llama-3.1-70b-instruct
open
1214
+3/-2
64
llama-3.1-tulu-3-70b
open
1209
+9/-9
64
amazon-nova-pro-v1.0
1208
+3/-3
64
reka-core
9/4 2024
1208
+6/-6
69
gemma-2-27b-it
open
1207
+2/-2
69
yi-large-preview
1207
+3/-3
69
jamba-1.5-large
open
1206
+5/-5
69
llama-3.1-nemotron-51b-instruct
open
1204
+7/-9
73
gemma-2-9b-it-simpo
open
1200
+5/-7
73
gpt-4
3/14
1200
+2/-3
73
claude-3-sonnet
2/29 2024
1197
+2/-2
73
command-r-plus
Aug 2024
open
1197
+4/-4
73
nemotron-4-340b-instruct
open
1196
+4/-4
73
llama-3-70b-instruct
open
1195
+2/-2
73
yi-large
1195
+4/-4
73
reka-flash
9/4 2024
1193
+5/-6
73
mistral-small-24b-instruct
Jan
open
1192
+5/-4
73
qwen2.5-coder-32b-instruct
open
1190
+6/-6
73
glm-4
5/20
1189
+5/-5
84
gpt-4
6/13
1185
+2/-2
84
c4ai-aya-expanse-32b
open
1185
+3/-3
84
command-r-plus
open
1183
+2/-2
84
amazon-nova-lite-v1.0
1182
+4/-4
88
gemma-2-9b-it
open
1181
+2/-3
88
qwen2-72b-instruct
open
1181
+3/-3
88
gemini-1.5-flash-8b
v1
1180
+3/-2
88
claude-3-haiku
3/7 2024
1178
+2/-2
88
phi-4
open
1178
+4/-4
88
command-r
Aug 2024
open
1176
+5/-5
88
olmo-2-0325-32b-instruct
new
1174
+10/-12
95
amazon-nova-micro-v1.0
1164
+3/-4
95
glm-4
1/16
1163
+7/-6
95
jamba-1.5-mini
open
1160
+4/-5
95
ministral-8b
Oct 2024
open
1160
+8/-7
95
claude-1
1159
+4/-4
100
mistral-large
Feb 2024
1158
+2/-2
100
hunyuan-standard-256k
1153
+10/-10
102
reka-flash-21b-online
2/26 2024
1151
+4/-4
102
c4ai-aya-expanse-8b
open
1151
+5/-4
102
mixtral-8x22b-instruct
v0.1
open
1149
+3/-2
102
mistral-next
1149
+5/-5
102
command-r
open
1148
+3/-3
102
llama-3.1-tulu-3-8b
open
1148
+7/-8
102
claude-2.0
1147
+5/-6
109
llama-3-8b-instruct
open
1143
+2/-2
109
reka-flash-21b
2/26 2024
1142
+4/-4
109
gpt-3.5-turbo
3/14
1142
+11/-9
109
mistral-medium
1141
+3/-3
113
gpt-3.5-turbo
1/25
1137
+2/-2
113
gpt-3.5-turbo
6/13
1137
+4/-2
113
granite-3.1-8b-instruct
open
1135
+10/-8
113
claude-2.1
1135
+3/-3
113
yi-1.5-34b-chat
open
1134
+3/-3
113
llama-3.1-8b-instruct
open
1133
+3/-2
113
zephyr-orpo-141b-A35b
v0.1
open
1130
+9/-8
120
claude-instant-1
1127
+4/-4
121
phi-3-medium-4k-instruct
open
1118
+4/-3
121
gemma-2-2b-it
open
1118
+3/-3
121
internlm2_5-20b-chat
open
1116
+5/-5
121
mixtral-8x7b-instruct
v0.1
open
1114
+2/-2
121
dbrx-instruct-preview
open
1113
+3/-2
121
granite-3.0-8b-instruct
open
1109
+6/-7
127
gpt-3.5-turbo
11/6
1109
+5/-5
127
wizardlm-70b
open
1105
+7/-6
127
granite-3.1-2b-instruct
open
1104
+8/-9
127
snowflake-arctic-instruct
open
1102
+3/-3
127
yi-34b-chat
open
1101
+4/-5
127
openchat-3.5
1/6
open
1100
+4/-4
133
phi-3-small-8k-instruct
open
1098
+4/-4
133
openchat-3.5
open
1094
+5/-5
133
llama-3.2-3b-instruct
open
1094
+6/-4
133
starling-lm-7b-beta
open
1092
+5/-4
133
vicuna-33b
open
1091
+4/-4
133
openhermes-2.5-mistral-7b
open
1089
+9/-8
139
starling-lm-7b-alpha
open
1083
+6/-5
139
granite-3.0-2b-instruct
open
1082
+5/-7
139
pplx-70b-online
1076
+7/-7
142
nous-hermes-2-mixtral-8x7b-dpo
open
1072
+6/-9
142
dolphin-2.2.1-mistral-7b
1069
+10/-12
142
phi-3-mini-4k-instruct-june-2024
open
1068
+5/-4
142
mistral-7b-instruct
v0.2
open
1067
+4/-2
142
solar-10.7b-instruct
v1
1066
+9/-8
142
wizardlm-13b
open
1063
+7/-8
142
mpt-30b-chat
open
1060
+10/-9
142
falcon-180b-chat
open
1059
+14/-15
150
vicuna-13b
open
1058
+4/-4
150
phi-3-mini-4k-instruct
open
1056
+5/-4
150
smollm2-1.7b-instruct
open
1051
+13/-12
150
zephyr-7b-beta
open
1049
+6/-5
154
phi-3-mini-128k-instruct
open
1049
+5/-3
154
codellama-34b-instruct
open
1047
+6/-6
154
zephyr-7b-alpha
open
1046
+12/-12
154
llama-3.2-1b-instruct
open
1045
+6/-6
154
palm-2
1042
+7/-7
154
pplx-7b-online
1041
+7/-6
154
guanaco-33b
open
1040
+10/-9
154
codellama-70b-instruct
open
1038
+16/-17
162
stripedhyena-nous-7b
open
1034
+8/-6
162
mistral-7b-instruct
open
1026
+7/-5
164
vicuna-7b
open
1021
+7/-6
165
olmo-7b-instruct
open
994
+8/-7
165
koala-13b
open
980
+8/-7
167
chatglm3-6b
open
974
+10/-6
167
gpt4all-13b-snoozy
open
972
+16/-10
167
mpt-7b-chat
open
967
+9/-10
167
alpaca-13b
966
+10/-7
171
RWKV-4-Raven-14B
open
949
+7/-9
171
chatglm2-6b
open
942
+12/-9
173
oasst-pythia-12b
open
931
+7/-7
174
fastchat-t5-3b
open
896
+10/-9
174
chatglm-6b
open
888
+10/-7
174
dolly-v2-12b
open
881
+9/-8
177
stablelm-tuned-alpha-7b
open
859
+11/-10
Settings
Open models only
Visualize scores
With moats
With charts
Price ranges
<$0.20
$0.20-$1
$1-$10
$10+
Filter models
Never
Drop deprecated
Drop old
Best / org
Remember: You need a 70 point difference for a 60% win rate
Share
Done