add a button to download the results in the tables
π
1
#86 opened 7 days ago
by
HUYSOSAT
Files not updated since August
1
#85 opened about 1 month ago
by
Rachel0619
request model evaluation please?
1
#84 opened about 2 months ago
by
legolasyiu
Files have not updated since September 3
#83 opened about 2 months ago
by
MrLittleTexas
Files haven't been updated since Aug 4
β
π
6
7
#80 opened 3 months ago
by
maaxxxcal
Chatbot Arena Leaderboard Runtime Error
π
2
1
#78 opened 4 months ago
by
minpyaemoe
Add filters for 'Unlimited Free Access' and 'No Geo-Restrictions.'
π
1
#74 opened 7 months ago
by
wqqedfh
Let people vote on existing responses?
#73 opened 9 months ago
by
endolith
Latest raw mt-bench results available
#72 opened 9 months ago
by
lucweber
Cameroun
1
#69 opened about 1 year ago
by
EtCeterAi
Add Ovis-1.6 to Chatbot arena ?
#68 opened about 1 year ago
by
xxyyy123
I tried to plot AGI on the same Elo scale by comparing to "both bad" and "tie" votes
#67 opened about 1 year ago
by
endolith
Please add InternLM2.5-20B-Chat and InternLM2.5-7B-Chat to Leaderboard
#61 opened about 1 year ago
by
vansin
Upload leaderboard_table_20240716.csv
#50 opened over 1 year ago
by
connorchenn
Chatbot Arena: Classify requests/votes - ELO per category
#40 opened over 1 year ago
by
NeuralByte
How am I supposed to search models by name when there's live scroll?
π
2
#38 opened over 1 year ago
by
seedmanc
Number of parameters of the model and release date
1
#32 opened over 1 year ago
by
oovm
Is the leaderboard space deprecated then?
π€―
2
#31 opened over 1 year ago
by
zhiminy
Is the notebook version-controlled anywhere?
1
#30 opened over 1 year ago
by
endolith
Support benchmark for Long Context Recall abilities
#29 opened over 1 year ago
by
Nekochu
Is it fair to have web browsing allowed
π
β
5
1
#24 opened over 1 year ago
by
gearunclear
Dataset Update
β
π
6
1
#23 opened over 1 year ago
by
matthiaslau
Request: add two new models
π€
2
2
#21 opened almost 2 years ago
by
rombodawg
Removing LLM version clutter from the leaderboard ?
π
1
2
#20 opened almost 2 years ago
by
zarglu
Re-evaluate GPT-4 ! Add a ELO-graph over time to the leaderboard
π
3
8
#19 opened almost 2 years ago
by
cmp-nct
[enhancement] unaligned ranking column between leaderboards
#17 opened almost 2 years ago
by
zhiminy
Is there any way to download the leaderboard as csv or json format?
π
1
7
#13 opened almost 2 years ago
by
zhiminy
How does GPT-4 Turbo do so well?
π
2
10
#10 opened almost 2 years ago
by
endolith
Human level representation?
π
2
5
#8 opened almost 2 years ago
by
ehalit
Add quantized local models?
β€οΈ
1
#7 opened almost 2 years ago
by
endolith
Synthetic evaluation hypothesis
1
#6 opened almost 2 years ago
by
DmitriSS
You should add nous capybara 34b
π
1
#5 opened almost 2 years ago
by
distantquant