Skip to content
Longterm Wiki
benchmark

Chatbot Arena Elo

Metadata

Source Tablebenchmarks
Source IDddqCoa4u0I
DescriptionCommunity-driven model ranking based on pairwise human preference votes. Users compare anonymous model outputs and vote for the better response. Over 2 million votes collected.
Source URLarena.ai/
Wiki IDchatbot-arena-elo
Children
CreatedMar 14, 2026, 12:43 AM
UpdatedMar 24, 2026, 11:24 PM
SyncedMar 24, 2026, 11:24 PM

Record Data

idddqCoa4u0I
slugchatbot-arena-elo
nameChatbot Arena Elo
categorygeneral
descriptionCommunity-driven model ranking based on pairwise human preference votes. Users compare anonymous model outputs and vote for the better response. Over 2 million votes collected.
websitearena.ai/
scoringMethodelo
higherIsBetterYes
introducedDate2023-05
maintainerLMArena (formerly LMSYS)
sourcearxiv.org/abs/2403.04132
Debug info

Thing ID: ddqCoa4u0I

Source Table: benchmarks

Source ID: ddqCoa4u0I

Wiki ID: chatbot-arena-elo