benchmark
Chatbot Arena Elo
Metadata
| Source Table | benchmarks |
| Source ID | ddqCoa4u0I |
| Description | Community-driven model ranking based on pairwise human preference votes. Users compare anonymous model outputs and vote for the better response. Over 2 million votes collected. |
| Source URL | arena.ai/ |
| Wiki ID | chatbot-arena-elo |
| Children | — |
| Created | Mar 14, 2026, 12:43 AM |
| Updated | Mar 24, 2026, 11:24 PM |
| Synced | Mar 24, 2026, 11:24 PM |
Record Data
id | ddqCoa4u0I |
slug | chatbot-arena-elo |
name | Chatbot Arena Elo |
category | general |
description | Community-driven model ranking based on pairwise human preference votes. Users compare anonymous model outputs and vote for the better response. Over 2 million votes collected. |
website | arena.ai/ |
scoringMethod | elo |
higherIsBetter | Yes |
introducedDate | 2023-05 |
maintainer | LMArena (formerly LMSYS) |
source | arxiv.org/abs/2403.04132 |
Debug info
Thing ID: ddqCoa4u0I
Source Table: benchmarks
Source ID: ddqCoa4u0I
Wiki ID: chatbot-arena-elo