Longterm Wiki
benchmark

BrowseComp

Metadata

Source Table: benchmarks
Source ID: 6A4fafVF2n
Description: A benchmark evaluating AI systems' ability to find hard-to-locate information on the web, testing browsing, search, and information synthesis capabilities across difficult queries.
Wiki ID: browsecomp
Children:
Created: Mar 24, 2026, 11:23 PM
Updated: Mar 24, 2026, 11:24 PM
Synced: Mar 24, 2026, 11:24 PM

Record Data

id: 6A4fafVF2n
slug: browsecomp
name: BrowseComp
category: agentic
description: A benchmark evaluating AI systems' ability to find hard-to-locate information on the web, testing browsing, search, and information synthesis capabilities across difficult queries.
website:
scoringMethod: accuracy
higherIsBetter: Yes
introducedDate: 2025-04
maintainer: OpenAI
source: arxiv.org/abs/2504.12345
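
As a rough illustration of how the scoring fields in this record could be read, the sketch below treats "accuracy" as the fraction of questions graded correct and uses the higher-is-better flag to pick the better of two scores. The class and function names (BenchmarkRecord, accuracy, better_score) are hypothetical and not part of the wiki's schema or of any official BrowseComp tooling; the grading step that decides whether an individual answer is correct is assumed to happen elsewhere.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkRecord:
    """Hypothetical in-memory view of the scoring-related record fields."""
    slug: str
    scoring_method: str
    higher_is_better: bool


def accuracy(graded: list[bool]) -> float:
    """Return the fraction of questions graded correct."""
    if not graded:
        raise ValueError("no graded answers to score")
    return sum(graded) / len(graded)


def better_score(record: BenchmarkRecord, a: float, b: float) -> float:
    """Pick the preferable of two scores according to the record's direction flag."""
    return max(a, b) if record.higher_is_better else min(a, b)


# Values copied from the record above; the field names are adapted to Python style.
browsecomp = BenchmarkRecord(
    slug="browsecomp",
    scoring_method="accuracy",
    higher_is_better=True,
)

# Made-up grading results for four questions, for illustration only.
example_score = accuracy([True, False, True, True])
```

With higherIsBetter set to Yes, comparing two systems reduces to taking the larger accuracy value, which is what better_score does for this record.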