Wikidata - Free Collaborative Knowledge Base
referenceCredibility Rating
Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.
Rating inherited from publication venue: Wikipedia
Wikidata is a foundational open knowledge graph occasionally referenced in AI safety contexts for its role as a public data infrastructure, useful for knowledge-grounded AI systems and as a case study in open collaborative data governance.
Metadata
Summary
Wikidata is a free, collaborative, multilingual knowledge base operated by the Wikimedia Foundation that serves as a central structured data repository for Wikipedia and other Wikimedia projects. It provides machine-readable linked open data covering millions of entities, facts, and relationships. As a public good, it supports AI and NLP research by offering large-scale structured knowledge for training and evaluation.
Key Points
- •Wikidata is a free, open, collaboratively edited knowledge graph with hundreds of millions of statements about real-world entities.
- •It serves as a structured data backbone for Wikipedia and other Wikimedia projects, enabling consistent cross-language information sharing.
- •Data is available under CC0 (public domain), making it freely usable for AI training, knowledge graphs, and research.
- •Wikidata is widely used in NLP and AI research for entity linking, question answering, and knowledge-grounded language models.
- •Its open infrastructure exemplifies public-goods approaches to information, relevant to discussions of open vs. proprietary AI data.
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| AI-Era Epistemic Infrastructure | Approach | 59.0 |
Cached Content Preview
Wikidata - Wikipedia
Jump to content
From Wikipedia, the free encyclopedia
Collaborative multilingual knowledge graph
For Wikipedia's information page on Wikidata, see Wikipedia:Wikidata .
This article has multiple issues. Please help improve it or discuss these issues on the talk page . ( Learn how and when to remove these messages )
This article relies excessively on references to primary sources . Please improve this article by adding secondary or tertiary sources .
Find sources:   "Wikidata"  –  news   · newspapers   · books   · scholar   · JSTOR ( December 2020 ) ( Learn how and when to remove this message )
Parts of this article (those related to Screenshots; and more) need to be updated . Please help update this article to reflect recent events or newly available information. ( June 2025 )
( Learn how and when to remove this message )
Wikidata Screenshot
Main page of Wikidata in April 2021 Type of site Knowledge base
Wiki
Available in Multiple languages Owner Wikimedia Foundation Editor Wikimedia community URL wikidata .org Commercial No Registration Optional Launched 29 October 2012 ; 13 years ago  ( 2012-10-29 ) [ 1 ]
Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation . [ 2 ] It is a source of open data released under the Creative Commons CC0 public domain dedication. It is for the use of both Wikimedia and external projects. [ 3 ] [ 4 ] Wikidata is a wiki powered by the software MediaWiki , including its extension for semi-structured data , the Wikibase . As of early 2025, Wikidata had 1.65 billion item statements ( semantic triples ). [ 5 ]
Concept
[ edit ]
This diagram shows the most important terms used in Wikidata.
Wikidata is a document-oriented database , focusing on items , which represent any kind of topic, concept, or object. Each item is allocated a unique persistent identifier called its QID , a positive integer prefixed with the upper-case letter "Q" [ a ] . This makes it possible to provide translations of the basic information describing the topic each item covers without favouring any particular language.
Some examples of items and their QIDs are 1988 Summer Olympics (Q8470) , love (Q316) , Johnny Cash (Q42775) , Elvis Presley (Q303) , and Gorilla (Q36611) .
Item labels do not need to be unique. For example, there are two items named "Elvis Presley": Elvis Presley (Q303) , which represents the American singer and actor , and Elvis Presley (Q610926) , which represents his self-titled album . However, the combination of a label and its description must be unique. To avoid ambiguity, an item's QID is hence linked to this combination.
Main parts
[ edit ]
A la
... (truncated, 33 KB total)e0975f5f1abf3a39 | Stable ID: sid_Az6NptjpGS