Wikipedia Edit History Dumps

The Wikipedia Edit History Dumps provide the complete revision history of all Wikimedia projects, including every edit made to every article since Wikipedia's founding in 2001. The dumps include article text, edit metadata, editor information, talk page discussions, and administrative actions. This dataset is one of the most extensively studied resources in computational social science, enabling research on collaborative knowledge production, conflict resolution, consensus formation, and the dynamics of volunteer-driven online communities. The sheer scale of the data — encompassing billions of edits across hundreds of language editions — makes it a unique resource for understanding large-scale human collaboration.


The dataset is particularly relevant to citizen science, the social sciences, and education.

The dumps support work on crowdsourcing, community-based collaboration, and open-source collaboration, and are suited to community-scale initiatives in remote settings.

Wikipedia Edit History Dumps is classified as a well-documented dataset, indicating broad adoption and available documentation. Its hosting platform is listed as "other", and the data is distributed as XML and SQL dumps.
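
Because the XML dumps can be very large, they are usually processed as a stream rather than loaded into memory. The following is a minimal sketch of that approach in Python, not an official tool. It assumes the MediaWiki export layout of `<page>` elements containing a `<title>` followed by `<revision>` elements; the namespace URI, the inline `SAMPLE` data, and the helper names `local_name` and `count_revisions` are illustrative assumptions, so check the element names against the dump you actually download.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def count_revisions(stream):
    """Stream through a dump and count <revision> elements per page title."""
    counts = Counter()
    title = None
    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            # Assumes <title> precedes the revisions inside each <page>.
            title = elem.text
        elif name == "revision":
            counts[title] += 1
            elem.clear()  # drop the revision text to keep memory use low
        elif name == "page":
            title = None
            elem.clear()
    return counts


# Made-up sample in the assumed export layout (not real Wikipedia data).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><id>1</id><timestamp>2001-01-15T00:00:00Z</timestamp><text>first</text></revision>
    <revision><id>2</id><timestamp>2001-01-16T00:00:00Z</timestamp><text>second</text></revision>
  </page>
  <page>
    <title>Other</title>
    <revision><id>3</id><timestamp>2002-03-01T00:00:00Z</timestamp><text>only</text></revision>
  </page>
</mediawiki>"""

revision_counts = count_revisions(io.BytesIO(SAMPLE))
for page_title, n in revision_counts.items():
    print(page_title, n)
```

For a real, bz2-compressed dump file, the same function could be given the file object returned by `bz2.open(path, "rb")` from the standard library, where `path` is whichever dump file you have downloaded.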