ACM CSCW Proceedings Corpus

Full-text corpus of ACM CSCW conference papers for studying the evolution of collaborative computing research.

well-documented software-engineeringsocial-scienceseducation

arXiv Bulk Access Dataset

Complete metadata and full-text corpus of over 2 million scientific preprints across physics, mathematics, computer science, and other fields.

well-documented software-engineeringsocial-sciencescitizen-scienceeducation

COCCO 2 dataset and analyses

Dataset and analyss of the COCCO 2 experiment

Fresh established education

Crossref Metadata API

Scholarly metadata for over 130 million research outputs enabling citation and co-authorship analysis.

well-documented social-scienceseducationpublishing

European Social Survey

Cross-national survey measuring attitudes, beliefs, and behaviour across 30+ European countries biennially.

well-documented social-sciencespublic-policyeducation

GH Archive

Public dataset of GitHub activity events for studying open-source collaboration at scale.

⭐ 2982 established software-engineeringcitizen-science

Global Biodiversity Information Facility

International open data infrastructure providing access to over 2 billion species occurrence records.

well-documented environmental-sciencecitizen-scienceagriculture

Humanitarian Data Exchange

Open platform for sharing humanitarian data across organizations responding to crises worldwide.

established disaster-responsepublic-policyhealthcare

OpenAlex Research Graph

Open catalog of scholarly works, authors, institutions, and concepts replacing Microsoft Academic Graph.

established social-scienceseducationpublishing

OpenNeuro Brain Imaging

Free platform for sharing neuroimaging data to accelerate collaborative brain research.

established healthcaresocial-sciences

ORCID Public Data File

Annual data dump of researcher identifiers and affiliations enabling analysis of scientific collaboration networks.

well-documented social-scienceseducationpublishing

Recherche Data Gouv

French national research data repository for open science and collaborative data sharing.

Fresh established social-scienceseducation

Stack Overflow Data Dump

Anonymized dump of all questions, answers, and user interactions from the developer Q&A platform.

well-documented software-engineeringeducation

Stack Overflow Developer Survey

Annual survey dataset of developer demographics, technology usage, and work practices from the world's largest programming community.

well-documented software-engineeringeducation

Urban Observatory Dataset

Comparative data from cities worldwide enabling analysis of urban development and governance patterns.

emerging urban-planningpublic-policyenvironmental-science

WHO Global Health Observatory

Comprehensive health statistics and indicators from 194 countries for collaborative health research.

well-documented healthcarepublic-policysocial-sciences

Wikipedia Edit History Dumps

Complete revision history of all Wikimedia projects, enabling analysis of collaborative knowledge production at scale.

well-documented citizen-sciencesocial-scienceseducation