Datasets
Dataset index
BEAR stores each data source separately (see data/) but also makes available a single BEAR.rds dataset to download. For very large datasets BEAR.rds only contains a smaller selection of rows.
Databases
| Dataset | Domain | Data |
|---|---|---|
| clinicaltrials.gov snapshot | clinical trials | registry of 16.6k clinical trials |
| Cochrane Database of Systematic Reviews (CDSR) | medicine & health | 90k studies in 6.6k Cochrane reviews |
| EU Clinical Trials Register (EU CTR) | clinical trials | registry of 8.7k clinical trials |
| Metapsy | psychotherapy | 1.5k studies in 20 meta-analyses |
| psymetadata and Nuijten et al | psychology / intelligence | 2.6k studies across psychology datasets |
| What Works Clearinghouse | education | 1.4k education studies |
Metascience datasets
| Dataset | Domain | Data |
|---|---|---|
| Arel-Bundock et al 2022 | political science | 2.3k studies in 351 meta-analyses |
| Askarov et al 2023 | economics | 1.9k studies from 352 meta-analyses |
| Bartoš et al. 2025 | exercise | 2.2k studies in 215 meta-analyses |
| Brodeur et al. 2024 | economics | 328 RCTs from econ journals |
| Costello and Fox 2022; Yang et al 2023, 2024 | ecology & evolution | 13k studies in 553 meta-analyses |
| Lang 2025 | economics | 736 papers from econ journals |
| Sladekova et al. 2023 | psychology | 3.5k studies in 406 meta-analyses |
| Szucs and Ioannidis 2017 | cognitive neuroscience | 2.3k cognitive neuroscience studies |
Replication efforts
| Dataset | Domain | Data |
|---|---|---|
| Many Labs 2 | psychology | 128 replications of 28 effects |
| Open Science Collaboration 2015 | psychology | 97 replications of experiments |
| SCORE | social & behavioural sciences | replications of 163 claims + 1.9k claims from papers |
PubMed/Medline scraped data
| Dataset | Domain | Data |
|---|---|---|
| Barnett and Wren 2019 | biomedicine | 416k studies from MEDLINE/PubMed |
| Chavalarias et al. 2016 | biomedicine | 1.9mln studies from MEDLINE/PubMed |
| Head et al 2015 | biomedicine | 219k studies from PubMed |
| Jager and Leek 2014 | biomedicine | 5.3k articles from 5 medical journals |
Reference documents
| Document | Contents |
|---|---|
| Documentation | Website documentation for shared derivation rules, currently covering p-values and confidence intervals. |
| Full dataset appendix | Stitched dataset notes and shared derivation notes for p-values and confidence intervals. |
| Dataset construction reference | Source-study selection, source-data extraction, and construction details that align with the paper-facing dataset appendix. |