Array

Versions

HTML (interactive dashboard)
PDF

How to Cite

Ahmad, M. (2026). Cochrane Data Extractor: Automated Dataset Harvesting from Cochrane Systematic Reviews. Synthēsis, 9(1). Retrieved from https://synthesis-medicine.org/index.php/journal/article/view/108 (Original work published June 6, 2026)

Abstract

Is automated extraction the way forward for extracting from the large-scale Cochrane datasets that Cochrane has started publishing? We developed an automated Python pipeline to download and organise the study-level data that Cochrane has put up on its website (pairwise, DTA, NMA). We designed it to check the data links and then download, validate, and organise the data into standardised datasets with covariates. The test set showed 11 pairwise datasets with 1,295 data rows at high accuracy. Uninterrupted runs were then carried out until 501 Cochrane reviews had been downloaded and their data organised into the Pairwise70 repository. We converted the Cochrane open data, which is distributed and hard to use, into a single data artefact to support meta-analysis research. This tool is limited to recent Cochrane reviews and does not work on other databases.

HTML (interactive dashboard)
PDF
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2026 Mahmood Ahmad