Treffer: A Cloud-Based Platform for Harmonized COVID-19 Data: Design and Implementation of the Rapid Acceleration of Diagnostics (RADx) Data Hub.

Title:
A Cloud-Based Platform for Harmonized COVID-19 Data: Design and Implementation of the Rapid Acceleration of Diagnostics (RADx) Data Hub.
Authors:
Martínez-Romero M; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Horridge M; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Mistry N; Booz Allen Hamilton Inc., McLean, VA, United States., Weyhmiller A; University of North Carolina at Chapel Hill, Renaissance Computing Institute (RENCI), Chapel Hill, NC, United States., Yu JK; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Fujimoto A; Booz Allen Hamilton Inc., McLean, VA, United States., Henry A; University of North Carolina at Chapel Hill, Renaissance Computing Institute (RENCI), Chapel Hill, NC, United States., O'Connor MJ; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Sier A; Booz Allen Hamilton Inc., McLean, VA, United States., Suber S; University of North Carolina at Chapel Hill, Renaissance Computing Institute (RENCI), Chapel Hill, NC, United States., Akdogan MU; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Cao Y; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States., Valliappan S; Booz Allen Hamilton Inc., McLean, VA, United States., Mieczkowska JO; University of North Carolina at Chapel Hill, Renaissance Computing Institute (RENCI), Chapel Hill, NC, United States., Krishnamurthy A; University of North Carolina at Chapel Hill, Renaissance Computing Institute (RENCI), Chapel Hill, NC, United States., Keller MA; Booz Allen Hamilton Inc., McLean, VA, United States., Musen MA; Stanford University, Stanford Center for Biomedical Informatics Research, Palo Alto, CA, United States.
Corporate Authors:
RADx Data Hub Team; see Authors' Contributions section, .
Source:
JMIR public health and surveillance [JMIR Public Health Surveill] 2025 Aug 20; Vol. 11, pp. e72677. Date of Electronic Publication: 2025 Aug 20.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: JMIR Publications Country of Publication: Canada NLM ID: 101669345 Publication Model: Electronic Cited Medium: Internet ISSN: 2369-2960 (Electronic) Linking ISSN: 23692960 NLM ISO Abbreviation: JMIR Public Health Surveill Subsets: MEDLINE
Imprint Name(s):
Original Publication: Toronto : JMIR Publications, [2015]-
References:
Lancet Infect Dis. 2023 Sep;23(9):e383-e388. (PMID: 37150186)
Viruses. 2022 May 18;14(5):. (PMID: 35632824)
Sci Data. 2024 Jan 31;11(1):152. (PMID: 38297013)
Sci Data. 2022 Nov 12;9(1):696. (PMID: 36371407)
Int J Epidemiol. 2017 Feb 1;46(1):103-105. (PMID: 27272186)
Nucleic Acids Res. 2021 Jul 2;49(W1):W619-W623. (PMID: 34048576)
EBioMedicine. 2022 May;79:104008. (PMID: 35460989)
Sci Data. 2024 Feb 14;11(1):204. (PMID: 38355867)
N Engl J Med. 2020 Sep 10;383(11):1071-1077. (PMID: 32706958)
J Am Med Inform Assoc. 2015 Nov;22(6):1148-52. (PMID: 26112029)
Nat Commun. 2023 Jul 10;14(1):3692. (PMID: 37429842)
Sci Adv. 2023 Apr 7;9(14):eade4962. (PMID: 37027461)
Lancet Digit Health. 2023 Oct;5(10):e712-e736. (PMID: 37775189)
Yearb Med Inform. 2021 Aug;30(1):75-83. (PMID: 34479380)
J Am Med Inform Assoc. 2021 Mar 1;28(3):427-443. (PMID: 32805036)
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W541-5. (PMID: 21672956)
J Am Med Dir Assoc. 2021 Jun;22(6):1133-1137. (PMID: 33861978)
Sci Data. 2016 Mar 15;3:160018. (PMID: 26978244)
AMIA Annu Symp Proc. 2018 Dec 05;2018:602-608. (PMID: 30815101)
Vaccines (Basel). 2023 Aug 14;11(8):. (PMID: 37631929)
Cell Stem Cell. 2022 May 5;29(5):810-825.e8. (PMID: 35523141)
Pharmacoepidemiol Drug Saf. 2024 Jun;33(6):e5815. (PMID: 38783412)
Am J Public Health. 2022 Oct;112(10):1399-1403. (PMID: 35952331)
Am J Public Health. 2022 Nov;112(S9):S858-S863. (PMID: 36194852)
Pediatrics. 2022 Jun 1;149(6):. (PMID: 35260896)
Lancet Digit Health. 2022 Jul;4(7):e532-e541. (PMID: 35589549)
Int J Inf Manage. 2021 Aug;59:102352. (PMID: 33824545)
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W170-3. (PMID: 19483092)
Grant Information:
OT2 DB000009 United States DB DB NIH HHS
Contributed Indexing:
Keywords: COVID-19 surveillance; FAIR data sharing; cloud-based data platform; data harmonization and integration; digital health research; health disparities; metadata standards; pandemic response informatics; public health data infrastructure; secondary data analysis
Entry Date(s):
Date Created: 20250820 Date Completed: 20250820 Latest Revision: 20250907
Update Code:
20250907
PubMed Central ID:
PMC12409176
DOI:
10.2196/72677
PMID:
40834404
Database:
MEDLINE

Weitere Informationen

Background: The COVID-19 pandemic exposed significant limitations in existing data infrastructure, particularly the lack of systems for rapidly collecting, integrating, and analyzing data to support timely and evidence-based public health responses. These shortcomings hampered efforts to conduct comprehensive analyses and make rapid, data-driven decisions in response to emerging threats. To overcome these challenges, the US National Institutes of Health launched the Rapid Acceleration of Diagnostics (RADx) initiative. A key component of this initiative is the RADx Data Hub-a centralized, cloud-based platform designed to support data sharing, harmonization, and reuse across multiple COVID-19 research programs and data sources.
Objective: We aim to present the design, implementation, and capabilities of the RADx Data Hub, a cloud-based platform developed to support findable, accessible, interoperable, reusable (FAIR) data practices and enable secondary analyses of the COVID-19-related data contributed by a nationwide network of researchers.
Methods: The RADx Data Hub was developed on a scalable cloud infrastructure, grounded in the FAIR data principles. The platform integrates heterogeneous data types-including clinical data, diagnostic test results, behavioral data, and social determinants of health-submitted by over 100 research organizations across 46 US states and territories. The data pipeline includes automated and manual processes for deidentification, quality validation, expert curation, and harmonization. Metadata standards are enforced using tools such as the Center for Expanded Data Annotation and Retrieval (CEDAR) Workbench and BioPortal. Data files are structured using a unified specification to support consistent representation and machine-actionable metadata.
Results: As of May 2025, the RADx Data Hub hosts 187 studies and over 1700 data files, spanning 4 RADx programs: RADx Underserved Populations (RADx-UP), RADx Radical (RADx-rad), RADx Tech, and RADx Digital Health Technologies (RADx DHT). The Study Explorer and Analytics Workbench components enable researchers to discover relevant studies, inspect rich metadata, and conduct analyses within a secure cloud-based environment. Harmonized data conforming to a core set of common data elements facilitate cross-study integration and support secondary use. The platform provides persistent identifiers (digital object identifiers) for each study and supports access to structured metadata that adhere to the CEDAR specification, available in both JSON and YAML formats for seamless integration into computational workflows.
Conclusions: The RADx Data Hub successfully addresses key data integration challenges by providing a centralized, FAIR-compliant platform for public health research. Its adaptable architecture and data management practices are designed to support secondary analyses and can be repurposed for other scientific disciplines, strengthening data infrastructure and enhancing preparedness for future health crises.
(©Marcos Martínez-Romero, Matthew Horridge, Nilesh Mistry, Aubrie Weyhmiller, Jimmy K Yu, Alissa Fujimoto, Aria Henry, Martin J O'Connor, Ashley Sier, Stephanie Suber, Mete U Akdogan, Yan Cao, Somu Valliappan, Joanna O Mieczkowska, Ashok Krishnamurthy, Michael A Keller, Mark A Musen, RADx Data Hub Team. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 20.08.2025.)