The All of Us Research Program (AoURP), led by the National Institutes of Health, is a longitudinal cohort study aimed at advancing precision medicine and improving human health through partnering with one million or more diverse participants across the United States. With an emphasis on reaching historically underrepresented populations in biomedical research, the AoURP datasets include:
Electronic Health Records (EHRs): Participants have the option to share EHR data, which is standardized using the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM).
Biosamples and Bioassays: DNA is extracted via blood, urine, or saliva for subsequent genomic analysis.
Mobile Health Data: Biometric data like heart rate and blood pressure are tracked using wearable devices.
Physical Measurements: At the participant's first appointment, trained All of Us staff members measure and record information such as height, weight, BMI, waist and hip circumferences, blood pressure, heart rate, pregnancy status, and wheelchair use.
Surveys: The database features a total of eight surveys, with The Basics, Lifestyle, and Overall Health being the three primary ones. For more information about the additional surveys and to explore their content, please visit the Survey Explorer.
All of Us Research Hub
TheAll of Us Research Hub stores health data from a diverse group of participants from across the United States.
Public Tier: The dataset contains only aggregate data with identifiers removed. These data are available to everyone through Data Snapshots and the Data Browser, an interactive tool on the Research Hub.
Registered Tier: The curated dataset contains deidentified individual-level data, available only to approved researchers on the Researcher Workbench. The Registered Tier currently includes data from electronic health records (EHRs), wearables, and surveys, as well as physical measurements taken during participant enrollment.
Controlled Tier: The dataset contains genomic data in the form of whole genome sequencing (WGS) and genotyping arrays, previously suppressed demographic data fields from EHRs and surveys, and unshifted dates of events.