The C3 AI COVID-19 Data Lake has integrated many disparate and disjointed COVID-19 data sources in a unified data model. All these COVID-19 data sources are automatically updated and are available through the same RESTful API interface, regardless of their origin. Here, we list all data sources currently included in the C3 AI COVID-19 Data Lake.
Description
Daily update of location and number of confirmed COVID-19 cases, deaths, and recoveries for all affected countries. Cases are reported at the province level in China, county level in the US, state level in Australia and Canada, and at the country level otherwise.
Organization
Johns Hopkins University: COVID-19 Data Repository
Date Added
April 22, 2020
Description
Daily update of location and number of confirmed COVID-19 cases, deaths, tests, and hospitalizations for all 50 US states, the District of Columbia, and 5 other US territories.
Organization
COVID Tracking Project
Date Added
April 22, 2020
Description
Daily update of location and number of confirmed COVID-19 cases and deaths for countries, territories, and areas reporting around the world.
Organization
World Health Organization
Date Added
April 22, 2020
Description
Daily update of location and number of confirmed COVID-19 cases and deaths in the United States, at the state and county level. This time series data is compiled from state and local governments and health departments.
Organization
New York Times
Date Added
April 22, 2020
Description
Daily update of location and number of confirmed COVID-19 cases and deaths worldwide based on reports from health authorities. Every day, up to 500 relevant sources are screened to collect the latest figures, and data entries are validated and documented.
Organization
European Centre for Disease Prevention and Control
Date Added
April 22, 2020
Description
Forecast models showing demand for hospital services, including the availability of ventilators, general hospital beds, and ICU beds, as well as daily and cumulative deaths, infections and testing related to COVID-19.
Organization
University of Washington’s Institute for Health Metrics and Evaluation
Date Added
May 15, 2020. Data available from March 25, 2020 to June 13, 2020.
Description
Data related to COVID-19 in South Korea including COVID-19 case counts and line list patient data. Dataset components include patient routes as well as patient age, gender and date of diagnosis.
Organization
Data Science for COVID-19 and Korea Centers for Disease Control & Prevention
Date Added
May 31, 2020
Description
Daily update of location and number of confirmed COVID-19 cases, deaths, tests, and hospitalizations in Italy at a regional and provincial level.
Organization
Italy’s Department of Civil Protection - Coronavirus Emergency
Date Added
May 15, 2020
Description
Daily update of location and number of COVID-19 cases and tests in India based on state bulletins and official handles, validated by a group of volunteers.
Organization
Covid19India.org
Date Added
May 15, 2020
Description
Daily update of location and number of COVID-19 cases, deaths, tests, recoveries, and hospitalizations gathered at a county level from a variety of openly available world government data sources and curated datasets.
Organization
Corona Data Scraper
Date Added
June 2, 2020
Description
Time series data on the number of deaths from all causes that have occurred during the coronavirus pandemic for 25 countries. The totals in this data include deaths from COVID-19 as well as those from other causes, likely including people who could not be treated or did not seek treatment for other conditions.
Organization
The New York Times
Date Added
June 23, 2020
Description
Provisional death counts for COVID-19 in the United States by select demographic and geographic characteristics such as sex, age, race and Hispanic origin.
Organization
Centers for Disease Control and Prevention (CDC)
Date Added
August 11, 2020
Description
The most complete and up-to-date race and ethnicity data on COVID-19 in the United States gathered through a collaboration between the COVID Tracking Project and the Boston University Center for Antiracist Research.
Organization
COVID Tracking Project at The Atlantic and Boston University Center for Antiracist Research
Date Added
August 11, 2020
Description: Daily update of number of COVID-19 cases, deaths, hospitalizations and recoveries for cities, counties, states, provinces and countries around the world.
Description
Centralized repository of individual-level line list data from laboratory confirmed patients with COVID-19 from around the world with information on age, gender, chronic diseases, and travel history.
Organization
nCoV-2019 – Data Working Group
Date Added
May 15, 2020. Data available from Jan 3, 2020 to April 30, 2020.
Description
Centralized repository of individual-level line list data from laboratory confirmed patients with COVID-19 from around the world with information on age, gender, symptoms, exposure history and hospitalization details.
Organization
Laboratory for the Modeling of Biological Socio-technical Systems (MOBS Lab)
Date Added
April 22, 2020
Repositories of individual-level line list data such as age, gender, symptoms, exposure history and hospitalization details for patients with COVID-19.
Description
Updated genomic sequence data from COVID-19 samples, with standardized metadata including type, length, collection date, location, host, isolation source and related publications.
Organization
National Center for Biotechnology Information Virus Database (NCBI)
Date Added
April 22, 2020
Genomic sequences of COVID-19 nucleotide and protein samples from around the world.
Description
Collection of over 45,000 journal articles about COVID-19 and the coronavirus family of viruses, updated weekly. The corpus is updated regularly as new research is published in peer-reviewed publications and archival services like bioRxiv, medRxiv, and others.
Organization
Allen Institute for AI: COVID-19 Open Research Dataset (CORD-19)
Date Added
April 22, 2020. Data available from March 13, 2020 to April 8, 2020.
Collection of journal articles about COVID-19 and the coronavirus family of viruses in an easy to access format.
Description
A curated collection of therapeutics for COVID-19 being tested and in development. Publicly available information are aggregated from validated sources on different areas of research including vaccines, antibiotics, antivirals, RNA-based treatments, devices and antivirals.
Organization
Milken Institute: COVID-19 Treatment and Vaccine Tracker
Date Added
April 22, 2020
Description
Collection of treatments and vaccines for COVID-19 being tested and under development including therapy types, targets, anticipated next steps and funding sources.
Organization
World Health Organization
Date Added
April 22, 2020
Description
Dataset of >230 chest X-ray and CT images of patients who are positive or suspected of COVID-19 or other viral and bacterial pneumonias (MERS, SARS, and ARDS). Data are collected from public sources as well as through indirect collection from hospitals and physicians.
Organization
University of Montreal
Date Added
May 15, 2020
Description
Repository of clinical characteristics of patients who have taken a COVID-19 test including test results, location, patient age, heart rate, blood pressure, clinical history, oxygen saturation levels, respiratory status and other symptoms.
Organization
Carbon Health and Braid Health
Date Added
May 15, 2020
Description
Information about number of licensed beds, staffed beds, ICU beds, and the bed utilization rate for the hospitals in the United States including hospital types such as long term acute care hospitals, clinical access hospitals, Veterans Affairs (VA) hospitals, and Department of Defense (DoD) hospitals.
Organization
Definitive Healthcare
Date Added
May 15, 2020
Description
Vaccination coverage data estimated as percentage of people who have received specific vaccines for all ages including children (19-35 months), children attending kindergarten, adolescents, and adults (18 years and older).
Organization
Centers for Disease Control and Prevention (CDC)
Date Added
June 2, 2020
Description
A registry of all clinical trials assessing the efficacy and safety of clinical candidate interventions to treat COVID-19. Data are pulled from the International Clinical Trials Registry Platform, including those from the Chinese Clinical Trial Registry, ClinicalTrials.gov, Clinical Research Information Service - Republic of Korea, EU Clinical Trials Register, ISRCTN, Iranian Registry of Clinical Trials, Japan Primary Registries Network, and German Clinical Trials Register.
Organization
Cytel Inc.
Date Added
June 2, 2020
Repositories of clinical assets related to COVID-19 such as CT and X-ray lung images, patient test results, vaccine coverage, active therapeutics and clinical trials.
Description
State data and policy actions to address COVID-19 in the US including social distancing measures, health policy actions to reduce barriers to testing and treatment, as well as testing and provider capacity at the state level.
Organization
Kaiser Family Foundation
Date Added
May 15, 2020
Description
A collection of 17 indicators on common policy responses that governments in 160 countries have taken to respond to the pandemic, such as school closures and travel restrictions.
Organization
University of Oxford
Date Added
June 23, 2020
Collection of actions and policies taken by government and regulatory bodies to address COVID-19.
Description
Daily reports on mobility trends reflecting requests for directions in Apple Maps. Data for driving, walking and transit are included at city, state and country levels from around the world.
Organization
Apple Inc.
Date Added
May 15, 2020
Description
Movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential. These datasets show how visits and length of stay at different places change compared to a baseline. These changes are calculated using the same kind of aggregated and anonymized data used to show popular times for places in Google Maps.
Organization
Google Inc.
Date Added
June 2, 2020
Description
A repository of indices describing exposure derived from anonymized, aggregated smartphone movement data. The indices describe potential exposure varying across locations and time within the United States.
Organization
UC Berkeley and PlaceIQ
Date Added
June 23, 2020
Data sources on movement trends during COVID-19 in different geographies around the world.
Description
County resident population in all 50 US states and District of Columbia by characteristics such as age, sex and race from 2010 through 2018.
Organization
United States Census Bureau
Date Added
May 15, 2020
Description
Annual global population, demographic and health information by country starting in 1960. Indicators include birth rates, cause of death, contraception prevalence, diabetes prevalence, fertility rate and immunization.
Organization
The World Bank
Date Added
May 15, 2020
Description
Estimates of country populations since 1950 with projections through 2050. The dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.
Organization
United States Census Bureau
Date Added
June 23, 2020
Description
County population from 2010 to 2019 by characteristics such as age, sex, race and Hispanic origin.
Organization
United States Census Bureau
Date Added
August 11, 2020
Data sources on demographic data such as population, age, sex and death rates from around the world.
Description
A collection of policy measures taken in support of the financial sector to address the impact of the COVID-19 pandemic.
Organization
The World Bank
Date Added
June 23, 2020
Description
County-level monthly unemployment statistics in the United States from 1990 to the present.
Organization
US Bureau of Labor Statistics
Date Added
August 11, 2020
Description
County-level monthly housing indicators such as inventory and market volatility from 2016 to the present.
Organization
Realtor.com
Date Added
August 11, 2020
Description
County-level annual GDP and economic profile in the United States from 1959 to 2018.
Organization
Bureau of Economic Analysis
Date Added
August 11, 2020
Description
Tracker for the economic impacts of COVID-19 on people, businesses, and communities across the United States in real time using indicators such as spending, employment, revenue, job postings, education, and public health.
Organization
Opportunity Insights
Date Added
August 11, 2020
Data sources on economic indicators in different geographies around the world.
Description
Weather data that provide current and forecast conditions, seasonal and sub-seasonal forecasts, lifestyle indices, severe weather and historical weather data.
Organization
IBM
Date Added
June 23, 2020
Data sources of environmental factors and indices during the age of COVID-19.
Description
Symptom prevalence, demographic, and political opinion data collected from thousands of respondents since April 2020.
Organization
Swayable and TapResearch
Date Added
August 11, 2020
Data sources of measured public opinions and political views.
Complete the Form to Get Access
Get started by downloading R and Python quickstart notebooks and access documentation for C3 AI COVID-19 RESTful APIs.
One of the primary objectives of the C3 AI COVID-19 Data Lake is to continuously and conceptually expand the corpus of the data lake. Please contact us if you: 1) Are able to directly contribute specific data sets related to COVID-19, or 2) Have a specific request for data sets you want to see included.
By submitting your information, you agree to our Privacy Policy, Terms of Use and our Data Use Conditions.
This website uses cookies to facilitate and enhance your use of the website and track usage patterns. By continuing to use this website, you agree to our use of cookies as described in our Privacy Policy.