Skip to content

kuanrongchan/COVID19-severity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

79 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Severe COVID-19 blood transcriptomics database

This database is a curation of 7 transcriptomics datasets comparing the gene expression in severe COVID-19 and mild COVID-19 subjects. Each database contains raw count data and a processed file, where the raw count data may be visualised with the box and strip plots in the app. The processed files contain p-value (t-test), adjusted p-value (BH step-up procedure), ratio, and fold-change data between severe COVID-19 and mild COVID-19, or severe COVID-19 and healthy subjects. The transcriptomics data may be found in the counts and anova folders respectively.

Severe COVID-19 datasets included in the study

1. Fong et al., 2021.

Dataset compares the gene expression differences between 14 severe COVID-19, 18 mild COVID-19, and 6 healthy subjects. This dataset may be found at GSE155454.

2. Bibert et al., 2021

Dataset compares the gene expression differences between 15 severe COVID-19, 63 mild COVID-19, and 27 healthy subjects. This dataset may be found at Bibert 2021.

3. McClain et al., 2021

Dataset compares the gene expression differences between 6 severe COVID-19, 10 mild COVID-19, and 19 healthy subjects. This dataset may be found at GSE161731.

4. Overmyer et al., 2021

Dataset compares the gene expression differences between 50 severe COVID-19 and 50 mild COVID-19 subjects. This dataset may be found at GSE157103.

5. Arunachalam et al., 2020

Dataset compares the gene expression differences between 4 severe COVID-19, 12 mild COVID-19, and 17 healthy subjects. This dataset may be found at GSE152418.

6. Carapito et al., 2022

Dataset compares the gene expression differences between 46 severe COVID-19 and 23 mild COVID-19 subjects. This dataset may be found at GSE172114.

7. Zhang et al., 2021

Dataset compares the gene expression differences between 5 severe COVID-19 and 5 mild COVID-19 subjects. This dataset may be found at GSE164805.

Website

Database app website instance is accessible at https://share.streamlit.io/kuanrongchan/covid19-severity/main/app.py

Users can modify the codes to launch the app on your computer by doing these steps:

  1. Download this repository by clicking on the green code button and downloading the zip file.
  2. In the command-line interface (command prompt on Windows, Terminal on MacOS/Linux), navigate to the directory the file is located at and install its dependencies as shown below.
    cd path/to/file
    pip install -r requirements.txt
    streamlit run app.py
    

Getting Started

To use the app:

  1. Select a database to query
  2. Search for your gene of interest using the filters indicated in the dataframe headers.
  3. Tick the checkbox next to the gene of interest. Note that only one gene can be selected at a time for graph plotting.
  4. Box and strip plots of the selected gene of interest in the dataset will be plotted. Mouse over the various plots for more in-depth statistics of each group.