| title | GEN349: Introduction to scRNA-seq |
|---|---|
| author | Meeta Mistry, Noor Sohail |
| date | January 16th, 2025 |
This repository has teaching materials for a hands-on Introduction to single-cell RNA-seq analysis workshop. This workshop runs over three sessions as outlined in the schedule below. The workshop will introduce participants on best practices for designing a single-cell RNA-seq experiment, and how to efficiently manage and analyze the data starting from count matrices. We will focus on using the Seurat package using R/RStudio. Working knowledge of R programming is required.
| Time | Topic | Instructor |
|---|---|---|
| 13:00 - 13:15 | Workshop introduction | Meeta |
| 13:15 - 13:45 | scRNA-seq pre-reading discussion | All |
| 13:45 - 13:50 | Break | |
| 13:50 - 14:20 | Quality control of Cellranger counts | Noor |
| 14:20 - 14:55 | Quality control setup in R | Meeta |
| 14:55 - 15:00 | Overview of self-learning materials and homework submission | Meeta |
I. Please study the contents and work through all the code within the following lessons:
-
Quality control with additional metrics
Click here for a preview of this lesson
In addition to the QC generated by cellranger, we can also compute some of our own metrics based on the raw data we have loaded into our Seurat object.
In this lesson you will:
- Compute essential QC metrics for each sample
- Create plots to visualize metrics across cells per sample
- Critically evaluate each plot and learn what each QC metric means
-
Click here for a preview of this lesson
Before we can begin the next steps of the workflow, we need to make sure you have a good understanding of Principal Components Analysis (PCA). This method will be utilized in the scRNA-seq analysis workflow, and this foundation will help you better navigate those steps and interpretation of results.
II. Submit your work:
- Each lesson above contains exercises; please go through each of them.
- Submit your answers to the exercises using the word document titled "GEN349_scRNA_day1_exercises" in the GEN349 dropbox (Week 4) on the day before the next class.
- If you get stuck due to an error while runnning code in the lesson, email us
| Time | Topic | Instructor |
|---|---|---|
| 13:00 - 13:30 | Self-learning lessons discussion | All |
| 13:30 - 14:30 | Normalization and regressing out unwanted variation | Noor |
| 14:30 - 14:35 | Break | |
| 14:35 - 15:00 | A brief introduction to Integration | Meeta |
I. Please study the contents and work through all the code within the following lessons:
-
Running CCA integration and complex integration tasks
Click here for a preview of this lesson
In class, we described the theory of integration and in what situations we would implement it.
In this lesson you will:
- Run the code to implement CCA integration
- Evaluate the effect of integration on the UMAP
- Learn about methods for complex integration tasks (Harmonizing samples)
-
Click here for a preview of this lesson
From the UMAP visualization of our data we can see that the cells are positioned into groups. Our next task is to isolate clusters of cells that are most similar to one another based on gene expression.
In this lesson you will:
- Learn the theory behind clustering and how it is performed in Seurat
- Cluster cells and visualize them on the UMAP
-
Click here for a preview of this lesson
After separating cells into clusters, it is crtical to evaluate whether they are biologically meaningful or not. At this point we can also decide if we need to re-cluster and/or potentialy go back to a previous QC step.
In this lesson you will:
- Check to see that clusters are not influenced by uninteresting sources of variation
- Check to see whether the major principal components are driving the different clusters
- Explore the cell type identities by looking at the expression for known markers across the clusters.
-
Click here for a preview of this lesson
At this point, we have populated our seurat object with many different pieces of information. Knowing how to access different values will allow you to interact more efficiently with your dataset.
In this lesson you will:
- Explore the different parts of a seurat object.
- Use the built-in functions from the Seurat package for visualizations and grabbing data.
II. Submit your work:
- Each lesson above contains exercises; please go through each of them.
- Submit your answers to the exercises using the word document titled "GEN349_scRNA_day2_exercises" in the GEN349 dropbox (Week 4) on the day before the next class.
- If you get stuck due to an error while runnning code in the lesson, email us
| Time | Topic | Instructor |
|---|---|---|
| 13:00 - 13:45 | Self-learning lessons discussion | All |
| 13:45 - 14:30 | Marker identification | Noor |
| 14:30 - 14:35 | Break | |
| 14:35 - 14:50 | Workflow summary and Q&A | Meeta |
| 14:50- 15:00 | Wrap up | Meeta |
- Seurat Cheatsheet
- Approaches for Differential expression analysis of scRNA-seq
- The scRNA-seq workflow: A practical guideline to ensure experimental success - a slide deck prepared by the HMS Single Cell Core
We have covered the analysis steps in quite a bit of detail for scRNA-seq exploration of cellular heterogeneity using the Seurat package. For more information on topics covered, we encourage you to take a look at the following resources:
- Seurat vignettes
- Seurat cheatsheet
- Satija Lab: Single Cell Genomics Day
- Additional information about cell cycle scoring
- A nice explanation on CCA and cell label transfer in Seurat
- Using RStudio on O2
- Databases with markers for manual annotation
- CellMarker 2.0
- Cell type signature gene sets from MSigDb
- CELL x GENE from CZI
- Reference-based automated celltype annotation
- "Sampling time-dependent artifacts in single-cell genomics studies." Massoni-Badosa et al. 2019
- "Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses." O'Flanagan et al. 2020
- "Systematic assessment of tissue dissociation and storage biases in single-cell and single-nucleus RNA-seq workflows." Denisenko et al. 2020
- "Confronting false discoveries in single-cell differential expression", Nature Communications 2021
- Single-nucleus and single-cell transcriptomes compared in matched cortical cell types
- A single-cell and single-nucleus RNA-Seq toolbox for fresh and frozen human tumors
- Ligand-receptor analysis with CellphoneDB
- Best practices for single-cell analysis across modalities