Skip to content

Latest commit

 

History

History
187 lines (138 loc) · 10.5 KB

File metadata and controls

187 lines (138 loc) · 10.5 KB
title GEN349: Introduction to scRNA-seq
author Meeta Mistry, Noor Sohail
date January 16th, 2025

Description

This repository has teaching materials for a hands-on Introduction to single-cell RNA-seq analysis workshop. This workshop runs over three sessions as outlined in the schedule below. The workshop will introduce participants on best practices for designing a single-cell RNA-seq experiment, and how to efficiently manage and analyze the data starting from count matrices. We will focus on using the Seurat package using R/RStudio. Working knowledge of R programming is required.

Pre-reading

Day 1

Time Topic Instructor
13:00 - 13:15 Workshop introduction Meeta
13:15 - 13:45 scRNA-seq pre-reading discussion All
13:45 - 13:50 Break
13:50 - 14:20 Quality control of Cellranger counts Noor
14:20 - 14:55 Quality control setup in R Meeta
14:55 - 15:00 Overview of self-learning materials and homework submission Meeta

Before the next class:

I. Please study the contents and work through all the code within the following lessons:

  1. Quality control with additional metrics

    Click here for a preview of this lesson
    In addition to the QC generated by cellranger, we can also compute some of our own metrics based on the raw data we have loaded into our Seurat object.

    In this lesson you will:
    - Compute essential QC metrics for each sample
    - Create plots to visualize metrics across cells per sample
    - Critically evaluate each plot and learn what each QC metric means

  2. Theory of PCA

    Click here for a preview of this lesson
    Before we can begin the next steps of the workflow, we need to make sure you have a good understanding of Principal Components Analysis (PCA). This method will be utilized in the scRNA-seq analysis workflow, and this foundation will help you better navigate those steps and interpretation of results.

II. Submit your work:

  • Each lesson above contains exercises; please go through each of them.
  • Submit your answers to the exercises using the word document titled "GEN349_scRNA_day1_exercises" in the GEN349 dropbox (Week 4) on the day before the next class.

Questions?

  • If you get stuck due to an error while runnning code in the lesson, email us

Day 2

Time Topic Instructor
13:00 - 13:30 Self-learning lessons discussion All
13:30 - 14:30 Normalization and regressing out unwanted variation Noor
14:30 - 14:35 Break
14:35 - 15:00 A brief introduction to Integration Meeta

Before the next class:

I. Please study the contents and work through all the code within the following lessons:

  1. Running CCA integration and complex integration tasks

    Click here for a preview of this lesson
    In class, we described the theory of integration and in what situations we would implement it.

    In this lesson you will:
    - Run the code to implement CCA integration
    - Evaluate the effect of integration on the UMAP
    - Learn about methods for complex integration tasks (Harmonizing samples)
  2. Clustering

    Click here for a preview of this lesson
    From the UMAP visualization of our data we can see that the cells are positioned into groups. Our next task is to isolate clusters of cells that are most similar to one another based on gene expression.

    In this lesson you will:
    - Learn the theory behind clustering and how it is performed in Seurat
    - Cluster cells and visualize them on the UMAP
  3. Clustering quality control

    Click here for a preview of this lesson
    After separating cells into clusters, it is crtical to evaluate whether they are biologically meaningful or not. At this point we can also decide if we need to re-cluster and/or potentialy go back to a previous QC step.

    In this lesson you will:
    - Check to see that clusters are not influenced by uninteresting sources of variation
    - Check to see whether the major principal components are driving the different clusters
    - Explore the cell type identities by looking at the expression for known markers across the clusters.
  4. Seurat Cheatsheet

    Click here for a preview of this lesson
    At this point, we have populated our seurat object with many different pieces of information. Knowing how to access different values will allow you to interact more efficiently with your dataset.

    In this lesson you will:
    - Explore the different parts of a seurat object.
    - Use the built-in functions from the Seurat package for visualizations and grabbing data.

II. Submit your work:

  • Each lesson above contains exercises; please go through each of them.
  • Submit your answers to the exercises using the word document titled "GEN349_scRNA_day2_exercises" in the GEN349 dropbox (Week 4) on the day before the next class.

Questions?

  • If you get stuck due to an error while runnning code in the lesson, email us

Day 3

Time Topic Instructor
13:00 - 13:45 Self-learning lessons discussion All
13:45 - 14:30 Marker identification Noor
14:30 - 14:35 Break
14:35 - 14:50 Workflow summary and Q&A Meeta
14:50- 15:00 Wrap up Meeta

Helpful HBC resources


Resources

We have covered the analysis steps in quite a bit of detail for scRNA-seq exploration of cellular heterogeneity using the Seurat package. For more information on topics covered, we encourage you to take a look at the following resources:

Seurat-focused

Scaling up: scRNA-seq analysis on HPC

Cell type annotation

Highlighted papers

Other online scRNA-seq courses: