Single Cell RNA-Seq: full workflow in R [ public data to classified UMAP in 30 mins]

preview_player
Показать описание
Here is a full, basic single cell RNA-Seq workflow in R, starting with some aligned publicly available data and ending with a nice UMAP plot with classified clusters and their markers banked down.
Single cell RNA-Seq has captured a lot of attention and imaginations (and publications !) in bioinformatics these last few years. Many starting Phd’s (including me ) needed to suddenly pivot and learn the basics of the field. Theres a steep learning curve, and the most valuable learning comes from iterating over your analysis, and exploring the functions and the stats behind them, and comparing your analysis to others, reading their methods etc.
Getting those first few plots up can seem like quite the hurdle and seeing a pretty tSNE or UMAP is a huge morale boost when youre learning so I think this is a good place to start. The “actual” start is of course the bench, and the start of the computational work is on the .fastq files as they come off the sequencer , but I found that phase even harder to learn actually, so I think its helpful to go for the higher yield, quicker wins first to build some confidence and have some data you can show your team at the lab meeting and start thinking about!
The data was generated using a droplet based workflow by 10x genomics but this can apply to any workflow.

## script here so you can follow along

## home of Seurat package, excellent vignettes

## another excellent course

## UMAP original paper

## viridis
Рекомендации по теме
Комментарии
Автор

Thank you for this great tutorial, I will for sure practice this! Have you considered making a tutorial on Massspec data analysis? Would be amazing ☺️

beecher
Автор

I want to line up logarithmic readings of my scRNA-seq data sets and create an excel file. Can you help me?

cemilmeric
Автор

Hello it’s great video can i have script and data. Thanks

afsanehmaleki
Автор

Thank you very much. Your explanations are very useful.
Although, in the script provided by you, this chunk could not be executed:
top10.dat.markers <- dat.markers %>% group_by(cluster) %>% top_n(n = 10, wt = avg_logFC)

I only managed to execute it once I changed logFC to log2FC... may I ask for some comment on why it is so?

katiashcheglova
Автор

Great tutorial. I am following your script for ScRNA data analysis and getting following errors. Please help me to solve this problem. Thank you.
after this command
> head(rownames(dat.sham))
[1] NA
[3] NA
[5] NA
when I am trying to create Seurat object it gives the following error message!!
Warning: Non-unique features (rownames) present in the input matrix, making unique
Error in if (any(rownames(x = counts) == "")) { :
missing value where TRUE/FALSE needed

Please help :)

explore_with_pinku