跟着Cell学单细胞转录组分析(五):单细胞转录组marker基因鉴定及细胞群注释

书接上回（跟着Cell学单细胞转录组分析(四):单细胞转录组测序UMAP降维聚类）。完成数据降维和细胞聚类后，最主要的环节和工作就是确定各个细胞群，明确是什么类型的细胞，正群的细胞定群很关键，涉及到整个研究，所以这一步宁愿多费时间，也不要出错。当然，这也不是一蹴而就的，需要反复的确认。

要确定各个群是什么细胞，首先需要了解细胞群的marker基因，因为不同类型的细胞突出表达的基因也是不同的。这里使用FindAllMarkers鉴定各个细胞群的高表达基因。


DefaultAssay(scedata) <- "RNA"
all.markers  <- FindAllMarkers(scedata, 
                                only.pos = TRUE, 
                                min.pct = 0.25, 
                                logfc.threshold = 0.75)
significant.markers  <- all.markers [all.markers $p_val_adj < 0.2, ]
write.csv(significant.markers, file = "significant.markers.csv")#保存

Seurat提供了几种函数例如FeaturePlot()、DotPlot()和DoHeatmap()，按照文章中的mrker基因，做一下可视化。


markers <- c("ACKR1","RAMP2","SELE","VWF","PECAM1",
             "LUM","COL3A1","DCN","COL1A1","CFD",
             "KRT14","KRT5","S100A2","CSTA","SPRR1B",
             "CD69","CD52","CXCR4","PTPRC","HCST")
DotPlot(scedata,features = markers)+coord_flip()

点图：

图片

UMAP图：

FeaturePlot(scedata,features = c("ACKR1","LUM","KRT14","CD69"))

图片

热图：


alldata <- ScaleData(scedata, 
                     features = markers, 
                     assay = "RNA")
DoHeatmap(alldata, 
          features = markers,
          group.by = "seurat_clusters",
          assay = "RNA")

图片

很显然，这些都是默认出图，距离发文章还是有一定距离的，后期我们会专门讲解个性化的修饰，争取可视化更好。

接下来就是细胞定群了，对各个细胞群命名。细胞定群有很多方法，目前也有很多工具，但是依照小编的经验，自动定群等一般结果不是完全正确，况且操作复杂，为了保证正确性，最使用的办法还是查询文献定群。定群后，对细胞群重命名。


scedata <- subset(scedata, idents = c("21"), invert = TRUE)#去掉低质量细胞群
new.cluster.ids <- c("0"="Fibroblast", 
                     "1"="Endothelial", 
                     "2"="Endothelial", 
                     "3"="Endothelial", 
                     "4"="Immune", 
                     "5"="Immune", 
                     "6"="Endothelial", 
                     "7"="Fibroblast", 
                     "8"="Other", 
                     "9"="Immune", 
                     "10"="Epithelial", 
                     "11"="Endothelial", 
                     "12"="Fibroblast", 
                     "13"="Immune", 
                     "14"="Other", 
                     "15"="Immune", 
                     "16"="Fibroblast", 
                     "17"="Endothelial", 
                     "18"="Fibroblast", 
                     "19"="Epithelial", 
                     "20"="Endothelial", 
                     "22"="Immune",
                     "23"="Immune", 
                     "24"="Immune", 
                     "25"="Epithelial",
                     "26"="Immune",
                     "27"="Immune", 
                     "28"="Immune", 
                     "29"="Other")
scedata <- RenameIdents(scedata, new.cluster.ids)                        
scedata$celltype <- [email protected]
DimPlot(scedata, group.by = "celltype")
save(scedata, file = "scedata.RData")

图片

最后将命名的文件保存，可视化细胞群！在进行下一步工作之前，之后的内容将会是对目前这些图形结果的修饰和个性化可视化！

跟着Cell学单细胞转录组分析(五):单细胞转录组marker基因鉴定及细胞群注释

你可能感兴趣的:(跟着Cell学单细胞转录组分析(五):单细胞转录组marker基因鉴定及细胞群注释)