Single-cell transcriptome profiling simulation reveals the impact of sequencing parameters and algorithms on clustering
Single-cell transcriptome profiling simulation reveals the impact of sequencing parameters and algorithms on clustering
ABSTRACT Despite of scRNA-seq analytic algorithms developed, their performance for cell clustering cannot be quantified due to the unknown “true” clusters. Referencing the transcriptomic heterogeneity of cell clusters, a “true” mRNA number matrix of cell individuals was defined as ground truth. Based on the matrix and real data generation procedure, a simulation program (SSCRNA) for raw data was developed. Subsequently, the consistence between simulated data and real data was evaluated. Furthermore, the impact of sequencing depth, and algorithms for analyses on cluster accuracy was quantified. As a result, the simulation result is highly consistent with that of the real data. It is found that mis-classification rate can be attributed to multiple reasons on current scRNA platforms, and clustering accuracy is not only sensitive to sequencing depth increasement, but can also be reflected by the position of the cluster on TSNE plot. Among the clustering algorithms, Gaussian normalization method is more appropriate for current workflows. In the clustering algorithms, k-means&louvain clustering method performs better in dimension reduced data than full data, while k-means clustering method is stable under both situations. In conclusion, the scRNA simulation algorithm developed restores the real data generation process, discovered impact of parameters on mis-clustering, compared the normalization/clustering algorithms and provided novel insight into scRNA analyses.
Peng Xueqing、Wu Aoshen、Yuan Zhenghong、Shi Bisheng、Liu Yunhe、Liu Gang、Liu Lei
Institute of Biomedical Sciences, Fudan UniversityInstitute of Biomedical Sciences, Fudan UniversityKey lab of Medical Molecular Virology of MOE/MOH, Shanghai Medical College, Fudan UniversityKey lab of Medical Molecular Virology of MOE/MOH, Shanghai Medical College, Fudan University||Research Unit, Shanghai Public Health Clinical Center, Fudan UniversityInstitute of Biomedical Sciences, Fudan UniversityInstitute of Biomedical Sciences, Fudan UniversityInstitute of Biomedical Sciences, Fudan University||School of Basic Medical Science, Fudan University
生物科学研究方法、生物科学研究技术分子生物学细胞生物学
Peng Xueqing,Wu Aoshen,Yuan Zhenghong,Shi Bisheng,Liu Yunhe,Liu Gang,Liu Lei.Single-cell transcriptome profiling simulation reveals the impact of sequencing parameters and algorithms on clustering[EB/OL].(2025-03-28)[2025-05-13].https://www.biorxiv.org/content/10.1101/2021.03.16.435626.点此复制
评论