|国家预印本平台
首页|Subpopulation identification for single-cell RNA-sequencing data using functional data analysis

Subpopulation identification for single-cell RNA-sequencing data using functional data analysis

Subpopulation identification for single-cell RNA-sequencing data using functional data analysis

来源:bioRxiv_logobioRxiv
英文摘要

Abstract BackgroundIn single-cell RNA-sequencing (scRNA-seq) data analysis, a number of statistical tools in multivariate data analysis (MDA) have been developed to help analyze the gene expression data. This MDA approach is typically focused on examining discrete genomic units of genes that ignores the dependency between the data components. In this paper, we propose a functional data analysis (FDA) approach on scRNA-seq data whereby we consider each cell as a single function. To avoid a large number of dropouts (zero or zero-closed values) and reduce the high dimensionality of the data, we first perform a principal component analysis (PCA) and assign PCs to be the amplitude of the function. Then we use the index of PCs directly from PCA for the phase components. This approach allows us to apply FDA clustering methods to scRNA-seq data analysis. ResultsTo demonstrate the robustness of our method, we apply several existing FDA clustering algorithms to the gene expression data to improve the accuracy of the classification of the cell types against the conventional clustering methods in MDA. As a result, the FDA clustering algorithms achieve superior accuracy on simulated data as well as real data such as human and mouse scRNA-seq data. ConclusionsThis new statistical technique enhances the classification performance and ultimately improves the understanding of stochastic biological processes. This new framework provides an essentially different scRNA-seq data analytical approach, which can complement conventional MDA methods. It can be truly effective when current MDA methods cannot detect or uncover the hidden functional nature of the gene expression dynamics.

Fujiwara Hironobu、Ahn Kyungmin

10.1101/760413

生物科学研究方法、生物科学研究技术细胞生物学分子生物学

functional data analysissingle-cell RNA-sequencing data analysisclusteringclassification

Fujiwara Hironobu,Ahn Kyungmin.Subpopulation identification for single-cell RNA-sequencing data using functional data analysis[EB/OL].(2025-03-28)[2025-05-09].https://www.biorxiv.org/content/10.1101/760413.点此复制

评论