Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Abstract Single-nucleotide variants (SNVs) are the most common variations in the human genome. Recently developed methods for SNV detection from single-cell DNA sequencing (scDNAseq) data, such as SCIΦ and scVILP, leverage the evolutionary history of the cells to overcome the technical errors associated with single-cell sequencing protocols. Despite being accurate, these methods are not scalable to the extensive genomic breadth of single-cell whole-genome (scWGS) and whole-exome sequencing (scWES) data. Here we report on a new scalable method, Phylovar, which extends the phylogeny-guided variant calling approach to sequencing datasets containing millions of loci. Through benchmarking on simulated datasets under different settings, we show that, Phylovar outperforms SCIΦ in terms of running time while being more accurate than Monovar (which is not phylogeny-aware) in terms of SNV detection. Furthermore, we applied Phylovar to two real biological datasets: an scWES triple-negative breast cancer data consisting of 32 cells and 3375 loci as well as an scWGS data of neuron cells from a normal human brain containing 16 cells and approximately 2.5 million loci. For the cancer data, Phylovar detected somatic SNVs with high or moderate functional impact that were also supported by bulk sequencing dataset and for the neuron dataset, Phylovar identified 5745 SNVs with non-synonymous effects some of which were associated with neurodegenerative diseases. We implemented Phylovar and made it publicly available at https://github.com/mae6/Phylovar.git.
Chowdary Sunkara B. V.、Zafar Hamim、Robledo Sergio、Ogilvie Huw A.、Edrisi Mohammadamin、Posada David、Nakhleh Luay、Valecha Monica V.
Department of Computer Science & Engineering, Indian Institute of Technology KanpurDepartment of Computer Science & Engineering, Indian Institute of Technology Kanpur||Department of Biological Sciences & Bioengineering, Institute of Technology Kanpur||Mehta Family Centre for Engineering in Medicine, Indian Institute of Technology KanpurUniversity of HoustonDepartment of Computer Science, Rice UniversityDepartment of Computer Science, Rice UniversityCINBIO, Universidade de Vigo||Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO||Department of Biochemistry, Genetics, and Immunology, Universidade de VigoDepartment of Computer Science, Rice UniversityCINBIO, Universidade de Vigo
生物科学研究方法、生物科学研究技术遗传学分子生物学
Single-nucleotide variation detectionSingle-cell whole-genome sequencingSingle-cell whole-exome sequencingIntra-tumor heterogeneity
Chowdary Sunkara B. V.,Zafar Hamim,Robledo Sergio,Ogilvie Huw A.,Edrisi Mohammadamin,Posada David,Nakhleh Luay,Valecha Monica V..Phylovar: Towards scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data[EB/OL].(2025-03-28)[2025-08-02].https://www.biorxiv.org/content/10.1101/2022.01.16.476509.点此复制
评论