
ProbAlign: a re-alignment method for long sequencing reads

Source: bioRxiv
Abstract

Incorrect alignments are a severe problem in variant calling and remain a challenging computational issue in bioinformatics. Although some methods use re-alignment to tackle misalignments, a standalone re-alignment tool for long sequencing reads is lacking. Hence, we present ProbAlign, a standalone tool for correcting misalignments. It can be integrated into pipelines not only for variant calling but also for other genomic applications. We demonstrate the use of re-alignment in two diverse and important areas of genomics: variant calling and viral quasispecies reconstruction. First, variant calling results on Pacific Biosciences SMRT re-sequencing data of NA12878 show that, after re-alignment, false positives can be reduced by 43.5% and true positives increased by 24.8% on average. Second, results from reconstructing a 5-virus mix show that, after re-alignment, the viral population can be completely unraveled and the estimation of quasispecies frequencies is improved. ProbAlign is freely available in the PyroTools toolkit (https://github.com/homopolymer/PyroTools).

Zeng Feng, Chen Ting, Ji Guoli, Jiang Rui

Xiamen University; Tsinghua University / University of Southern California; Xiamen University; Tsinghua University

DOI: 10.1101/008698

Biological science research methods and techniques; genetics; molecular biology

Zeng Feng, Chen Ting, Ji Guoli, Jiang Rui. ProbAlign: a re-alignment method for long sequencing reads [EB/OL]. (2025-03-28) [2025-04-27]. https://www.biorxiv.org/content/10.1101/008698.
