|国家预印本平台
首页|Multithreaded variant calling in elPrep 5

Multithreaded variant calling in elPrep 5

Multithreaded variant calling in elPrep 5

来源:bioRxiv_logobioRxiv
英文摘要

Abstract We present elPrep 5, which updates the elPrep framework for processing sequencing alignment/map files with variant calling. elPrep 5 can now execute the full pipeline described by the GATK Best Practices for variant calling, which consists of PCR and optical duplicate marking, sorting by coordinate order, base quality score recalibration, and variant calling using the haplotype caller algorithm. elPrep 5 produces identical BAM and VCF output as GATK4 while significantly reducing the runtime by parallelizing and merging the execution of the pipeline steps. Our benchmarks show that elPrep 5 speeds up the runtime of the variant calling pipeline by a factor 8-16x on both whole-exome and whole-genome data while using the same hardware resources as GATK 4. This makes elPrep 5 a suitable drop-in replacement for GATK 4 when faster execution times are needed.

Fostier Jan、Herzeel Charlotte、Decap Dries、Costanza Pascal、Verachtert Wilfried、Wuyts Roel

ExaScience Lab||Department of Information Technology, Ghent University - imecExaScience LabExaScience Lab||Department of Information Technology, Ghent University - imecExaScience LabExaScience LabExaScience Lab

10.1101/2020.12.11.421073

生物科学研究方法、生物科学研究技术计算技术、计算机技术

Fostier Jan,Herzeel Charlotte,Decap Dries,Costanza Pascal,Verachtert Wilfried,Wuyts Roel.Multithreaded variant calling in elPrep 5[EB/OL].(2025-03-28)[2025-08-02].https://www.biorxiv.org/content/10.1101/2020.12.11.421073.点此复制

评论