Multithreaded variant calling in elPrep 5
Abstract: We present elPrep 5, which extends the elPrep framework for processing sequence alignment/map files with variant calling. elPrep 5 can now execute the full pipeline described by the GATK Best Practices for variant calling, which consists of PCR and optical duplicate marking, sorting by coordinate order, base quality score recalibration, and variant calling using the haplotype caller algorithm. elPrep 5 produces BAM and VCF output identical to that of GATK 4 while significantly reducing the runtime by parallelizing and merging the execution of the pipeline steps. Our benchmarks show that elPrep 5 speeds up the variant calling pipeline by a factor of 8 to 16 on both whole-exome and whole-genome data while using the same hardware resources as GATK 4. This makes elPrep 5 a suitable drop-in replacement for GATK 4 when faster execution times are needed.
Fostier Jan, Herzeel Charlotte, Decap Dries, Costanza Pascal, Verachtert Wilfried, Wuyts Roel
ExaScience Lab || Department of Information Technology, Ghent University - imec; ExaScience Lab; ExaScience Lab || Department of Information Technology, Ghent University - imec; ExaScience Lab; ExaScience Lab; ExaScience Lab
Biological science research methods, biological science research techniques; computing technology, computer technology
Fostier Jan, Herzeel Charlotte, Decap Dries, Costanza Pascal, Verachtert Wilfried, Wuyts Roel. Multithreaded variant calling in elPrep 5 [EB/OL]. (2025-03-28) [2025-08-02]. https://www.biorxiv.org/content/10.1101/2020.12.11.421073.