|国家预印本平台
首页|Optimizing genomics pipeline execution with integer linear programming

Optimizing genomics pipeline execution with integer linear programming

Optimizing genomics pipeline execution with integer linear programming

来源:bioRxiv_logobioRxiv
英文摘要

In the field of genomics, bioinformatics pipelines play a crucial role in processing and analyzing vast biological datasets. These pipelines, consisting of interconnected tasks, can be optimized for efficiency and scalability by leveraging cloud platforms such as Microsoft Azure. The choice of compute resources introduces a trade-off between cost and time. This paper introduces an approach that uses Linear Programming (LP) to optimize pipeline execution. We consider optimizing two competing cases: minimizing cost with a run duration restriction and minimizing duration with a cost restriction. Our results showcase the utility of using LP in guiding researchers to make informed compute decisions based on specific data sets, cost and time requirements, and resource constraints.

Melnichenko Olesya、Malladi Venkat

10.1101/2024.02.06.579197

生物科学研究方法、生物科学研究技术计算技术、计算机技术

Melnichenko Olesya,Malladi Venkat.Optimizing genomics pipeline execution with integer linear programming[EB/OL].(2025-03-28)[2025-05-05].https://www.biorxiv.org/content/10.1101/2024.02.06.579197.点此复制

评论