PyMethylProcess - highly parallelized preprocessing for DNA methylation array data
PyMethylProcess - highly parallelized preprocessing for DNA methylation array data
Abstract SummaryThe ability to perform high-throughput preprocessing of methylation array data is essential in large scale methylation studies. While R is a convenient language for methylation analyses, performing highly parallelized preprocessing using Python can accelerate data preparation for downstream methylation analyses, including large scale production-ready machine learning pipelines. Here, we present a methylation data preprocessing pipeline called PyMethylProcess that is highly reproducible, scalable, and that can be quickly set-up and deployed through Docker and PIP. Availability and ImplementationProject Name: PyMethylProcessProject Home Page: https://github.com/Christensen-Lab-Dartmouth/PyMethylProcess. Available on PyPI as pymethylprocess.Available on DockerHub via joshualevy44/pymethylprocess.Help Documentation: https://christensen-lab-dartmouth.github.io/PyMethylProcess/Operating Systems: Linux, MacOS, Windows (Docker)Programming Language: Python, ROther Requirements: Python 3.6, R 3.5.1, Docker (optional) License: MIT Contactjoshua.j.levy.gr@dartmouth.edu
Levy Joshua J.、Titus Alexander J.、Salas Lucas A.、Christensen Brock C.
Program in Quantitative Biomedical Sciences, Geisel School of Medicine at DartmouthDepartment of Epidemiology, Geisel School of Medicine at DartmouthDepartment of Epidemiology, Geisel School of Medicine at DartmouthDepartment of Epidemiology, Geisel School of Medicine at Dartmouth||Department of Epidemiology and Department of Molecular and Systems Biology
生物工程学计算技术、计算机技术生物科学研究方法、生物科学研究技术
Levy Joshua J.,Titus Alexander J.,Salas Lucas A.,Christensen Brock C..PyMethylProcess - highly parallelized preprocessing for DNA methylation array data[EB/OL].(2025-03-28)[2025-08-02].https://www.biorxiv.org/content/10.1101/604496.点此复制
评论