|国家预印本平台
首页|Efficiently processing amplicon sequencing data for microbial ecology with dadasnake, a DADA2 implementation in Snakemake

Efficiently processing amplicon sequencing data for microbial ecology with dadasnake, a DADA2 implementation in Snakemake

Efficiently processing amplicon sequencing data for microbial ecology with dadasnake, a DADA2 implementation in Snakemake

来源:bioRxiv_logobioRxiv
英文摘要

Abstract BackgroundAmplicon sequencing of phylogenetic marker genes, e.g. 16S, 18S or ITS rRNA sequences, is still the most commonly used method to estimate the structure of microbial communities. Microbial ecologists often have expert knowledge on their biological question and data analysis in general, and most research institutes have computational infrastructures to employ the bioinformatics command line tools and workflows for amplicon sequencing analysis, but requirements of bioinformatics skills often limit the efficient and up-to-date use of computational resources. Resultsdadasnake wraps pre-processing of sequencing reads, delineation of exact sequencing variants using the favorably benchmarked, widely-used the DADA2 algorithm, taxonomic classification and post-processing of the resultant tables, and hand-off in standard formats, into a user-friendly, one-command Snakemake pipeline. The suitability of the provided default configurations is demonstrated using mock-community data from bacteria and archaea, as well as fungi. ConclusionsBy use of Snakemake, dadasnake makes efficient use of high-performance computing infrastructures. Easy user configuration guarantees flexibility of all steps, including the processing of data from multiple sequencing platforms. dadasnake facilitates easy installation via conda environments. dadasnake is available at https://github.com/a-h-b/dadasnake.

Wei?becker Christina、Schnabel Beatrix、Heintz-Buschart Anna

Helmholtz Centre for Environmental Research GmbH - UFZ, Department of Soil EcologyHelmholtz Centre for Environmental Research GmbH - UFZ, Department of Soil EcologyHelmholtz Centre for Environmental Research GmbH - UFZ, Department of Soil Ecology||German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Bioinformatics Unit

10.1101/2020.05.17.095679

微生物学计算技术、计算机技术生物科学研究方法、生物科学研究技术

rRNA gene sequence analysisdenoisingexact sequence variantsRpipelinemicrobiomecommunity structure

Wei?becker Christina,Schnabel Beatrix,Heintz-Buschart Anna.Efficiently processing amplicon sequencing data for microbial ecology with dadasnake, a DADA2 implementation in Snakemake[EB/OL].(2025-03-28)[2025-05-01].https://www.biorxiv.org/content/10.1101/2020.05.17.095679.点此复制

评论