
DnoisE: Distance denoising by Entropy. An open-source parallelizable alternative for denoising sequence datasets

Source: bioRxiv
Abstract

DNA metabarcoding is broadly used in biodiversity studies encompassing a wide range of organisms. Erroneous amplicons are generated during amplification and sequencing procedures and constitute one of the major sources of concern for the interpretation of metabarcoding results. Several denoising programs have been implemented to detect and eliminate these errors. However, almost all denoising software currently available has been designed to process non-coding ribosomal sequences, most notably prokaryotic 16S rDNA. The growing number of metabarcoding studies using coding markers such as COI or RuBisCO demands a re-assessment and calibration of denoising algorithms. Here we present DnoisE, the first denoising program designed to detect erroneous reads and merge them with the correct ones using information from the natural variability (entropy) associated with each codon position in coding barcodes. We have developed open-source software using a modified version of the UNOISE3 algorithm. DnoisE implements different merging procedures as options, and can incorporate codon entropy information either retrieved from the data or supplied by the user. In addition, the algorithm of DnoisE is parallelizable, greatly reducing run times on computer clusters. Our program also accepts different input file formats, so it can be readily incorporated into existing metabarcoding pipelines.
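As an illustration of what "entropy associated with each codon position" can mean in practice, the following is a minimal Python sketch, not DnoisE's own code. It assumes the reads are already aligned, of equal length, and in a known reading frame, computes the Shannon entropy of every alignment column, and averages those values separately for codon positions 1, 2 and 3. The function names `column_entropy` and `codon_position_entropy` and the `frame` parameter are hypothetical, and read abundances are ignored for simplicity.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of the bases observed in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def codon_position_entropy(seqs, frame=0):
    """Mean column entropy for codon positions 1, 2 and 3.

    seqs  -- aligned, equal-length coding sequences (assumed input)
    frame -- index of the first base that sits at codon position 1 (assumed)
    """
    per_position = [[], [], []]
    # zip(*seqs) walks the alignment column by column.
    for i, column in enumerate(zip(*seqs)):
        if i < frame:
            continue  # bases before the first complete codon are skipped
        per_position[(i - frame) % 3].append(column_entropy(column))
    # Average the column entropies within each codon position.
    return [sum(v) / len(v) if v else 0.0 for v in per_position]


if __name__ == "__main__":
    # Toy alignment: variation occurs only at third codon positions.
    reads = ["AAAAAA", "AAAAAC", "AAGAAC", "AATAAC"]
    print(codon_position_entropy(reads))
```

In coding markers such as COI, third codon positions typically vary more than first and second positions, so a per-position entropy of this kind can be used to weight sequence differences by where they occur in the codon.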

Palacín Creu, Turon Xavier, Wangensteen Owen S., Antich Adrià

Department of Evolutionary Biology, Ecology and Environmental Sciences, University of Barcelona and Research Institute of Biodiversity (IRBIO); Department of Marine Ecology, Centre for Advanced Studies of Blanes (CEAB-CSIC); Norwegian College of Fishery Science, UiT The Arctic University of Norway; Department of Marine Ecology, Centre for Advanced Studies of Blanes (CEAB-CSIC)

10.1101/2021.07.07.451520

Subjects: biological science research methods and techniques; molecular biology; microbiology

Palacín Creu, Turon Xavier, Wangensteen Owen S., Antich Adrià. DnoisE: Distance denoising by Entropy. An open-source parallelizable alternative for denoising sequence datasets[EB/OL]. (2025-03-28)[2025-05-04]. https://www.biorxiv.org/content/10.1101/2021.07.07.451520.
