tombRaider - improved species and haplotype recovery from metabarcoding data through artefact and pseudogene exclusion.

Source: bioRxiv
Abstract

Environmental DNA metabarcoding has revolutionized ecological surveys of natural systems. By amplifying and sequencing small gene fragments from environmental samples containing complex DNA mixtures, scientists are now capable of exploring biodiversity patterns across the tree of life in a time-efficient and cost-effective manner. However, the accuracy of species and haplotype identification can be compromised by sequence artefacts and pseudogenes. Despite various strategies developed over the years, effective removal of artefacts remains challenging, and inconsistent data reporting standards hinder reproducibility in eDNA metabarcoding experiments. To address these issues, we introduce tombRaider, an open-source command-line software program (https://github.com/gjeunen/tombRaider) and R package (https://github.com/gjeunen/tombRaider_R) to remove artefacts and pseudogenes from metabarcoding data after clustering and denoising. tombRaider features a modular algorithm capable of evaluating multiple criteria, including sequence similarity, co-occurrence patterns, taxonomic assignment, and the presence of stop codons. We validated tombRaider using various published data sets, including mock invertebrate communities, air eDNA from a zoo, and salmon haplotypes from aquatic eDNA. Our results demonstrate that tombRaider effectively removed a higher proportion of artefacts while retaining authentic sequences, thus enhancing the accuracy and reliability of eDNA-derived diversity metrics. This user-friendly software program not only improves data quality in eDNA metabarcoding studies, but also contributes to standardised reporting practices, an aspect currently lacking in this emerging research field.
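
As an illustration of one criterion named in the abstract, the presence of stop codons, the following is a minimal Python sketch of how a sequence might be screened for an open reading frame. It is not tombRaider's implementation: the function names `frames_without_stops` and `flag_possible_pseudogene` are hypothetical, only the three forward frames are checked, and the default stop-codon set assumes the standard genetic code. Mitochondrial markers would need a different set (for example, only TAA and TAG under the invertebrate mitochondrial code).

```python
# Illustrative sketch only; names and defaults are assumptions,
# not part of the tombRaider software.

# Stop codons of the standard genetic code (assumed default).
STANDARD_STOPS = frozenset({"TAA", "TAG", "TGA"})


def frames_without_stops(seq, stop_codons=STANDARD_STOPS):
    """Return the forward reading frames (0, 1, 2) that contain no stop codon."""
    seq = seq.upper()
    open_frames = []
    for frame in range(3):
        # Walk the sequence codon by codon, ignoring a trailing partial codon.
        codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
        if not any(codon in stop_codons for codon in codons):
            open_frames.append(frame)
    return open_frames


def flag_possible_pseudogene(seq, stop_codons=STANDARD_STOPS):
    """Flag a sequence when every forward frame is interrupted by a stop codon."""
    return not frames_without_stops(seq, stop_codons)
```

A sequence flagged this way would only be a candidate for exclusion; the abstract describes stop codons as one of several criteria evaluated together with sequence similarity, co-occurrence patterns, and taxonomic assignment.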

Authors: Lamare Miles, Kroos Gracie C., Miller Allison K., Jeunen Gert-Jan, Torma Michal, Fernandes Kristen, Mauvisseau Quentin, Gemmell Neil, Dowle Eddy

DOI: 10.1101/2024.08.23.609468

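Subjects: Current status of environmental science and technology; Current status and development of biological sciences; Environmental biology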

Lamare Miles, Kroos Gracie C., Miller Allison K., Jeunen Gert-Jan, Torma Michal, Fernandes Kristen, Mauvisseau Quentin, Gemmell Neil, Dowle Eddy. tombRaider - improved species and haplotype recovery from metabarcoding data through artefact and pseudogene exclusion. [EB/OL]. (2025-03-28) [2025-05-28]. https://www.biorxiv.org/content/10.1101/2024.08.23.609468.