Revising transcriptome assemblies with phylogenetic information in Agalma1.0
Revising transcriptome assemblies with phylogenetic information in Agalma1.0
Abstract MotivationOne of the most common transcriptome assembly errors is to mistake different transcripts of the same gene as transcripts from multiple closely related genes. It is difficult to identify these errors during assembly, but in a phylogenetic analysis these errors can be diagnosed from gene trees containing clades of tips from the same species with improbably short branch lengths. Resultstreeinform is a module implemented in Agalma1.0 that uses phylogenetic analyses across species to refine transcriptome assemblies. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassign them as transcripts of the same gene. Availability and Implementationtreeinform is implemented in Agalma1.0, available at https://bitbucket.org/caseywdunn/agalma. Contactaugust_guang@brown.edu Supplementary informationSupplementary information is available at bioRxiv.
Guang August、Zapata Felipe、Lawrence Charles、Dunn Casey、Howison Mark
Division of Applied Mathematics, Brown UniversityDepartment of Ecology & Evolutionary Biology, University of California-Los AngelesDivision of Applied Mathematics, Brown UniversityDepartment of Ecology & Evolutionary Biology, Yale UniversityComputing & Information Services, Brown University
分子生物学遗传学生物科学研究方法、生物科学研究技术
Guang August,Zapata Felipe,Lawrence Charles,Dunn Casey,Howison Mark.Revising transcriptome assemblies with phylogenetic information in Agalma1.0[EB/OL].(2025-03-28)[2025-06-15].https://www.biorxiv.org/content/10.1101/202416.点此复制
评论