Algebraic invariants for inferring 4-leaf semi-directed phylogenetic networks
Algebraic invariants for inferring 4-leaf semi-directed phylogenetic networks
A core goal of phylogenomics is to determine the evolutionary history of a set of species from biological sequence data. Phylogenetic networks are able to describe more complex evolutionary phenomena than phylogenetic trees but are more difficult to accurately reconstruct. Recently, there has been growing interest in developing methods to infer semi-directed phylogenetic networks. As computing such networks can be computationally intensive, one approach to building such networks is to puzzle together smaller networks. Thus, it is essential to have robust methods for inferring semi-directed phylogenetic networks on small numbers of taxa. In this paper, we investigate an algebraic method for performing phylogenetic network inference from nucleotide sequence data on 4-leaved semi-directed phylogenetic networks by analysing the distribution of leaf-pattern probabilities. On simulated data, we found that we can correctly identify with high accuracy semi-directed networks as sequences approach 10Mbp in length, and that we are able to use our approach to identify tree-like evolution and determine the underlying tree. We also applied our approach to published transcriptome data from swordtail fish to compare its performance with a pseudolikelihood method for inferring semi-directed networks.
Martin Samuel、Moulton Vincent、Leggett Richard Mark
生物科学理论、生物科学方法生物科学研究方法、生物科学研究技术分子生物学
Martin Samuel,Moulton Vincent,Leggett Richard Mark.Algebraic invariants for inferring 4-leaf semi-directed phylogenetic networks[EB/OL].(2025-03-28)[2025-05-09].https://www.biorxiv.org/content/10.1101/2023.09.11.557152.点此复制
评论