|国家预印本平台
首页|vHULK, a new tool for bacteriophage host prediction based on annotated genomic features and deep neural networks

vHULK, a new tool for bacteriophage host prediction based on annotated genomic features and deep neural networks

vHULK, a new tool for bacteriophage host prediction based on annotated genomic features and deep neural networks

来源:bioRxiv_logobioRxiv
英文摘要

ABSTRACT The experimental determination of a bacteriophage host is a laborious procedure. For this reason, there is a pressing need for reliable computational predictions of bacteriophage hosts in phage research in general and in phage therapy in particular. Here, we present a new program called vHULK for phage host prediction based on 9,504 phage genome features. These features take into account alignment significance scores between predicted-protein sequences in the phage genomes and a curated database of viral protein families. The features were fed to a deep neural network, and four distinct models were trained to predict 61 different host genera and 52 host species. In random controlled test sets, the program obtained 99% and 98% accuracy values at the genus and species levels, respectively. On a validation dataset with 2,178 phage genomes, mean accuracies were 82% and 52% at the genus and species levels, respectively. When compared against other phage host prediction programs on the same validation dataset, vHULK achieved substantially better performance, therefore demonstrating that the program is an advance on the state-of-art in phage host prediction. vHULK is freely available at https://github.com/LaboratorioBioinformatica/vHULK.

Iha Bruno Koshin V¨¢zquez、da Silva Aline Maria、Amgarten Deyvid、Piroupo Carlos Morais、Setubal Jo?o Carlos

Departamento de Bioqu¨amica, Instituto de Qu¨amica, Universidade de S?o PauloDepartamento de Bioqu¨amica, Instituto de Qu¨amica, Universidade de S?o PauloDepartamento de Bioqu¨amica, Instituto de Qu¨amica, Universidade de S?o PauloDepartamento de Bioqu¨amica, Instituto de Qu¨amica, Universidade de S?o PauloDepartamento de Bioqu¨amica, Instituto de Qu¨amica, Universidade de S?o Paulo

10.1101/2020.12.06.413476

微生物学生物科学研究方法、生物科学研究技术分子生物学

Iha Bruno Koshin V¨¢zquez,da Silva Aline Maria,Amgarten Deyvid,Piroupo Carlos Morais,Setubal Jo?o Carlos.vHULK, a new tool for bacteriophage host prediction based on annotated genomic features and deep neural networks[EB/OL].(2025-03-28)[2025-05-02].https://www.biorxiv.org/content/10.1101/2020.12.06.413476.点此复制

评论