Predicting drug resistance in M. tuberculosis using a Long-term Recurrent Convolutional Network
ABSTRACT

Motivation: Drug resistance in Mycobacterium tuberculosis (MTB) is a growing threat to human health worldwide. One way to mitigate the risk of drug resistance is to enable clinicians to prescribe the right antibiotic drugs to each patient through methods that predict drug resistance in MTB from whole-genome sequencing (WGS) data. Existing machine learning methods for this task typically convert the WGS data from a given bacterial isolate into features corresponding to single-nucleotide polymorphisms (SNPs) or short sequence segments of a fixed length K (K-mers). Here, we introduce a gene burden-based method for predicting drug resistance in MTB. We define one numerical feature per gene, equal to the number of mutations in that gene in a given isolate. This representation greatly reduces the number of model parameters. We further propose a Long-term Recurrent Convolutional Network (LRCN) architecture, which combines convolutional and recurrent layers to account for both gene order and locality structure.

Results: We find that these strategies yield a substantial, statistically significant improvement over state-of-the-art methods on a large dataset of M. tuberculosis isolates. We suggest that this improvement is driven by our method's ability to account for the order of the genes in the genome and their organization into operons.

Availability: The implementations of our feature preprocessing pipeline and our LRCN model are publicly available, as is our complete dataset.

Supplementary information: Additional data are available in the Supplementary Materials document.
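The gene burden representation described in the abstract can be illustrated with a minimal sketch. This is not the authors' preprocessing pipeline: the function name `gene_burden`, the toy gene coordinates, and the mutation positions are all hypothetical, and the sketch assumes non-overlapping genes given as inclusive intervals sorted by start position.

```python
from bisect import bisect_right


def gene_burden(genes, mutation_positions):
    """Count mutations per gene for one isolate.

    genes: list of (name, start, end) tuples, sorted by start, with
           inclusive coordinates and no overlaps (an assumption of
           this sketch, not a claim about the real MTB annotation).
    mutation_positions: iterable of genomic positions of mutations.
    Returns a list of counts, one per gene, in genome order.
    """
    starts = [start for _, start, _ in genes]
    counts = [0] * len(genes)
    for pos in mutation_positions:
        # Index of the last gene whose start is <= pos.
        i = bisect_right(starts, pos) - 1
        # Count the mutation only if it falls inside that gene;
        # intergenic positions are ignored in this sketch.
        if i >= 0 and pos <= genes[i][2]:
            counts[i] += 1
    return counts


# Hypothetical toy annotation and mutation calls for one isolate.
genes = [("geneA", 1, 100), ("geneB", 150, 300), ("geneC", 301, 450)]
mutations = [5, 99, 120, 150, 302, 310, 449, 500]

burden = gene_burden(genes, mutations)
print(dict(zip((name for name, _, _ in genes), burden)))
```

Because the counts are returned in genome order, the resulting vector preserves the gene ordering that a convolutional or recurrent layer could exploit, and it has one entry per gene rather than one per SNP or K-mer.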
Chindelevitch Leonid, Safari Amir Hosein, Forna Alpha, Sedaghat Nafiseh, Libbrecht Maxwell, Zabeti Hooman
School of Public Health, Imperial College London; School of Computing Science, Simon Fraser University; School of Computing Science, Simon Fraser University; School of Computing Science, Simon Fraser University; School of Computing Science, Simon Fraser University; School of Computing Science, Simon Fraser University
Medical research methods; Biological science research methods and techniques; Pharmacy
infectious disease; deep learning; antimicrobial resistance; tuberculosis; next-generation sequencing
Chindelevitch Leonid, Safari Amir Hosein, Forna Alpha, Sedaghat Nafiseh, Libbrecht Maxwell, Zabeti Hooman. Predicting drug resistance in M. tuberculosis using a Long-term Recurrent Convolutional Network [EB/OL]. (2025-03-28) [2025-08-02]. https://www.biorxiv.org/content/10.1101/2020.11.07.372136