|国家预印本平台
首页|Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge

Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge

Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge

来源:bioRxiv_logobioRxiv
英文摘要

ABSTRACT Precise identification of causative variants from whole-genome sequencing data, including both coding and non-coding variants, is challenging. The CAGI5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of 24 whole-genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state-of-the-art pipeline. The patients have a range of eye, neurological, and connective-tissue disorders. We used a gene-centric approach to address this problem, assigning each gene a multi-phenotype-matching score. Mutations in the top scoring genes for each phenotype profile were ranked on a six-point scale of pathogenicity probability, resulting in an approximately equal number of top ranked coding and non-coding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the post-submission phase, after careful screening of the genes in the correct genome we identified additional potential diagnostic variants, a high proportion of which are non-coding.

Kundu Kunal、Moult John、Yin Yizhou、Pal Lipika R.

Institute for Bioscience and Biotechnology Research, University of Maryland||Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of MarylandInstitute for Bioscience and Biotechnology Research, University of Maryland||Department of Cell Biology and Molecular Genetics, University of Maryland, College ParkInstitute for Bioscience and Biotechnology Research, University of MarylandInstitute for Bioscience and Biotechnology Research, University of Maryland

10.1101/707687

医学研究方法基础医学遗传学

Whole-genome sequencing dataHuman Phenotype Ontology (HPO)Eye disorderConnective-tissue disorderNeurological diseasesDiagnostic variantsCAGI5.

Kundu Kunal,Moult John,Yin Yizhou,Pal Lipika R..Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge[EB/OL].(2025-03-28)[2025-04-26].https://www.biorxiv.org/content/10.1101/707687.点此复制

评论