
Latent-Based Imputation of Laboratory Measures from Electronic Health Records: Case for Complex Diseases


Source: bioRxiv
Abstract

Imputation is a key step in mining Electronic Health Records (EHR), as it can significantly affect the conclusions drawn from downstream analysis. Three main categories explain missingness in clinical settings: incompleteness, inconsistency, and inaccuracy. These capture a variety of situations, for example the patient did not seek treatment or the health care provider did not enter the information. We used EHR data from patients diagnosed with Inflammatory Bowel Disease (IBD) from Geisinger Health System to design a novel imputation method that focuses on a complex phenotype. Our approach is based on latent-based analysis integrated with clustering to group patients by their comorbidities before imputation. IBD is a chronic illness of unclear etiology and without a complete cure. We took advantage of the complexity of IBD to pre-process the EHR data of 10,498 IBD patients and show that imputation can be improved using shared latent comorbidities. The R code and sample simulated input data will be made available at a future time.
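The abstract does not give implementation details, so the following is only a minimal sketch of the general idea (group patients by latent comorbidity structure, then impute within each group). It assumes a binary patient-by-comorbidity matrix, a truncated SVD for the latent representation, plain k-means for clustering, and within-cluster mean imputation of laboratory values. All function names and these specific choices are hypothetical illustrations, not the authors' R code or their actual pipeline.

```python
import numpy as np


def latent_factors(comorbidity, n_components):
    """Patient coordinates in the top latent dimensions of the
    column-centered comorbidity matrix (truncated SVD)."""
    centered = comorbidity - comorbidity.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per row.
    Assumption: any clustering method could be substituted here."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Keep the old center when a cluster ends up empty.
        new_centers = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels


def impute_within_clusters(labs, labels):
    """Replace NaN lab values with the mean of the same lab in the
    patient's cluster; fall back to the global mean when the whole
    cluster is missing that lab. Assumes every lab (column) has at
    least one observed value overall."""
    imputed = labs.copy()
    global_means = np.nanmean(labs, axis=0)
    for j in np.unique(labels):
        rows = labels == j
        block = labs[rows]
        observed = ~np.isnan(block)
        counts = observed.sum(axis=0)
        sums = np.where(observed, block, 0.0).sum(axis=0)
        cluster_means = np.where(counts > 0, sums / np.maximum(counts, 1), global_means)
        imputed[rows] = np.where(observed, block, cluster_means)
    return imputed


# Toy usage with simulated data (not EHR data).
comorbidity = np.repeat(np.array([[1.0, 1.0, 0.0, 0.0],
                                  [0.0, 0.0, 1.0, 1.0]]), 5, axis=0)
labs = np.arange(20, dtype=float).reshape(10, 2)
labs[::3, 0] = np.nan  # knock out some values in the first lab
labels = kmeans(latent_factors(comorbidity, n_components=2), k=2)
completed = impute_within_clusters(labs, labels)
```

Within-cluster means are used here only because they are the simplest way to show how a grouping step can feed an imputation step; the number of latent components and clusters would need to be chosen for the data at hand.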

Lu P., Ahuja M., Bassaganya-Riera J., Abedi V., Ulloa A.E., Shivakumar M.K., Hontecillas R., Leber A., Shellenberger M.J.

Affiliations: Biotherapeutics; Biomedical and Translational Informatics Institute; Nutritional Immunology and Molecular Medicine Laboratory; Department of Gastroenterology and Hepatology

DOI: 10.1101/275743

Subjects: Medical research methods; Basic medicine; Clinical medicine

Keywords: Imputation; SVD; clustering; Electronic Health Records; EHR; Inflammatory Bowel Disease; IBD; complex diseases; missing data

Lu P., Ahuja M., Bassaganya-Riera J., Abedi V., Ulloa A.E., Shivakumar M.K., Hontecillas R., Leber A., Shellenberger M.J. Latent-Based Imputation of Laboratory Measures from Electronic Health Records: Case for Complex Diseases [EB/OL]. (2025-03-28) [2025-05-07]. https://www.biorxiv.org/content/10.1101/275743.
