Latent-Based Imputation of Laboratory Measures from Electronic Health Records: Case for Complex Diseases
Latent-Based Imputation of Laboratory Measures from Electronic Health Records: Case for Complex Diseases
Abstract Imputation is a key step in Electronic Health Records-mining as it can significantly affect the conclusions derived from the downstream analysis. There are three main categories that explain the missingness in clinical settings–incompleteness, inconsistency, and inaccuracy–and these can capture a variety of situations: the patient did not seek treatment, the health care provider did not enter the information, etc. We used EHR data from patients diagnosed with Inflammatory Bowel Disease from Geisinger Health System to design a novel imputation that focuses on a complex phenotype. Our approach is based on latent-based analysis integrated with clustering to group patients based on their comorbidities before imputation. IBD is a chronic illness of unclear etiology and without a complete cure. We have taken advantage of the complexity of IBD to pre-process the EHR data of 10,498 IBD patients and show that imputation can be improved using shared latent comorbidities. The R code and sample simulated input data will be available at a future time.
Lu P.、Ahuja M.、Bassaganya-Riera J.、Abedi V.、Ulloa A.E.、Shivakumar M.K.、Hontecillas R.、Leber A.、Shellenberger M.J.
BiotherapeuticsBiomedical and Translational Informatics InstituteNutritional Immunology and Molecular Medicine Laboratory||BiotherapeuticsBiomedical and Translational Informatics Institute||Nutritional Immunology and Molecular Medicine LaboratoryBiomedical and Translational Informatics InstituteBiomedical and Translational Informatics InstituteNutritional Immunology and Molecular Medicine Laboratory||BiotherapeuticsBiotherapeuticsDepartment of Gastroenterology and Hepatology
医学研究方法基础医学临床医学
ImputationSVDclusteringElectronic Health RecordsEHRInflammatory Bowel DiseaseIBDcomplex diseasesmissing data
Lu P.,Ahuja M.,Bassaganya-Riera J.,Abedi V.,Ulloa A.E.,Shivakumar M.K.,Hontecillas R.,Leber A.,Shellenberger M.J..Latent-Based Imputation of Laboratory Measures from Electronic Health Records: Case for Complex Diseases[EB/OL].(2025-03-28)[2025-05-07].https://www.biorxiv.org/content/10.1101/275743.点此复制
评论