|国家预印本平台
首页|An Approach for Open Multivariate Analysis of Integrated Clinical and Environmental Exposures Data

An Approach for Open Multivariate Analysis of Integrated Clinical and Environmental Exposures Data

An Approach for Open Multivariate Analysis of Integrated Clinical and Environmental Exposures Data

来源:medRxiv_logomedRxiv
英文摘要

ABSTRACT The Integrated Clinical and Environmental Exposures Service (ICEES) provides regulatory-compliant open access to sensitive patient data that have been integrated with public exposures data. ICEES was designed initially to support dynamic cohort creation and bivariate contingency tests. The objective of the present study was to develop an open approach to support multivariate analyses using existing ICEES functionalities and abiding by all regulatory constraints. We first developed an open approach for generating a multivariate table that maintains contingencies between clinical and environmental variables using programmatic calls to the open ICEES application programming interface. We then applied the approach to data on a large cohort (N = 22,365) of patients with asthma or related conditions and generated an eight-feature table. Due to regulatory constraints, data loss was incurred with the incorporation of each successive feature variable, from a starting sample size of N = 22,365 to a final sample size of N = 4,556 (20.5%), but data loss was < 10% until the addition of the final two feature variables. We then applied a generalized linear model to the subsequent dataset and focused on the impact of seven select feature variables on asthma exacerbations, defined as annual emergency department or inpatient visits for respiratory issues. We identified five feature variables—sex, race, obesity, prednisone, and airborne particulate exposure—as significant predictors of asthma exacerbations. We discuss the advantages and disadvantages of ICEES open multivariate analysis and conclude that, despite limitations, ICEES can provide a valuable resource for open multivariate analysis and can serve as an exemplar for regulatory-compliant informatics solutions to open patient data, with capabilities to explore the impact of environmental exposures on health outcomes.

Fecho Karamarie、Krishnamurthy Ashok、Schmitt Patrick L.、Sinha Meghamala、Ramsey Stephen A.、Lan Bo、Haaland Perry、Sharma Priya、Xu Hao

Renaissance Computing Institute, University of North Carolina at Chapel HillRenaissance Computing Institute, University of North Carolina at Chapel Hill||Department of Computer Science, University of North CarolinaRenaissance Computing Institute, University of North Carolina at Chapel HillOregon State UniversityOregon State UniversityUNC Highway Safety Research Center, University of North Carolina at Chapel HillDepartment of Statistics and Operations Research, University of North Carolina at Chapel HillRenaissance Computing Institute, University of North Carolina at Chapel HillRenaissance Computing Institute, University of North Carolina at Chapel Hill

10.1101/2021.06.30.21259727

医学研究方法环境科学理论环境生物学

open scienceopen clinical datageneralized linear modelasthma

Fecho Karamarie,Krishnamurthy Ashok,Schmitt Patrick L.,Sinha Meghamala,Ramsey Stephen A.,Lan Bo,Haaland Perry,Sharma Priya,Xu Hao.An Approach for Open Multivariate Analysis of Integrated Clinical and Environmental Exposures Data[EB/OL].(2025-03-28)[2025-07-01].https://www.medrxiv.org/content/10.1101/2021.06.30.21259727.点此复制

评论