|国家预印本平台
首页|Multicollinearity Resolution Based on Machine Learning: A Case Study of Carbon Emissions

Multicollinearity Resolution Based on Machine Learning: A Case Study of Carbon Emissions

Multicollinearity Resolution Based on Machine Learning: A Case Study of Carbon Emissions

来源:Arxiv_logoArxiv
英文摘要

This study proposes an analytical framework that integrates DBSCAN clustering with the Elastic Net regression model to address multifactorial problems characterized by structural complexity and multicollinearity, exemplified by carbon emissions analysis. DBSCAN is employed for unsupervised learning to objectively cluster features, while the Elastic Net is utilized for high-dimensional feature selection and complexity control. The Elastic Net is specifically chosen for its ability to balance feature selection and regularization by combining L1 (lasso) and L2 (ridge) penalties, making it particularly suited for datasets with correlated predictors. Applying this framework to energy consumption data from 46 industries in China (2000-2019) resulted in the identification of 16 categories. Emission characteristics and drivers were quantitatively assessed for each category, demonstrating the framework's capacity to identify primary emission sources and provide actionable insights. This research underscores the global applicability of the framework for analyzing complex regional challenges, such as carbon emissions, and highlights qualitative features that humans find meaningful may not be accurate for the model.

Xuanming Zhang

计算技术、计算机技术环境污染、环境污染防治环境科学理论

Xuanming Zhang.Multicollinearity Resolution Based on Machine Learning: A Case Study of Carbon Emissions[EB/OL].(2025-06-25)[2025-07-16].https://arxiv.org/abs/2507.02912.点此复制

评论