基于K-Means聚类的玻璃风化成分分析与鉴别
nalysis and Identification of Weathering Components of Glass Based on K-Means Clustering
玻璃文物易受环境影响而风化,从而造成其内部化学成分变化影响类别判断。本文通过对错误数据进行剔除及缺失值填补,利用控制变量法以是否风化作为固定值,观察与其他三个变量之间的关系。利用卡方检验对已知定类变量进行差异性分析,根据显著性P判断玻璃颜色类型与表面风化是否存在差异性。通过已知化学成分数据选取高钾与铅钡类风化前后具有代表性化学元素统计指标绘制风化前后变化图,观察数据波动情况。建立加权平均值预测模型,选用标准正态分布函数进行赋权,计算出风化前后不同种类玻璃的化学成分含量所占的比例的线性映射关系,最终预测出风化前的化学成分含量。在此基础上基于标准差建立亚类划分模型,利用K-Means算法对不同类型玻璃是否风化的4种类别选择合适指标,并依据显著性P值选取合适指标进行亚类划分,将其分为3个亚类。最后进行灵敏度分析,对数据进行扰动处理,一定程度上反映了模型具有良好的鲁棒性以及普适性。
Glass cultural relics are easily weathered by the environment, resulting in the change of their internal chemical composition, which affects the category judgment. In this paper, by eliminating the wrong data and filling in the missing values, the control variable method is used to observe the relationship with the other three variables, taking whether weathering is a fixed value. Chi-square test was used to analyze the difference of known categorical variables, and the significance P was used to judge whether there was difference between glass color type and surface weathering. Through the known chemical composition data, the representative chemical elements of high potassium and lead-barium before and after weathering were selected to draw the change map before and after weathering, and the fluctuation of data was observed. The weighted average prediction model was established, and the standard normal distribution function was used for weighting. The linear mapping relationship between the chemical composition content of different kinds of glass before and after weathering was calculated, and the chemical composition content before and after weathering was predicted. On this basis, a subclass division model was established based on the standard deviation, and the K-Means algorithm was used to select appropriate indicators for the four categories of whether different types of glass are weathered, and the appropriate indicators were selected according to the significance P-value for subclass division, which was divided into three subclasses. Finally, the sensitivity analysis is carried out and the data is disturbed, which reflects the good robustness and universality of the model in this paper to a certain extent.
刘宇、王可欣
化学环境科学理论科学、科学研究
玻璃成分判别卡方检验系统聚类数据扰动K-Means聚类
Glass composition discriminationChi-square testSystematic clusteringData perturbationK - Means clustering
刘宇,王可欣.基于K-Means聚类的玻璃风化成分分析与鉴别[EB/OL].(2022-10-28)[2025-05-28].http://www.paper.edu.cn/releasepaper/content/202210-5.点此复制
评论