基于回归模型和改进k -means++聚类算法的玻璃组分分析与识别

Rifen Lin, Xuanye Tian, Gang Chen, Xuanran Wang
{"title":"基于回归模型和改进k -means++聚类算法的玻璃组分分析与识别","authors":"Rifen Lin, Xuanye Tian, Gang Chen, Xuanran Wang","doi":"10.1145/3603781.3603893","DOIUrl":null,"url":null,"abstract":"This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.","PeriodicalId":391180,"journal":{"name":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis and identification of glass components based on regression models and improved K-means++ clustering algorithms\",\"authors\":\"Rifen Lin, Xuanye Tian, Gang Chen, Xuanran Wang\",\"doi\":\"10.1145/3603781.3603893\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.\",\"PeriodicalId\":391180,\"journal\":{\"name\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3603781.3603893\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603781.3603893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种基于机器学习的古代玻璃制品类型识别和分析方法,旨在提高研究此类物品的效率。该方法采用偏最小二乘回归模型和改进的k -means++聚类算法。为了预测风化探测点风化前的化学成分,本文构建了基于卡方检验和方差分析的偏最小二乘回归模型。采用自适应lasso回归模型分析了不同类别玻璃制品化学成分之间的相关性。建立随机森林分类模型,分析高钾玻璃和铅钡玻璃的分类模式,并对其化学成分进行特征筛选。采用基于随机森林贝叶斯参数优化的逐步预测模型对未知类别的玻璃制品进行化学成分分析和类型识别。为了改进k -means++算法,本文建立了基于加权距离的k -means++聚类模型,对两类玻璃分别进行分类。方法确定,对于高钾玻璃,SiO2和K2O曲线的拟合优度系数R分别为0.92和0.92。对于铅钡玻璃,PbO和BaO的拟合优度系数R分别为0.91和0.82,均处于较高水平。模型拟合效果良好,两类玻璃的最优聚类数K=3,模型分类合理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis and identification of glass components based on regression models and improved K-means++ clustering algorithms
This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信