{"title":"基于回归模型和改进k -means++聚类算法的玻璃组分分析与识别","authors":"Rifen Lin, Xuanye Tian, Gang Chen, Xuanran Wang","doi":"10.1145/3603781.3603893","DOIUrl":null,"url":null,"abstract":"This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.","PeriodicalId":391180,"journal":{"name":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis and identification of glass components based on regression models and improved K-means++ clustering algorithms\",\"authors\":\"Rifen Lin, Xuanye Tian, Gang Chen, Xuanran Wang\",\"doi\":\"10.1145/3603781.3603893\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.\",\"PeriodicalId\":391180,\"journal\":{\"name\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3603781.3603893\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603781.3603893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis and identification of glass components based on regression models and improved K-means++ clustering algorithms
This paper proposes a machine learning-based method for identifying and analyzing ancient glass artifact types, with the aim of improving the efficiency of studying such objects. The method uses partial least squares regression models and an improved K-means++ clustering algorithm. To predict the chemical composition of weathering detection sites before weathering, the paper constructs a partial least squares regression model based on chi-square tests and analyses of variance. An Adaptive-LASSO regression model was then used to analyze the correlation between the chemical composition of different categories of glass artifacts. Additionally, a random forest classification model was established to analyze the classification patterns of high potassium glass and lead-barium glass, and feature screening of the chemical composition was carried out. A stepwise prediction model based on Bayesian parameter optimization of random forest was then used to analyze the chemical composition and identify the type of glass artifacts of unknown categories. To improve the K-means++ algorithm, the paper establishes a K-means++ clustering model based on weighted distance, which classifies the two types of glass separately. The method determines that for high-potassium glass, the fitting goodness-of-fit coefficients R for SiO2 and K2O curves are 0.92 and 0.92. For lead-barium glass, the fitting goodness-of-fit coefficients R for PbO and BaO are 0.91 and 0.82, both at a high level. The model fitting effect is good, and the optimal clustering number for both types of glass is K=3, with a reasonable model classification.