{"title":"基于决策树和k-均值方法的玻璃伪影识别","authors":"Han Liu, Sinan Wang","doi":"10.1109/TechDev57621.2022.00010","DOIUrl":null,"url":null,"abstract":"K-means algorithm not only has good processing ability of images and data sets, but also has good stability and scalability, and has good clustering effect. In addition, the clustering effect will not be affected by the order of data sets, and there is no need for corresponding constraints on the range of data sets. During the weathering process, ancient glass undergoes a large exchange of internal elements with environmental elements resulting in changes in its composition ratio, which affects the correct judgment of its category. In order to analyze the classification law of high potassium glass and lead-barium glass, a decision tree is used to determine lead oxide (PbO) as the characteristic element, and when lead oxide (PbO)5.26, it is classified as high potassium glass, and vice versa as lead-barium glass; on the On the basis of the two categories, considering that weathering and unweathering have certain influence on the subclass division, a two-layer subclass division is adopted, divided into four major categories, and the standard deviation statistics are conducted for the chemical components of each category, and several chemical components with larger standard deviation are screened out as characteristic elements, and the subclass division is carried out by kmeans clustering method, and finally a decision tree is used to find the chemical components that can be used as the basis for the division. The goodness of the decision tree is evaluated and analyzed by ROC curve to illustrate the rationality of the classification, and sensitivity analysis is performed by the relationship between the number of clusters and the depth of the decision tree to illustrate the reliability of the results.","PeriodicalId":171777,"journal":{"name":"2022 11th International Conference on Computer Technologies and Development (TechDev)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Glass Artifacts Based on Decision Tree and k-Means Method\",\"authors\":\"Han Liu, Sinan Wang\",\"doi\":\"10.1109/TechDev57621.2022.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"K-means algorithm not only has good processing ability of images and data sets, but also has good stability and scalability, and has good clustering effect. In addition, the clustering effect will not be affected by the order of data sets, and there is no need for corresponding constraints on the range of data sets. During the weathering process, ancient glass undergoes a large exchange of internal elements with environmental elements resulting in changes in its composition ratio, which affects the correct judgment of its category. In order to analyze the classification law of high potassium glass and lead-barium glass, a decision tree is used to determine lead oxide (PbO) as the characteristic element, and when lead oxide (PbO)5.26, it is classified as high potassium glass, and vice versa as lead-barium glass; on the On the basis of the two categories, considering that weathering and unweathering have certain influence on the subclass division, a two-layer subclass division is adopted, divided into four major categories, and the standard deviation statistics are conducted for the chemical components of each category, and several chemical components with larger standard deviation are screened out as characteristic elements, and the subclass division is carried out by kmeans clustering method, and finally a decision tree is used to find the chemical components that can be used as the basis for the division. The goodness of the decision tree is evaluated and analyzed by ROC curve to illustrate the rationality of the classification, and sensitivity analysis is performed by the relationship between the number of clusters and the depth of the decision tree to illustrate the reliability of the results.\",\"PeriodicalId\":171777,\"journal\":{\"name\":\"2022 11th International Conference on Computer Technologies and Development (TechDev)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference on Computer Technologies and Development (TechDev)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TechDev57621.2022.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference on Computer Technologies and Development (TechDev)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TechDev57621.2022.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identification of Glass Artifacts Based on Decision Tree and k-Means Method
K-means algorithm not only has good processing ability of images and data sets, but also has good stability and scalability, and has good clustering effect. In addition, the clustering effect will not be affected by the order of data sets, and there is no need for corresponding constraints on the range of data sets. During the weathering process, ancient glass undergoes a large exchange of internal elements with environmental elements resulting in changes in its composition ratio, which affects the correct judgment of its category. In order to analyze the classification law of high potassium glass and lead-barium glass, a decision tree is used to determine lead oxide (PbO) as the characteristic element, and when lead oxide (PbO)5.26, it is classified as high potassium glass, and vice versa as lead-barium glass; on the On the basis of the two categories, considering that weathering and unweathering have certain influence on the subclass division, a two-layer subclass division is adopted, divided into four major categories, and the standard deviation statistics are conducted for the chemical components of each category, and several chemical components with larger standard deviation are screened out as characteristic elements, and the subclass division is carried out by kmeans clustering method, and finally a decision tree is used to find the chemical components that can be used as the basis for the division. The goodness of the decision tree is evaluated and analyzed by ROC curve to illustrate the rationality of the classification, and sensitivity analysis is performed by the relationship between the number of clusters and the depth of the decision tree to illustrate the reliability of the results.