Building Tag Systems Based on Advanced Semantic Hierarchical Clustering
Authors: Wenxin Yang, Zhiming Zhang, G. Huang
Published in: 2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2019-12-01
DOI: https://doi.org/10.1109/IAEAC47372.2019.8997666
An improved method for building tag systems, based on semantic analysis and a clustering algorithm, is proposed. First, Hive is used to process the data. Then, Advanced Semantic Hierarchical Clustering (ASHC), an adaptation of Semantic Hierarchical Clustering (SHC), is used to build the synonym relationships and hypernym-hyponym relationships of the tag trees and to improve the precision and efficiency of the tag systems. Finally, obviously incorrect paths and isolated nodes are removed. To evaluate the method, the tag coincidence rate, the hypernym-hyponym coincidence rate, and the accuracy are used to assess the precision of merging and constructing tag systems. The results show that, compared with SHC, the accuracy of ASHC increases by 2.7% on average, and that after the tags are adjusted, these metrics improve by more than 3.9%. On this basis, tag systems with higher precision and applicability can be built.
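The final cleanup and evaluation steps can be illustrated with a small sketch. The abstract does not define the metrics or the data layout, so everything here is an assumption made for illustration: a tag tree is stored as a set of (hypernym, hyponym) pairs, an isolated node is a tag that appears in no pair, and the hypernym-hyponym coincidence rate is taken to be the share of built pairs that also occur in a reference set. The function names `prune_isolated` and `edge_coincidence_rate` are hypothetical and not from the paper.

```python
from typing import Iterable, Set, Tuple

Edge = Tuple[str, str]  # (hypernym, hyponym); assumed representation


def prune_isolated(tags: Iterable[str], edges: Iterable[Edge]) -> Set[str]:
    """Keep only tags that appear in at least one hypernym-hyponym pair."""
    connected = {tag for edge in edges for tag in edge}
    return {tag for tag in tags if tag in connected}


def edge_coincidence_rate(built: Set[Edge], reference: Set[Edge]) -> float:
    """Share of built pairs that also occur in the reference pairs.

    This definition is an assumption; the paper may define the rate differently.
    """
    if not built:
        return 0.0
    return len(built & reference) / len(built)


# Toy data, invented for illustration only.
tags = {"sports", "football", "soccer", "basketball", "misc"}
built_edges = {
    ("sports", "football"),
    ("sports", "basketball"),
    ("football", "sports"),  # a reversed, incorrect pair
}
reference_edges = {("sports", "football"), ("sports", "basketball")}

kept = prune_isolated(tags, built_edges)
rate = edge_coincidence_rate(built_edges, reference_edges)
```

In this toy data, "soccer" and "misc" take part in no pair and are dropped, and two of the three built pairs match the reference. Synonym merging and the clustering step itself are left out, since the abstract gives no detail on how ASHC computes them.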