{"title":"基于定量关联规则的在线军事新闻文本聚类效果研究","authors":"Liang-Chu Chen, Chyi-Bao Yang, Jih-Hsin Chen, Yen-Hsuan Lien","doi":"10.1109/IALP.2009.48","DOIUrl":null,"url":null,"abstract":"Text clustering is an automatic technique to group texts using the approach of feature extraction and term connection to calculate the similarities among subject contents of texts. Since the properties of terms in Chinese text (e.g. segmentation and annotation) are not as clear as the other languages, extracting and distinguishing features from Chinese text is therefore much more difficult, which greatly impacts the effects of clustering. From the perspective of military news, this paper applies both quantitative association rule and hierarchical agglomerative algorithm to cluster Chinese news published in Youth Daily News, and the application results are compared with those by the traditional vector space model approach and by the general association rule approach, respectively. F-measure is used as evaluation metric in the experiments. Experimental results show that the quantitative association rule approach performs more accurately than both the vector space model and association rule in text automatic clustering.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Exploring the Effects of Text Clustering on On-Line Military News Based on Quantitative Association Rule\",\"authors\":\"Liang-Chu Chen, Chyi-Bao Yang, Jih-Hsin Chen, Yen-Hsuan Lien\",\"doi\":\"10.1109/IALP.2009.48\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text clustering is an automatic technique to group texts using the approach of feature extraction and term connection to calculate the similarities among subject contents of texts. Since the properties of terms in Chinese text (e.g. segmentation and annotation) are not as clear as the other languages, extracting and distinguishing features from Chinese text is therefore much more difficult, which greatly impacts the effects of clustering. From the perspective of military news, this paper applies both quantitative association rule and hierarchical agglomerative algorithm to cluster Chinese news published in Youth Daily News, and the application results are compared with those by the traditional vector space model approach and by the general association rule approach, respectively. F-measure is used as evaluation metric in the experiments. Experimental results show that the quantitative association rule approach performs more accurately than both the vector space model and association rule in text automatic clustering.\",\"PeriodicalId\":156840,\"journal\":{\"name\":\"2009 International Conference on Asian Language Processing\",\"volume\":\"106 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2009.48\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2009.48","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring the Effects of Text Clustering on On-Line Military News Based on Quantitative Association Rule
Text clustering is an automatic technique to group texts using the approach of feature extraction and term connection to calculate the similarities among subject contents of texts. Since the properties of terms in Chinese text (e.g. segmentation and annotation) are not as clear as the other languages, extracting and distinguishing features from Chinese text is therefore much more difficult, which greatly impacts the effects of clustering. From the perspective of military news, this paper applies both quantitative association rule and hierarchical agglomerative algorithm to cluster Chinese news published in Youth Daily News, and the application results are compared with those by the traditional vector space model approach and by the general association rule approach, respectively. F-measure is used as evaluation metric in the experiments. Experimental results show that the quantitative association rule approach performs more accurately than both the vector space model and association rule in text automatic clustering.