{"title":"Chinese Keyword Extraction Based on Word Platform","authors":"Hui Jiao, Qian Liu, Hui-bo Jia","doi":"10.1109/FSKD.2007.215","DOIUrl":null,"url":null,"abstract":"At present researches on Chinese keyword extraction mainly focus on automatic segmentation which is a pretreatment problem. This paper presents a kind of Chinese encoding method based on word platform, and establishes a new Chinese document format in computer. This method makes word the smallest information unit. Chinese keyword extraction does not rely on segmentation by this new method. Thereby the efficiency and quality could be improved. Statistical analysis is adopted to conduct the experiment of keyword extraction based on word platform, and experimental results are satisfying.","PeriodicalId":201883,"journal":{"name":"Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2007.215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
At present researches on Chinese keyword extraction mainly focus on automatic segmentation which is a pretreatment problem. This paper presents a kind of Chinese encoding method based on word platform, and establishes a new Chinese document format in computer. This method makes word the smallest information unit. Chinese keyword extraction does not rely on segmentation by this new method. Thereby the efficiency and quality could be improved. Statistical analysis is adopted to conduct the experiment of keyword extraction based on word platform, and experimental results are satisfying.