{"title":"领域概念层次结构的自动构建","authors":"Sun Qiao, Z. Chunhui, Chen Zhibo","doi":"10.1109/CYBERC.2010.85","DOIUrl":null,"url":null,"abstract":"A general automatic domain concept hierarchy construction procedure is presented in this paper. This is a domain independent construct a domain concept hierarchy from a domain corpus . The construction procedure mainly includes domain terminology extraction, word sense disambiguation, similarity computation, hierarchy construction and subsumption relation detection. All extracted candidate terms are ranked first, then one can select the top terms as domain terminologies. Frequency ratio and entropy of a word are considered to rank candidate terms. Relations between terms are taken into account for words in WordNet, while distributional similarity is used to compute similarity between words outside WordNet. Experiments on two domain corpus show that the proposed procedure is feasible and can get reasonable concept hierarchy.","PeriodicalId":315132,"journal":{"name":"2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic Construction of Domain Concept Hierarchy\",\"authors\":\"Sun Qiao, Z. Chunhui, Chen Zhibo\",\"doi\":\"10.1109/CYBERC.2010.85\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A general automatic domain concept hierarchy construction procedure is presented in this paper. This is a domain independent construct a domain concept hierarchy from a domain corpus . The construction procedure mainly includes domain terminology extraction, word sense disambiguation, similarity computation, hierarchy construction and subsumption relation detection. All extracted candidate terms are ranked first, then one can select the top terms as domain terminologies. Frequency ratio and entropy of a word are considered to rank candidate terms. Relations between terms are taken into account for words in WordNet, while distributional similarity is used to compute similarity between words outside WordNet. Experiments on two domain corpus show that the proposed procedure is feasible and can get reasonable concept hierarchy.\",\"PeriodicalId\":315132,\"journal\":{\"name\":\"2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CYBERC.2010.85\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBERC.2010.85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Construction of Domain Concept Hierarchy
A general automatic domain concept hierarchy construction procedure is presented in this paper. This is a domain independent construct a domain concept hierarchy from a domain corpus . The construction procedure mainly includes domain terminology extraction, word sense disambiguation, similarity computation, hierarchy construction and subsumption relation detection. All extracted candidate terms are ranked first, then one can select the top terms as domain terminologies. Frequency ratio and entropy of a word are considered to rank candidate terms. Relations between terms are taken into account for words in WordNet, while distributional similarity is used to compute similarity between words outside WordNet. Experiments on two domain corpus show that the proposed procedure is feasible and can get reasonable concept hierarchy.