Kaiquan Xu, S. Liao, Raymond Y. K. Lau, L. Liao, Heng Tang
{"title":"面向文本知识发现的自学语义标注方法","authors":"Kaiquan Xu, S. Liao, Raymond Y. K. Lau, L. Liao, Heng Tang","doi":"10.1109/HICSS.2009.898","DOIUrl":null,"url":null,"abstract":"As much valuable domain knowledge is hidden in enterprises' text repositories (e.g., email archives, digital libraries, etc.), it is desirable to develop effective knowledge management tools to process this unstructured data so as to extract domain knowledge for business decision making. Ontology-based semantic annotation of documents is one of the promising ways for knowledge discovery from text repositories. Existing semantic annotation methods usually require many labeled training examples before they can effectively operate, and this bottleneck holds back the widely applications of these semantic annotation methods. In this paper, we propose a semi-supervised semantic annotation method, self-teaching SVM-struct, which uses fewer labeled examples to improve the annotating performance. The key of the self-teaching method is how to identify the reliably predicted examples for retraining. Two novel confidence measures are developed to estimate prediction confidence. The experimental results show that the prediction performance of our self-teaching semantic annotation method is promising.","PeriodicalId":211759,"journal":{"name":"2009 42nd Hawaii International Conference on System Sciences","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Self-Teaching Semantic Annotation Method for Knowledge Discovery from Text\",\"authors\":\"Kaiquan Xu, S. Liao, Raymond Y. K. Lau, L. Liao, Heng Tang\",\"doi\":\"10.1109/HICSS.2009.898\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As much valuable domain knowledge is hidden in enterprises' text repositories (e.g., email archives, digital libraries, etc.), it is desirable to develop effective knowledge management tools to process this unstructured data so as to extract domain knowledge for business decision making. Ontology-based semantic annotation of documents is one of the promising ways for knowledge discovery from text repositories. Existing semantic annotation methods usually require many labeled training examples before they can effectively operate, and this bottleneck holds back the widely applications of these semantic annotation methods. In this paper, we propose a semi-supervised semantic annotation method, self-teaching SVM-struct, which uses fewer labeled examples to improve the annotating performance. The key of the self-teaching method is how to identify the reliably predicted examples for retraining. Two novel confidence measures are developed to estimate prediction confidence. The experimental results show that the prediction performance of our self-teaching semantic annotation method is promising.\",\"PeriodicalId\":211759,\"journal\":{\"name\":\"2009 42nd Hawaii International Conference on System Sciences\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 42nd Hawaii International Conference on System Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HICSS.2009.898\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 42nd Hawaii International Conference on System Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HICSS.2009.898","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Self-Teaching Semantic Annotation Method for Knowledge Discovery from Text
As much valuable domain knowledge is hidden in enterprises' text repositories (e.g., email archives, digital libraries, etc.), it is desirable to develop effective knowledge management tools to process this unstructured data so as to extract domain knowledge for business decision making. Ontology-based semantic annotation of documents is one of the promising ways for knowledge discovery from text repositories. Existing semantic annotation methods usually require many labeled training examples before they can effectively operate, and this bottleneck holds back the widely applications of these semantic annotation methods. In this paper, we propose a semi-supervised semantic annotation method, self-teaching SVM-struct, which uses fewer labeled examples to improve the annotating performance. The key of the self-teaching method is how to identify the reliably predicted examples for retraining. Two novel confidence measures are developed to estimate prediction confidence. The experimental results show that the prediction performance of our self-teaching semantic annotation method is promising.