{"title":"从歧义词到关键概念提取","authors":"M. Sajgalík, M. Barla, M. Bieliková","doi":"10.1109/DEXA.2013.16","DOIUrl":null,"url":null,"abstract":"Automatic acquisition of keywords for given document is still an area of active research. In this paper, we consider shift from keyword-based representation to other perspective on representation of document's focus in form of key-concepts. The advantage of using concepts over simple words is that concepts, apart from words, are unambiguous. This leads to better understanding of key-concepts than keywords. We present novel method of key-concept extraction, which provides an efficient way of automatic acquisition of key-concepts in machine processing. We evaluate our approach on classification problem, where we compare it to baseline TF-IDF keyword model and present its competitive results. We discuss its potential of its utilisation on the Web.","PeriodicalId":428515,"journal":{"name":"2013 24th International Workshop on Database and Expert Systems Applications","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"From Ambiguous Words to Key-Concept Extraction\",\"authors\":\"M. Sajgalík, M. Barla, M. Bieliková\",\"doi\":\"10.1109/DEXA.2013.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic acquisition of keywords for given document is still an area of active research. In this paper, we consider shift from keyword-based representation to other perspective on representation of document's focus in form of key-concepts. The advantage of using concepts over simple words is that concepts, apart from words, are unambiguous. This leads to better understanding of key-concepts than keywords. We present novel method of key-concept extraction, which provides an efficient way of automatic acquisition of key-concepts in machine processing. We evaluate our approach on classification problem, where we compare it to baseline TF-IDF keyword model and present its competitive results. We discuss its potential of its utilisation on the Web.\",\"PeriodicalId\":428515,\"journal\":{\"name\":\"2013 24th International Workshop on Database and Expert Systems Applications\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 24th International Workshop on Database and Expert Systems Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEXA.2013.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 24th International Workshop on Database and Expert Systems Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.2013.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic acquisition of keywords for given document is still an area of active research. In this paper, we consider shift from keyword-based representation to other perspective on representation of document's focus in form of key-concepts. The advantage of using concepts over simple words is that concepts, apart from words, are unambiguous. This leads to better understanding of key-concepts than keywords. We present novel method of key-concept extraction, which provides an efficient way of automatic acquisition of key-concepts in machine processing. We evaluate our approach on classification problem, where we compare it to baseline TF-IDF keyword model and present its competitive results. We discuss its potential of its utilisation on the Web.