{"title":"基于多尺度卷积窗的科学文献关键词提取方法","authors":"Yuhong Zhang, Yuxin Xie, Peipei Li, Xuegang Hu","doi":"10.1109/CCIS53392.2021.9754645","DOIUrl":null,"url":null,"abstract":"The key-phrase extraction is important for the downstream tasks in natural language process, and has attracted a lot of attention. Compared with other documents, scientific literatures contain many long phrases. Most existing methods perform poor on these literatures. To address this problem, a key-phrase extraction method based on multi-size convolution windows (KE-MCW) is proposed for scientific literatures in this paper. More specifically, in order to represent more contextual information, a convolutional neural network(CNN) with multi-size filters is introduced to map the documents into distributed feature vectors, then each vector can represent different size phrases. Next, in order to determine whether each word is a part of a keyphrase, a deep recurrent neural network is used to mark the role of each word. Finally, the attention mechanism is used to further judge the importance of each phrase. Experimental results show that our proposed method performs better than some competitive methods for technology literatures.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"183 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Key-phrase Extraction Method Based on Multi-size Convolution Windows for Scientific Literatures\",\"authors\":\"Yuhong Zhang, Yuxin Xie, Peipei Li, Xuegang Hu\",\"doi\":\"10.1109/CCIS53392.2021.9754645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The key-phrase extraction is important for the downstream tasks in natural language process, and has attracted a lot of attention. Compared with other documents, scientific literatures contain many long phrases. Most existing methods perform poor on these literatures. To address this problem, a key-phrase extraction method based on multi-size convolution windows (KE-MCW) is proposed for scientific literatures in this paper. More specifically, in order to represent more contextual information, a convolutional neural network(CNN) with multi-size filters is introduced to map the documents into distributed feature vectors, then each vector can represent different size phrases. Next, in order to determine whether each word is a part of a keyphrase, a deep recurrent neural network is used to mark the role of each word. Finally, the attention mechanism is used to further judge the importance of each phrase. Experimental results show that our proposed method performs better than some competitive methods for technology literatures.\",\"PeriodicalId\":191226,\"journal\":{\"name\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"volume\":\"183 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCIS53392.2021.9754645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Key-phrase Extraction Method Based on Multi-size Convolution Windows for Scientific Literatures
The key-phrase extraction is important for the downstream tasks in natural language process, and has attracted a lot of attention. Compared with other documents, scientific literatures contain many long phrases. Most existing methods perform poor on these literatures. To address this problem, a key-phrase extraction method based on multi-size convolution windows (KE-MCW) is proposed for scientific literatures in this paper. More specifically, in order to represent more contextual information, a convolutional neural network(CNN) with multi-size filters is introduced to map the documents into distributed feature vectors, then each vector can represent different size phrases. Next, in order to determine whether each word is a part of a keyphrase, a deep recurrent neural network is used to mark the role of each word. Finally, the attention mechanism is used to further judge the importance of each phrase. Experimental results show that our proposed method performs better than some competitive methods for technology literatures.