{"title":"A Key-phrase Extraction Method Based on Multi-size Convolution Windows for Scientific Literatures","authors":"Yuhong Zhang, Yuxin Xie, Peipei Li, Xuegang Hu","doi":"10.1109/CCIS53392.2021.9754645","DOIUrl":null,"url":null,"abstract":"The key-phrase extraction is important for the downstream tasks in natural language process, and has attracted a lot of attention. Compared with other documents, scientific literatures contain many long phrases. Most existing methods perform poor on these literatures. To address this problem, a key-phrase extraction method based on multi-size convolution windows (KE-MCW) is proposed for scientific literatures in this paper. More specifically, in order to represent more contextual information, a convolutional neural network(CNN) with multi-size filters is introduced to map the documents into distributed feature vectors, then each vector can represent different size phrases. Next, in order to determine whether each word is a part of a keyphrase, a deep recurrent neural network is used to mark the role of each word. Finally, the attention mechanism is used to further judge the importance of each phrase. Experimental results show that our proposed method performs better than some competitive methods for technology literatures.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"183 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The key-phrase extraction is important for the downstream tasks in natural language process, and has attracted a lot of attention. Compared with other documents, scientific literatures contain many long phrases. Most existing methods perform poor on these literatures. To address this problem, a key-phrase extraction method based on multi-size convolution windows (KE-MCW) is proposed for scientific literatures in this paper. More specifically, in order to represent more contextual information, a convolutional neural network(CNN) with multi-size filters is introduced to map the documents into distributed feature vectors, then each vector can represent different size phrases. Next, in order to determine whether each word is a part of a keyphrase, a deep recurrent neural network is used to mark the role of each word. Finally, the attention mechanism is used to further judge the importance of each phrase. Experimental results show that our proposed method performs better than some competitive methods for technology literatures.