{"title":"Extractive Myanmar News Summarization Using Centroid Based Word Embedding","authors":"Soe Soe Lwin, K. Nwet","doi":"10.1109/AITC.2019.8921386","DOIUrl":null,"url":null,"abstract":"Nowadays, many researches are going on for text summarization because there are a lot of data on the internet and it is required to process, store and manage. Text summarization is a process of distilling important information from the original text and presents that information in the form of summary. The system is proposed to summarize Myanmar news with centroid based method. Centroid based method ranks the sentences based on their similarity to the centroid. Centroid based method uses the bags of words model to represent sentences. Bags of words representation does not capture the semantic relationship between words. To overcome this problem, centroid based method is combined with word embedding representation instead of bags of words in this paper. Experiments were done on Myanmar news dataset. Centroid based on word embedding method gets better performance than centroid based on bags of words method.","PeriodicalId":388642,"journal":{"name":"2019 International Conference on Advanced Information Technologies (ICAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advanced Information Technologies (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AITC.2019.8921386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Nowadays, many researches are going on for text summarization because there are a lot of data on the internet and it is required to process, store and manage. Text summarization is a process of distilling important information from the original text and presents that information in the form of summary. The system is proposed to summarize Myanmar news with centroid based method. Centroid based method ranks the sentences based on their similarity to the centroid. Centroid based method uses the bags of words model to represent sentences. Bags of words representation does not capture the semantic relationship between words. To overcome this problem, centroid based method is combined with word embedding representation instead of bags of words in this paper. Experiments were done on Myanmar news dataset. Centroid based on word embedding method gets better performance than centroid based on bags of words method.