{"title":"An Approach for Automatic Aspect Extraction by Latent Dirichlet Allocation","authors":"S. Das, B. Chakraborty","doi":"10.1109/ICAwST.2019.8923417","DOIUrl":null,"url":null,"abstract":"Now a days internet has taken over all form of communication and activities. One of the most affected area is e-commerce which survives through internet. One of the vital process of business survival is customer feedback. There are several form of platforms where the feedbacks can be posted. The main task is to accumulate all the reviews and summarize them in a conceivable manner. The approach here is to summarize the reviews in aspect based manner. This representation will help future consumers to make well informed decision. LDA is one of the popular methods to extract latent topics comprising a document. The present approach intends to use that characteristic to extract aspects from review corpora. The current scope of work is to extract and improve the quality of topics. After several corpora of product reviews were processed through this method, the results were examined through graphically plotting the topics and also examining the dominant keywords of the topics. Finally the accumulated results of the present method are compared with the results of previously implemented Word2Vec based model and human extracted aspects.","PeriodicalId":156538,"journal":{"name":"2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Awareness Science and Technology (iCAST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAwST.2019.8923417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Now a days internet has taken over all form of communication and activities. One of the most affected area is e-commerce which survives through internet. One of the vital process of business survival is customer feedback. There are several form of platforms where the feedbacks can be posted. The main task is to accumulate all the reviews and summarize them in a conceivable manner. The approach here is to summarize the reviews in aspect based manner. This representation will help future consumers to make well informed decision. LDA is one of the popular methods to extract latent topics comprising a document. The present approach intends to use that characteristic to extract aspects from review corpora. The current scope of work is to extract and improve the quality of topics. After several corpora of product reviews were processed through this method, the results were examined through graphically plotting the topics and also examining the dominant keywords of the topics. Finally the accumulated results of the present method are compared with the results of previously implemented Word2Vec based model and human extracted aspects.