Going Big and Deep: Using Convolutional Neural Network to Leverage Training Data from Multiple Domains for Cross-Domain Sentiment Classification on Product Reviews
{"title":"Going Big and Deep: Using Convolutional Neural Network to Leverage Training Data from Multiple Domains for Cross-Domain Sentiment Classification on Product Reviews","authors":"Aditi Gupta, Jasy Liew Suet Yan, Cheah Yu-N","doi":"10.1109/IICAIET49801.2020.9257815","DOIUrl":null,"url":null,"abstract":"Training a classifier for sentiment polarity detection in product reviews when labeled data is not available for a particular domain poses a challenge, which can be addressed through cross-domain sentiment analysis. We experimented with Convolutional Neural Network (CNN) to learn sentiment polarity (positive or negative) from labeled data available in many different source domains and test its performance on a target domain that it is not trained on. Extensive experiments were conducted on 14 different domains using Amazon product reviews. Our preliminary findings show that cross-domain CNN models trained with multiple source domains achieved accuracy of above 80% across all the domains and outperform the in-domain models trained using limited labeled data from the same domain. In fact, the cross-domain CNN models demonstrated better performance when a larger number of source domains are used for training. Therefore, going deep and big is a promising direction to explore for cross-domain sentiment classification.","PeriodicalId":300885,"journal":{"name":"2020 IEEE 2nd International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 2nd International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET49801.2020.9257815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Training a classifier for sentiment polarity detection in product reviews when labeled data is not available for a particular domain poses a challenge, which can be addressed through cross-domain sentiment analysis. We experimented with Convolutional Neural Network (CNN) to learn sentiment polarity (positive or negative) from labeled data available in many different source domains and test its performance on a target domain that it is not trained on. Extensive experiments were conducted on 14 different domains using Amazon product reviews. Our preliminary findings show that cross-domain CNN models trained with multiple source domains achieved accuracy of above 80% across all the domains and outperform the in-domain models trained using limited labeled data from the same domain. In fact, the cross-domain CNN models demonstrated better performance when a larger number of source domains are used for training. Therefore, going deep and big is a promising direction to explore for cross-domain sentiment classification.