Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis

A. Rozie, Andria Arisal, D. Munandar
{"title":"Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis","authors":"A. Rozie, Andria Arisal, D. Munandar","doi":"10.1109/ISRITI48646.2019.9034599","DOIUrl":null,"url":null,"abstract":"Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.
基于多通道卷积神经网络模型的跨域情感分析
使用深度学习进行情感分析需要大量标记数据集,而这些数据并不总是可用的。注释过程也很耗时和繁琐。此外,即使在我们训练了情感分析之后,它也会产生另一个问题。因为这个模型是领域相关的,所以在另一个领域的性能估计会很差。在本文中,我们提出了一种迁移学习方法,将从源数据集中获得的知识迁移到目标数据集中,以期改进目标模型。多通道卷积神经网络在单个CNN模型中部署不同的n-gram作为输入通道,从文本中抓取有意义的特征。该方法已被证明在情感分析问题中表现良好。我们使用该方法作为基线,对三个不同域的数据集进行训练。然后最大的数据集成为迁移学习的源模型,其他数据集作为目标。在将源模型重新训练到目标数据集时也需要对其进行微调。从评估中,我们发现即使在数据不平衡的情况下,几种迁移学习策略也优于特定领域模型。我们还强调了某些失败的策略,这些策略会对目标模型的性能造成糟糕的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信