Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis

2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) Pub Date : 2019-12-01 DOI:10.1109/ISRITI48646.2019.9034599

A. Rozie, Andria Arisal, D. Munandar

{"title":"Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis","authors":"A. Rozie, Andria Arisal, D. Munandar","doi":"10.1109/ISRITI48646.2019.9034599","DOIUrl":null,"url":null,"abstract":"Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.

查看原文本刊更多论文

基于多通道卷积神经网络模型的跨域情感分析

使用深度学习进行情感分析需要大量标记数据集，而这些数据并不总是可用的。注释过程也很耗时和繁琐。此外，即使在我们训练了情感分析之后，它也会产生另一个问题。因为这个模型是领域相关的，所以在另一个领域的性能估计会很差。在本文中，我们提出了一种迁移学习方法，将从源数据集中获得的知识迁移到目标数据集中，以期改进目标模型。多通道卷积神经网络在单个CNN模型中部署不同的n-gram作为输入通道，从文本中抓取有意义的特征。该方法已被证明在情感分析问题中表现良好。我们使用该方法作为基线，对三个不同域的数据集进行训练。然后最大的数据集成为迁移学习的源模型，其他数据集作为目标。在将源模型重新训练到目标数据集时也需要对其进行微调。从评估中，我们发现即使在数据不平衡的情况下，几种迁移学习策略也优于特定领域模型。我们还强调了某些失败的策略，这些策略会对目标模型的性能造成糟糕的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)

自引率

0.00%

发文量