Chinese Microblog Sentiment Classification Based on Deep Belief Nets with Extended Multi-Modality Features

Xiao Sun, Chengcheng Li, Wanyi Xu, F. Ren
{"title":"Chinese Microblog Sentiment Classification Based on Deep Belief Nets with Extended Multi-Modality Features","authors":"Xiao Sun, Chengcheng Li, Wanyi Xu, F. Ren","doi":"10.1109/ICDMW.2014.101","DOIUrl":null,"url":null,"abstract":"This paper presents a DBN (deep belief nets) model and a multi-modality feature extraction method to extend features' dimensionalities of short text for Chinese micro blogging sentiment classification. Besides traditional features sets for document classification, comments for certain posts are also extracted as part of the micro blogging features according to the relationship between commenters and posters though constructing micro blogging social network as input information. Then, the integration of the above modality features is combined and represented as input vector for DBN. In this paper, a DBN model, which is stacked with several layers of RBM (Restricted Boltzmann Machine), is implemented to initialize the structure of neural network. The RBM layers can take probability distribution samples of original data to learn hidden structures for better feature representation. A Class RBM (Classification RBM) layer, which is stacked on top of several RBM layers, is adapted to achieve the final sentiment classification. The results demonstrate that, with proper structure and parameter, the performance of the proposed deep learning method on sentiment classification is better than state of the art surface learning models such as SVM or NB, which proves that DBN is suitable for short-length document classification with the proposed feature dimensionality extension method.","PeriodicalId":289269,"journal":{"name":"2014 IEEE International Conference on Data Mining Workshop","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Data Mining Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2014.101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

This paper presents a DBN (deep belief nets) model and a multi-modality feature extraction method to extend features' dimensionalities of short text for Chinese micro blogging sentiment classification. Besides traditional features sets for document classification, comments for certain posts are also extracted as part of the micro blogging features according to the relationship between commenters and posters though constructing micro blogging social network as input information. Then, the integration of the above modality features is combined and represented as input vector for DBN. In this paper, a DBN model, which is stacked with several layers of RBM (Restricted Boltzmann Machine), is implemented to initialize the structure of neural network. The RBM layers can take probability distribution samples of original data to learn hidden structures for better feature representation. A Class RBM (Classification RBM) layer, which is stacked on top of several RBM layers, is adapted to achieve the final sentiment classification. The results demonstrate that, with proper structure and parameter, the performance of the proposed deep learning method on sentiment classification is better than state of the art surface learning models such as SVM or NB, which proves that DBN is suitable for short-length document classification with the proposed feature dimensionality extension method.
基于扩展多模态特征的深度信念网络的中文微博情感分类
本文提出了一种深度信念网络模型和多模态特征提取方法,用于中文微博情感分类短文本特征维度的扩展。除了传统的特征集用于文档分类之外,通过构建微博社交网络作为输入信息,根据评论者与发帖者之间的关系,提取特定帖子的评论作为微博特征的一部分。然后,将上述模态特征的积分组合为DBN的输入向量。本文实现了一种由若干层受限玻尔兹曼机(Restricted Boltzmann Machine, RBM)叠加而成的DBN模型来初始化神经网络的结构。RBM层可以获取原始数据的概率分布样本来学习隐藏结构,以获得更好的特征表示。一个类RBM(分类RBM)层,它被堆叠在几个RBM层之上,用于实现最终的情感分类。结果表明,在适当的结构和参数下,所提出的深度学习方法在情感分类上的性能优于目前最先进的表面学习模型(如SVM或NB),这证明DBN适用于使用所提出的特征维数扩展方法进行短长度文档分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信