Attention-based LSTM-CNNs for uncertainty identification on Chinese social media texts

Binyang Li, Kaiming Zhou, Wei Gao, Xu Han, Linna Zhou
{"title":"Attention-based LSTM-CNNs for uncertainty identification on Chinese social media texts","authors":"Binyang Li, Kaiming Zhou, Wei Gao, Xu Han, Linna Zhou","doi":"10.1109/SPAC.2017.8304349","DOIUrl":null,"url":null,"abstract":"Uncertainty identification is an important semantic processing task, which is crucial to the quality of information in terms of factuality in many techniques, e.g. topic detection, question answering. Especially in social media, the texts are written informally which are widely used in many applications, so the factuality has become a premier concern. However, existing approaches that still rely on lexical cues suffer greatly from the casual or word-of-mouth peculiarity of social media, in which the cue phrases are often expressed in sub-standard form or even omitted from sentences. To tackle these problems, this paper proposes the attention-based LSTM-CNNs for the uncertainty identification on social media texts, named ALUNI. ALUNI incorporates attention-based LSTM networks to represent the semantics of words, and convolutional neural networks to capture the most important semantics of uncertainty for identification. Experiments are conducted on both Chinese Weibo and news datasets, and 78.19% and 73.95% of F1-measure scores are achieved with 11% and 3% improvement over the baseline, respectively.","PeriodicalId":161647,"journal":{"name":"2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC.2017.8304349","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Uncertainty identification is an important semantic processing task, which is crucial to the quality of information in terms of factuality in many techniques, e.g. topic detection, question answering. Especially in social media, the texts are written informally which are widely used in many applications, so the factuality has become a premier concern. However, existing approaches that still rely on lexical cues suffer greatly from the casual or word-of-mouth peculiarity of social media, in which the cue phrases are often expressed in sub-standard form or even omitted from sentences. To tackle these problems, this paper proposes the attention-based LSTM-CNNs for the uncertainty identification on social media texts, named ALUNI. ALUNI incorporates attention-based LSTM networks to represent the semantics of words, and convolutional neural networks to capture the most important semantics of uncertainty for identification. Experiments are conducted on both Chinese Weibo and news datasets, and 78.19% and 73.95% of F1-measure scores are achieved with 11% and 3% improvement over the baseline, respectively.
基于注意力的lstm - cnn中文社交媒体文本不确定性识别
不确定性识别是一项重要的语义处理任务,在主题检测、问答等许多技术中,不确定性识别对信息的真实性至关重要。特别是在社交媒体中,文本是非正式的,在许多应用中被广泛使用,因此事实性已经成为首要关注的问题。然而,现有的仍然依赖词汇线索的方法受到社交媒体随意或口口相传的特点的极大影响,在社交媒体中,线索短语往往以不标准的形式表达,甚至从句子中省略。为了解决这些问题,本文提出了一种基于注意力的lstm - cnn用于社交媒体文本的不确定性识别,称为ALUNI。ALUNI结合了基于注意力的LSTM网络来表示单词的语义,以及卷积神经网络来捕获最重要的不确定性语义用于识别。在中文微博和新闻数据集上进行实验,f1测量得分达到78.19%和73.95%,分别比基线提高11%和3%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信