Attention-based LSTM-CNNs for uncertainty identification on Chinese social media texts

2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC) Pub Date : 2017-12-01 DOI:10.1109/SPAC.2017.8304349

Binyang Li, Kaiming Zhou, Wei Gao, Xu Han, Linna Zhou

{"title":"Attention-based LSTM-CNNs for uncertainty identification on Chinese social media texts","authors":"Binyang Li, Kaiming Zhou, Wei Gao, Xu Han, Linna Zhou","doi":"10.1109/SPAC.2017.8304349","DOIUrl":null,"url":null,"abstract":"Uncertainty identification is an important semantic processing task, which is crucial to the quality of information in terms of factuality in many techniques, e.g. topic detection, question answering. Especially in social media, the texts are written informally which are widely used in many applications, so the factuality has become a premier concern. However, existing approaches that still rely on lexical cues suffer greatly from the casual or word-of-mouth peculiarity of social media, in which the cue phrases are often expressed in sub-standard form or even omitted from sentences. To tackle these problems, this paper proposes the attention-based LSTM-CNNs for the uncertainty identification on social media texts, named ALUNI. ALUNI incorporates attention-based LSTM networks to represent the semantics of words, and convolutional neural networks to capture the most important semantics of uncertainty for identification. Experiments are conducted on both Chinese Weibo and news datasets, and 78.19% and 73.95% of F1-measure scores are achieved with 11% and 3% improvement over the baseline, respectively.","PeriodicalId":161647,"journal":{"name":"2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC.2017.8304349","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Uncertainty identification is an important semantic processing task, which is crucial to the quality of information in terms of factuality in many techniques, e.g. topic detection, question answering. Especially in social media, the texts are written informally which are widely used in many applications, so the factuality has become a premier concern. However, existing approaches that still rely on lexical cues suffer greatly from the casual or word-of-mouth peculiarity of social media, in which the cue phrases are often expressed in sub-standard form or even omitted from sentences. To tackle these problems, this paper proposes the attention-based LSTM-CNNs for the uncertainty identification on social media texts, named ALUNI. ALUNI incorporates attention-based LSTM networks to represent the semantics of words, and convolutional neural networks to capture the most important semantics of uncertainty for identification. Experiments are conducted on both Chinese Weibo and news datasets, and 78.19% and 73.95% of F1-measure scores are achieved with 11% and 3% improvement over the baseline, respectively.

查看原文本刊更多论文

基于注意力的lstm - cnn中文社交媒体文本不确定性识别

不确定性识别是一项重要的语义处理任务，在主题检测、问答等许多技术中，不确定性识别对信息的真实性至关重要。特别是在社交媒体中，文本是非正式的，在许多应用中被广泛使用，因此事实性已经成为首要关注的问题。然而，现有的仍然依赖词汇线索的方法受到社交媒体随意或口口相传的特点的极大影响，在社交媒体中，线索短语往往以不标准的形式表达，甚至从句子中省略。为了解决这些问题，本文提出了一种基于注意力的lstm - cnn用于社交媒体文本的不确定性识别，称为ALUNI。ALUNI结合了基于注意力的LSTM网络来表示单词的语义，以及卷积神经网络来捕获最重要的不确定性语义用于识别。在中文微博和新闻数据集上进行实验，f1测量得分达到78.19%和73.95%，分别比基线提高11%和3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)

自引率

0.00%

发文量