错误信息检测的动机、方法和度量:NLP视角

Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang
{"title":"错误信息检测的动机、方法和度量:NLP视角","authors":"Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang","doi":"10.2991/nlpr.d.200522.001","DOIUrl":null,"url":null,"abstract":"ive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serves as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into a RNN-based neural network to do misinformation detection. The experimental results are compared against the task using only the original texts, and finally demonstrate higher performance. Fact checking is the task of assessing the truthfulness of claims especially made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, thoughmisinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false. Rumor detection is often confused with fake news detection, since rumor refers to a statement consisting of unverified information at the posting time. Rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection to first detect worth-checking statements prior to classifying the statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified. Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in the true and misrepresented information can be different, since publishers of misinformation focus more on the degree to impress the audience and the spreading speed of the information. Thus, misinformation typically either contains intense emotion which could easily resonate with the public, or Q. Su et al. / Natural Language Processing Research 1(1-2) 1–13 3 controversial statements aiming to evoke intense emotion among receivers. Thus, misinformation detection can also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose a Emotion-based misinformation Detection framework to learn contentand comment-emotion representations for publishers and users respectively so as to exploit content and social emotions simultaneously for misinformation detection. 1.3. An Overview of the Survey This survey aims to present a comprehensive review on studying misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of descriptive analysis (i.e., the characterization of low-credibility information) and the external dimension of predictive modeling (i.e., the automatic detection of misinformation). In particular, the publicly available datasets and the state-of-the-art technologies are reviewed in terms of the detection approaches, feature representations and model construction. Finally, challenges of misinformation detection are summarized andnewprospects are provided for futuremisinformation detection works.","PeriodicalId":332352,"journal":{"name":"Natural Language Processing Research","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":"{\"title\":\"Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective\",\"authors\":\"Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang\",\"doi\":\"10.2991/nlpr.d.200522.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serves as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into a RNN-based neural network to do misinformation detection. The experimental results are compared against the task using only the original texts, and finally demonstrate higher performance. Fact checking is the task of assessing the truthfulness of claims especially made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, thoughmisinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false. Rumor detection is often confused with fake news detection, since rumor refers to a statement consisting of unverified information at the posting time. Rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection to first detect worth-checking statements prior to classifying the statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified. Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in the true and misrepresented information can be different, since publishers of misinformation focus more on the degree to impress the audience and the spreading speed of the information. Thus, misinformation typically either contains intense emotion which could easily resonate with the public, or Q. Su et al. / Natural Language Processing Research 1(1-2) 1–13 3 controversial statements aiming to evoke intense emotion among receivers. Thus, misinformation detection can also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose a Emotion-based misinformation Detection framework to learn contentand comment-emotion representations for publishers and users respectively so as to exploit content and social emotions simultaneously for misinformation detection. 1.3. An Overview of the Survey This survey aims to present a comprehensive review on studying misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of descriptive analysis (i.e., the characterization of low-credibility information) and the external dimension of predictive modeling (i.e., the automatic detection of misinformation). In particular, the publicly available datasets and the state-of-the-art technologies are reviewed in terms of the detection approaches, feature representations and model construction. Finally, challenges of misinformation detection are summarized andnewprospects are provided for futuremisinformation detection works.\",\"PeriodicalId\":332352,\"journal\":{\"name\":\"Natural Language Processing Research\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"45\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Language Processing Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2991/nlpr.d.200522.001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2991/nlpr.d.200522.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 45

摘要

摘要也是一项相关的任务,有助于促进错误信息的检测。具体来说,摘要模型可以用于识别输入文本的中心声明,并在错误信息检测之前作为特征提取器。例如,esmailzadeh等人[24]使用文本摘要模型首先对文章进行摘要,然后将摘要序列输入到基于rnn的神经网络中进行误信息检测。实验结果与仅使用原始文本的任务进行了比较,最终证明了更高的性能。事实核查的任务是评估言论的真实性,尤其是政治家等公众人物的言论[25]。通常,错误信息检测和事实检查之间没有明确的区别,因为它们都旨在评估声明的真实性,尽管错误信息检测通常侧重于某些信息,而事实检查则更广泛[26]。然而,当一条信息包含需要验证为真或假的声明时,事实检查也可能是错误信息检测的相关任务。谣言检测常常与假新闻检测相混淆,因为谣言是指在发布时由未经证实的信息组成的声明。然后将谣言检测任务定义为将个人陈述分为谣言或非谣言[27]。因此,谣言检测也可以作为错误信息检测的另一个相关任务,首先检测值得检查的语句,然后将其分类为真或假。这可以帮助减轻主观意见或感觉对需要进一步验证的陈述选择的影响。情感分析是从文本或用户立场中提取情感的任务。真实和虚假信息中的情绪可能是不同的,因为虚假信息的发布者更关注给受众留下深刻印象的程度和信息的传播速度。因此,错误信息通常要么包含强烈的情绪,很容易引起公众的共鸣,要么Q. Su等人/自然语言处理研究1(1-2)1- 13 3有争议的陈述,旨在唤起接受者的强烈情绪。因此,错误信息检测也可以通过内容和用户评论来利用情感分析。Guo等[28]提出了一种基于情感的错误信息检测框架,分别学习发布者和用户的内容和评论情感表征,从而同时利用内容和社会情感进行错误信息检测。1.3. 本调查旨在对错误信息的特征和检测方法进行全面的综述。首先介绍了相关概念,强调了误报检测的重要性。然后,它使用二维模型来分解该任务:描述性分析的内部维度(即低可信度信息的表征)和预测建模的外部维度(即错误信息的自动检测)。特别地,从检测方法、特征表示和模型构建方面回顾了公开可用的数据集和最先进的技术。最后,总结了错误信息检测面临的挑战,并对未来的错误信息检测工作提出了新的展望。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective
ive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serves as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into a RNN-based neural network to do misinformation detection. The experimental results are compared against the task using only the original texts, and finally demonstrate higher performance. Fact checking is the task of assessing the truthfulness of claims especially made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, thoughmisinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false. Rumor detection is often confused with fake news detection, since rumor refers to a statement consisting of unverified information at the posting time. Rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection to first detect worth-checking statements prior to classifying the statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified. Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in the true and misrepresented information can be different, since publishers of misinformation focus more on the degree to impress the audience and the spreading speed of the information. Thus, misinformation typically either contains intense emotion which could easily resonate with the public, or Q. Su et al. / Natural Language Processing Research 1(1-2) 1–13 3 controversial statements aiming to evoke intense emotion among receivers. Thus, misinformation detection can also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose a Emotion-based misinformation Detection framework to learn contentand comment-emotion representations for publishers and users respectively so as to exploit content and social emotions simultaneously for misinformation detection. 1.3. An Overview of the Survey This survey aims to present a comprehensive review on studying misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of descriptive analysis (i.e., the characterization of low-credibility information) and the external dimension of predictive modeling (i.e., the automatic detection of misinformation). In particular, the publicly available datasets and the state-of-the-art technologies are reviewed in terms of the detection approaches, feature representations and model construction. Finally, challenges of misinformation detection are summarized andnewprospects are provided for futuremisinformation detection works.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信