在线论坛中姿态分类的预训练方法

Jean Marie Tshimula, B. Chikhaoui, Shengrui Wang
{"title":"在线论坛中姿态分类的预训练方法","authors":"Jean Marie Tshimula, B. Chikhaoui, Shengrui Wang","doi":"10.1109/ASONAM49781.2020.9381467","DOIUrl":null,"url":null,"abstract":"Stance detection is the task of automatically determining whether the author of a piece of text is in favor of, against, or neutral towards a target such as a topic, entity, or claim. In this paper, we propose a method based on RoBERTa to classify stances by capturing the context of the discussion through the examination of pairs of stances and relational structures of debates specific to each topic within the defined window of each forum participant's interventions. Furthermore, we examine the degree of disagreement and neutrality in various debate topics to measure divergence of opinion in the course of the debate and estimate the emotional state manifested in different debate topics. We conduct extensive experiments using two publicly available datasets and demonstrate that our method considers more stance classes, provides better results and yields statistical improvements over existing techniques. Our quantitative analysis of model performance yields F-1 scores of over 0.745. Interestingly, we obtained the highest F-1 score, 0.814, on a stance class which was not taken into consideration in prior work. We report that none of the metrics utilized to measure divergence of opinion yield values exceeding 50 % and the correlations between the same topics over 10-fold cross-validation are statistically significant for the majority of them (p < 0.005). Several future research avenues are proposed.","PeriodicalId":196317,"journal":{"name":"2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Pre-training Approach for Stance Classification in Online Forums\",\"authors\":\"Jean Marie Tshimula, B. Chikhaoui, Shengrui Wang\",\"doi\":\"10.1109/ASONAM49781.2020.9381467\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Stance detection is the task of automatically determining whether the author of a piece of text is in favor of, against, or neutral towards a target such as a topic, entity, or claim. In this paper, we propose a method based on RoBERTa to classify stances by capturing the context of the discussion through the examination of pairs of stances and relational structures of debates specific to each topic within the defined window of each forum participant's interventions. Furthermore, we examine the degree of disagreement and neutrality in various debate topics to measure divergence of opinion in the course of the debate and estimate the emotional state manifested in different debate topics. We conduct extensive experiments using two publicly available datasets and demonstrate that our method considers more stance classes, provides better results and yields statistical improvements over existing techniques. Our quantitative analysis of model performance yields F-1 scores of over 0.745. Interestingly, we obtained the highest F-1 score, 0.814, on a stance class which was not taken into consideration in prior work. We report that none of the metrics utilized to measure divergence of opinion yield values exceeding 50 % and the correlations between the same topics over 10-fold cross-validation are statistically significant for the majority of them (p < 0.005). Several future research avenues are proposed.\",\"PeriodicalId\":196317,\"journal\":{\"name\":\"2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASONAM49781.2020.9381467\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASONAM49781.2020.9381467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

立场检测是自动确定一段文本的作者是支持、反对还是对目标(如主题、实体或主张)保持中立的任务。在本文中,我们提出了一种基于RoBERTa的方法,通过在每个论坛参与者的干预的定义窗口内检查特定于每个主题的辩论的立场对和关系结构来捕捉讨论的背景,从而对立场进行分类。此外,我们考察了不同辩论主题的分歧程度和中立程度,以衡量辩论过程中的意见分歧,并估计不同辩论主题所表现出的情绪状态。我们使用两个公开可用的数据集进行了广泛的实验,并证明我们的方法考虑了更多的姿态类,提供了更好的结果,并产生了比现有技术更好的统计改进。我们对模型性能的定量分析得出F-1分数超过0.745。有趣的是,我们在之前的工作中没有考虑到的一个姿态类上获得了最高的F-1分数0.814。我们报告说,用于衡量意见分歧产生值的指标没有一个超过50%,并且相同主题之间超过10倍交叉验证的相关性在大多数情况下具有统计显著性(p < 0.005)。提出了未来的研究方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Pre-training Approach for Stance Classification in Online Forums
Stance detection is the task of automatically determining whether the author of a piece of text is in favor of, against, or neutral towards a target such as a topic, entity, or claim. In this paper, we propose a method based on RoBERTa to classify stances by capturing the context of the discussion through the examination of pairs of stances and relational structures of debates specific to each topic within the defined window of each forum participant's interventions. Furthermore, we examine the degree of disagreement and neutrality in various debate topics to measure divergence of opinion in the course of the debate and estimate the emotional state manifested in different debate topics. We conduct extensive experiments using two publicly available datasets and demonstrate that our method considers more stance classes, provides better results and yields statistical improvements over existing techniques. Our quantitative analysis of model performance yields F-1 scores of over 0.745. Interestingly, we obtained the highest F-1 score, 0.814, on a stance class which was not taken into consideration in prior work. We report that none of the metrics utilized to measure divergence of opinion yield values exceeding 50 % and the correlations between the same topics over 10-fold cross-validation are statistically significant for the majority of them (p < 0.005). Several future research avenues are proposed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信