利用多智能体系统检测社交网络中的网络欺凌

V. Nahar, Xue Li, H. Zhang, C. Pang
{"title":"利用多智能体系统检测社交网络中的网络欺凌","authors":"V. Nahar, Xue Li, H. Zhang, C. Pang","doi":"10.3233/WIA-140301","DOIUrl":null,"url":null,"abstract":"State-of-the-art studies on cyberbullying detection, using text classification, predominantly take it for granted that streaming text can be completely labelled. However, the rapid growth of unlabelled data generated in real time from online content renders this virtually impossible. In this paper, we propose a session-based framework for automatic detection of cyberbullying within the large volume of unlabelled streaming text. Given that the streaming data from Social Networks arrives in large volume at the server system, we incorporate an ensemble of one-class classifiers in the session-based framework. System uses Multi-Agent distributed environment to process streaming data from multiple social network sources. The proposed strategy tackles real world situations, where only a few positive instances of cyberbullying are available for initial training. Our main contribution in this paper is to automatically detect cyberbullying in real world situations, where labelled data is not readily available. Initial results indicate the suggested approach is reasonably effective for detecting cyberbullying automatically on social networks. The experiments indicate that the ensemble learner outperforms the single window and fixed window approaches, while the learning process is based on positive and unlabelled data only, no negative data is available for training.","PeriodicalId":263450,"journal":{"name":"Web Intell. Agent Syst.","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Detecting cyberbullying in social networks using multi-agent system\",\"authors\":\"V. Nahar, Xue Li, H. Zhang, C. Pang\",\"doi\":\"10.3233/WIA-140301\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"State-of-the-art studies on cyberbullying detection, using text classification, predominantly take it for granted that streaming text can be completely labelled. However, the rapid growth of unlabelled data generated in real time from online content renders this virtually impossible. In this paper, we propose a session-based framework for automatic detection of cyberbullying within the large volume of unlabelled streaming text. Given that the streaming data from Social Networks arrives in large volume at the server system, we incorporate an ensemble of one-class classifiers in the session-based framework. System uses Multi-Agent distributed environment to process streaming data from multiple social network sources. The proposed strategy tackles real world situations, where only a few positive instances of cyberbullying are available for initial training. Our main contribution in this paper is to automatically detect cyberbullying in real world situations, where labelled data is not readily available. Initial results indicate the suggested approach is reasonably effective for detecting cyberbullying automatically on social networks. The experiments indicate that the ensemble learner outperforms the single window and fixed window approaches, while the learning process is based on positive and unlabelled data only, no negative data is available for training.\",\"PeriodicalId\":263450,\"journal\":{\"name\":\"Web Intell. Agent Syst.\",\"volume\":\"87 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Web Intell. Agent Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/WIA-140301\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Web Intell. Agent Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/WIA-140301","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

最新的网络欺凌检测研究使用文本分类,主要想当然地认为流文本可以完全标记。然而,在线内容实时生成的未标记数据的快速增长使得这实际上是不可能的。在本文中,我们提出了一个基于会话的框架,用于在大量未标记的流文本中自动检测网络欺凌。考虑到来自Social Networks的流数据大量到达服务器系统,我们在基于会话的框架中合并了一个单类分类器的集合。系统采用Multi-Agent分布式环境对来自多个社交网络源的流数据进行处理。拟议的策略解决了现实世界的情况,其中只有少数积极的网络欺凌实例可用于初始培训。我们在本文中的主要贡献是自动检测现实世界中的网络欺凌,其中标记数据不容易获得。初步结果表明,该方法对于自动检测社交网络上的网络欺凌是相当有效的。实验表明,集成学习器优于单窗口和固定窗口方法,而学习过程仅基于正数据和未标记数据,没有可用于训练的负数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detecting cyberbullying in social networks using multi-agent system
State-of-the-art studies on cyberbullying detection, using text classification, predominantly take it for granted that streaming text can be completely labelled. However, the rapid growth of unlabelled data generated in real time from online content renders this virtually impossible. In this paper, we propose a session-based framework for automatic detection of cyberbullying within the large volume of unlabelled streaming text. Given that the streaming data from Social Networks arrives in large volume at the server system, we incorporate an ensemble of one-class classifiers in the session-based framework. System uses Multi-Agent distributed environment to process streaming data from multiple social network sources. The proposed strategy tackles real world situations, where only a few positive instances of cyberbullying are available for initial training. Our main contribution in this paper is to automatically detect cyberbullying in real world situations, where labelled data is not readily available. Initial results indicate the suggested approach is reasonably effective for detecting cyberbullying automatically on social networks. The experiments indicate that the ensemble learner outperforms the single window and fixed window approaches, while the learning process is based on positive and unlabelled data only, no negative data is available for training.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信