Xiangtao Liu, Xueqi Cheng, Jingyuan Li, Haijun Zhai, Shuo Bai
{"title":"Identifying vulgar content in eMule network through text classification","authors":"Xiangtao Liu, Xueqi Cheng, Jingyuan Li, Haijun Zhai, Shuo Bai","doi":"10.1109/ISI.2010.5484751","DOIUrl":null,"url":null,"abstract":"Through years of development, the cyberspace has been dominated by traffic of peer-to-peer (P2P) file sharing applications. Among them, eMule is especially favored by millions of P2P users all over the world. However, it is very difficult to manage the content which is delivered through eMule due to its distributed property, thus a large number of vulgar content (e.g., pornographic and violent files) is existing in eMule. Since children and adolescents are the main force of eMule users, it is quite necessary to provide an efficient method to identify and filter the vulgar content for the sake of innocent children and adolescents. In this study, an automatic framework based on text classification is proposed to identify and filter vulgar content in eMule. Filename is used as the feature to carry out the elementary research on the effectiveness of our framework, although filename may be changed freely by eMule users. We aim to achieve high accuracy when identifying and filtering vulgar content, thus to raise the quality of the content delivered in eMule to a higher level.","PeriodicalId":434501,"journal":{"name":"2010 IEEE International Conference on Intelligence and Security Informatics","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI.2010.5484751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Through years of development, the cyberspace has been dominated by traffic of peer-to-peer (P2P) file sharing applications. Among them, eMule is especially favored by millions of P2P users all over the world. However, it is very difficult to manage the content which is delivered through eMule due to its distributed property, thus a large number of vulgar content (e.g., pornographic and violent files) is existing in eMule. Since children and adolescents are the main force of eMule users, it is quite necessary to provide an efficient method to identify and filter the vulgar content for the sake of innocent children and adolescents. In this study, an automatic framework based on text classification is proposed to identify and filter vulgar content in eMule. Filename is used as the feature to carry out the elementary research on the effectiveness of our framework, although filename may be changed freely by eMule users. We aim to achieve high accuracy when identifying and filtering vulgar content, thus to raise the quality of the content delivered in eMule to a higher level.