A mechanism to detect urdu Spam emails

Ayesha Akhtar, Ghulam Rasool Tahir, Khadija Shakeel
{"title":"A mechanism to detect urdu Spam emails","authors":"Ayesha Akhtar, Ghulam Rasool Tahir, Khadija Shakeel","doi":"10.1109/UEMCON.2017.8249019","DOIUrl":null,"url":null,"abstract":"Electronic mail (Email) is being used for communication in all over the world as a basic communicational media. Few decades ago English was the only language of email but now almost facility of each language for email is present. Now people can use Urdu language as medium of Emails, social media and blog discussions. This Paper finds approaches of Spam/Ham categorization of English Emails are not able to categorize the Urdu emails because Urdu and English script are totally different in many aspects, so these are not directly applicable to Urdu. To categorization of emails into spam and ham is called Emails Spam filtering. Moreover, we need to develop tool by using machine learning technique that is appropriate in Urdu Language. In our work we have performed spam classification for Emails in Urdu language. We have collected different Spam and Ham Urdu Emails from different users. Most of the algorithms and techniques that are used for Spam classification in English and others languages are discussed and evaluated from different countries in this paper.","PeriodicalId":403890,"journal":{"name":"2017 IEEE 8th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference (UEMCON)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 8th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference (UEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UEMCON.2017.8249019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Electronic mail (Email) is being used for communication in all over the world as a basic communicational media. Few decades ago English was the only language of email but now almost facility of each language for email is present. Now people can use Urdu language as medium of Emails, social media and blog discussions. This Paper finds approaches of Spam/Ham categorization of English Emails are not able to categorize the Urdu emails because Urdu and English script are totally different in many aspects, so these are not directly applicable to Urdu. To categorization of emails into spam and ham is called Emails Spam filtering. Moreover, we need to develop tool by using machine learning technique that is appropriate in Urdu Language. In our work we have performed spam classification for Emails in Urdu language. We have collected different Spam and Ham Urdu Emails from different users. Most of the algorithms and techniques that are used for Spam classification in English and others languages are discussed and evaluated from different countries in this paper.
检测乌尔都垃圾邮件的机制
电子邮件(Email)作为一种基本的通信媒介在世界各地被用于通信。几十年前,英语是电子邮件的唯一语言,但现在几乎每种语言都能收发电子邮件。现在人们可以使用乌尔都语作为电子邮件、社交媒体和博客讨论的媒介。本文发现英语电子邮件的Spam/Ham分类方法不能对乌尔都语电子邮件进行分类,因为乌尔都语和英语文字在很多方面完全不同,所以这些方法不能直接适用于乌尔都语。将电子邮件分为垃圾邮件和非垃圾邮件,称为垃圾邮件过滤。此外,我们需要利用机器学习技术开发适合乌尔都语的工具。在我们的工作中,我们对乌尔都语的电子邮件进行了垃圾邮件分类。我们从不同的用户收集了不同的垃圾邮件和火腿乌尔都邮件。本文从不同的国家对英语和其他语言的垃圾邮件分类中使用的大多数算法和技术进行了讨论和评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信