Recommendation System for Bangla News Article with Anaphora Resolution

Kazi Wohiduzzaman, Sabir Ismail
{"title":"Recommendation System for Bangla News Article with Anaphora Resolution","authors":"Kazi Wohiduzzaman, Sabir Ismail","doi":"10.1109/CEEICT.2018.8628075","DOIUrl":null,"url":null,"abstract":"This paper represents an efficient approach for Bangla News Recommendation. In traditional Bangla news recommendation system, recommend news from the different newspapers of the same day. Actually, they contain the same news just from different sources. From the user's view, it is more desired if users get to know more diverse information on the same news. In this paper, we have represented a noble approach for recommending news on the same topic with more diverse information. At first, we have done news clustering; it is an automatic learning technique aimed to create clusters that are coherent internally, but substantially different from each other. In this approach, we have used anaphora resolution to increase the keywords frequency. We build an automatic word tagger for anaphora resolution, which can tag all nouns and pronouns with five different criteria (Number, Person, Status, Gender, and POS). Next we have counted document wise unique words to calculate tf-Idf algorithm with cosine similarity to make the recommendation. Finally, we have done three different modified technique of reverse hierarchical clustering on the same cluster to identify more distinct news which is related to the same subject.","PeriodicalId":417359,"journal":{"name":"2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT)","volume":"237 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEEICT.2018.8628075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

This paper represents an efficient approach for Bangla News Recommendation. In traditional Bangla news recommendation system, recommend news from the different newspapers of the same day. Actually, they contain the same news just from different sources. From the user's view, it is more desired if users get to know more diverse information on the same news. In this paper, we have represented a noble approach for recommending news on the same topic with more diverse information. At first, we have done news clustering; it is an automatic learning technique aimed to create clusters that are coherent internally, but substantially different from each other. In this approach, we have used anaphora resolution to increase the keywords frequency. We build an automatic word tagger for anaphora resolution, which can tag all nouns and pronouns with five different criteria (Number, Person, Status, Gender, and POS). Next we have counted document wise unique words to calculate tf-Idf algorithm with cosine similarity to make the recommendation. Finally, we have done three different modified technique of reverse hierarchical clustering on the same cluster to identify more distinct news which is related to the same subject.
带回指消解的孟加拉语新闻文章推荐系统
本文为孟加拉语新闻推荐提供了一种有效的方法。在传统的孟加拉新闻推荐系统中,推荐当天不同报纸的新闻。实际上,它们包含相同的新闻,只是来自不同的来源。从用户的角度来看,用户更希望在同一条新闻中了解到更多不同的信息。在本文中,我们代表了一种高尚的方法来推荐具有更多样化信息的同一主题的新闻。首先,我们做了新闻聚类;它是一种自动学习技术,旨在创建内部连贯但彼此之间本质不同的集群。在这种方法中,我们使用了回指分辨率来增加关键词的频率。我们构建了一个用于回指解析的自动单词标注器,它可以用五个不同的标准(Number、Person、Status、Gender和POS)标记所有名词和代词。接下来,我们统计了文档智能唯一词,计算了余弦相似度的tf-Idf算法,并给出了推荐。最后,我们在同一聚类上进行了三种不同的反向分层聚类改进技术,以识别与同一主题相关的更多不同的新闻。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信