Hybrid Recommender System for Detection of Rare Cases Applied to Pulsar Candidate Selection

Di Pang, K. Goseva-Popstojanova, M. Mclaughlin
{"title":"Hybrid Recommender System for Detection of Rare Cases Applied to Pulsar Candidate Selection","authors":"Di Pang, K. Goseva-Popstojanova, M. Mclaughlin","doi":"10.1109/DSAA53316.2021.9564139","DOIUrl":null,"url":null,"abstract":"Detection of extremely rare cases is a challenging problem for most machine learning algorithms, especially if class overlapping is present. In this paper we propose a hybrid recommender system that uses a target rare case to state users' requirements and ranks the candidates using a similarity function which is calculated as a weighted sum of individual feature similarities. Specifically, the weight of each feature is computed as a product of its association with the class label and the outlyingness of its value. We apply this hybrid recommender system on the radio pulsar candidate selection problem, for detection of two different types of rare cases: low signal-to-noise (S/N) pulsars and Fast Radio Bursts (FRBs). Our results show that the proposed approach successfully detects both low S/N pulsars and FRBs. When there is class overlapping, as in case of low S/N pulsars, treating rare feature values as outliers and increasing their weights in the similarity function improve the detection performance. For FRBs, which compared to the low S/N pulsars are relatively more distinguishable from the non-astrophysical signals, uniform weighting outperformed the feature-weighting methods. The proposed hybrid recommender system can be used in other application domains that share similar requirements such as high recall and face similar challenges such as class imbalance and class overlapping.","PeriodicalId":129612,"journal":{"name":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSAA53316.2021.9564139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Detection of extremely rare cases is a challenging problem for most machine learning algorithms, especially if class overlapping is present. In this paper we propose a hybrid recommender system that uses a target rare case to state users' requirements and ranks the candidates using a similarity function which is calculated as a weighted sum of individual feature similarities. Specifically, the weight of each feature is computed as a product of its association with the class label and the outlyingness of its value. We apply this hybrid recommender system on the radio pulsar candidate selection problem, for detection of two different types of rare cases: low signal-to-noise (S/N) pulsars and Fast Radio Bursts (FRBs). Our results show that the proposed approach successfully detects both low S/N pulsars and FRBs. When there is class overlapping, as in case of low S/N pulsars, treating rare feature values as outliers and increasing their weights in the similarity function improve the detection performance. For FRBs, which compared to the low S/N pulsars are relatively more distinguishable from the non-astrophysical signals, uniform weighting outperformed the feature-weighting methods. The proposed hybrid recommender system can be used in other application domains that share similar requirements such as high recall and face similar challenges such as class imbalance and class overlapping.
罕见情况检测混合推荐系统在脉冲星候选体选择中的应用
对于大多数机器学习算法来说,检测极其罕见的情况是一个具有挑战性的问题,特别是当存在类重叠时。在本文中,我们提出了一种混合推荐系统,该系统使用目标罕见情况来陈述用户的需求,并使用相似度函数对候选对象进行排名,该相似度函数计算为单个特征相似度的加权和。具体来说,每个特征的权重被计算为其与类标签的关联及其值的离群值的乘积。我们将这种混合推荐系统应用于射电脉冲星的候选选择问题,用于检测两种不同类型的罕见情况:低信噪比(S/N)脉冲星和快速射电暴(frb)。结果表明,该方法能够成功地探测到低信噪比脉冲星和快速射电暴。当存在类重叠时,如低信噪比脉冲星,将稀有特征值作为离群值,在相似函数中增加其权重,可以提高检测性能。对于快速射电暴,相对于低信噪比脉冲星,快速射电暴与非天体物理信号的区别更明显,均匀加权方法优于特征加权方法。所提出的混合推荐系统可用于其他具有类似要求的应用领域,如高召回率,并面临类似的挑战,如类不平衡和类重叠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信