{"title":"Rare Association Rules Mining of Diabetic Complications Based on Improved Rarity Algorithm","authors":"Qiao Pan, Lan Xiang, Yanhong Jin","doi":"10.1109/ICBCB.2019.8854639","DOIUrl":null,"url":null,"abstract":"Although the frequent pattern mining has attracted widespread attention of scholars, it is undeniable that the rare pattern mining plays a significant role in many fields, such as medical, financial, and scientific fields. And it is more valuable to study the rare pattern mining, because it tends to find some unknown and unexpected associations. There are some previous algorithms of rare itemsets mining, however, Arima spends much time and Rarity wastes much space. So based on the Rarity algorithm, this paper presents an improved top-down approach to efficiently mine all rare itemsets and their association rules, which uses the graph structure to indicate all combinations of existing items in the database, defines a pattern matrix to record all itemsets and the support_count, and combines the hash table to accelerate support calculation to quickly find all rare itemsets, and then generate all patterns to choose useful rules according to their interesting rate. In the experiment, this paper uses the real diabetic clinical data to verify this improved approach and mines some useful rules among the diabetic complications. Moreover, compared with the two methods mentioned above, this method decreases much time and space complexity in the association rules mining.","PeriodicalId":136995,"journal":{"name":"2019 IEEE 7th International Conference on Bioinformatics and Computational Biology ( ICBCB)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 7th International Conference on Bioinformatics and Computational Biology ( ICBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBCB.2019.8854639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Although the frequent pattern mining has attracted widespread attention of scholars, it is undeniable that the rare pattern mining plays a significant role in many fields, such as medical, financial, and scientific fields. And it is more valuable to study the rare pattern mining, because it tends to find some unknown and unexpected associations. There are some previous algorithms of rare itemsets mining, however, Arima spends much time and Rarity wastes much space. So based on the Rarity algorithm, this paper presents an improved top-down approach to efficiently mine all rare itemsets and their association rules, which uses the graph structure to indicate all combinations of existing items in the database, defines a pattern matrix to record all itemsets and the support_count, and combines the hash table to accelerate support calculation to quickly find all rare itemsets, and then generate all patterns to choose useful rules according to their interesting rate. In the experiment, this paper uses the real diabetic clinical data to verify this improved approach and mines some useful rules among the diabetic complications. Moreover, compared with the two methods mentioned above, this method decreases much time and space complexity in the association rules mining.