{"title":"Interpretable Granulation of Medical Data with DC","authors":"Corrado Mencar, A. Consiglio, A. Fanelli","doi":"10.1109/HIS.2007.15","DOIUrl":null,"url":null,"abstract":"In this paper we describe an approach for mining interpretable diagnostic rules through a fuzzy information granulation process. Specifically, this process is performed by the DC* algorithm (Double Clustering with A*), which is aimed at mining from data a set of fuzzy information granules that satisfy a number of interpretability constraints. Such granules can be labelled with linguistic terms and used as building blocks for deriving diagnostic rules. The DC* is based on two clustering steps. The first step applies the LVQ1 algorithm to find a number of prototypes in the input space, which represent hidden relationships among data. The second clustering step .based on the A* search. takes place on the projections of such prototypes, and is aimed at finding an optimal number of granules that verify interpretability constraints. The application of DC* to two well-known medical datasets provided a set of intelligible rules with satisfactory accuracy.","PeriodicalId":359991,"journal":{"name":"7th International Conference on Hybrid Intelligent Systems (HIS 2007)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"7th International Conference on Hybrid Intelligent Systems (HIS 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HIS.2007.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper we describe an approach for mining interpretable diagnostic rules through a fuzzy information granulation process. Specifically, this process is performed by the DC* algorithm (Double Clustering with A*), which is aimed at mining from data a set of fuzzy information granules that satisfy a number of interpretability constraints. Such granules can be labelled with linguistic terms and used as building blocks for deriving diagnostic rules. The DC* is based on two clustering steps. The first step applies the LVQ1 algorithm to find a number of prototypes in the input space, which represent hidden relationships among data. The second clustering step .based on the A* search. takes place on the projections of such prototypes, and is aimed at finding an optimal number of granules that verify interpretability constraints. The application of DC* to two well-known medical datasets provided a set of intelligible rules with satisfactory accuracy.
本文描述了一种通过模糊信息粒化过程挖掘可解释诊断规则的方法。具体来说,这个过程是由DC*算法(Double Clustering with A*)完成的,该算法旨在从数据中挖掘出一组满足许多可解释性约束的模糊信息颗粒。这些颗粒可以用语言术语标记,并用作派生诊断规则的构建块。DC*基于两个聚类步骤。第一步使用LVQ1算法在输入空间中找到一些原型,这些原型表示数据之间的隐藏关系。第二步聚类是基于A*搜索。发生在这种原型的投影上,旨在找到验证可解释性约束的颗粒的最佳数量。将DC*应用于两个知名的医疗数据集,提供了一组具有令人满意精度的可理解规则。