A Novel Hybrid Approach: Grid Partition and Rough Set-Based Fuzzy Rule Generation for Accurate Dataset Classification

International Journal of Enterprise Modelling Pub Date : 2019-09-30 DOI:10.35335/emod.v13i3.15

Dalzon Marie, Moïse Etzer

{"title":"A Novel Hybrid Approach: Grid Partition and Rough Set-Based Fuzzy Rule Generation for Accurate Dataset Classification","authors":"Dalzon Marie, Moïse Etzer","doi":"10.35335/emod.v13i3.15","DOIUrl":null,"url":null,"abstract":"Accurate dataset classification is a fundamental task in various domains such as machine learning, pattern recognition, and data mining. This research proposes a novel hybrid approach that combines grid partitioning, rough set-based feature reduction, and fuzzy rule generation to enhance classification accuracy and interpretability. The approach begins with the partitioning of the dataset into a grid of cells, enabling localized analysis and capturing intricate patterns. Next, rough set-based feature reduction is applied to identify essential features and reduce dimensionality. This process helps overcome the curse of dimensionality commonly associated with complex datasets. Subsequently, fuzzy rule generation is employed, leveraging linguistic variables and membership functions to represent imprecise and uncertain information. This enhances interpretability by providing transparent decision-making rules. To evaluate the effectiveness of the proposed approach, comparative analysis with traditional classification methods, including decision trees, support vector machines, and neural networks, is conducted. The results demonstrate the superiority or at least comparability of the hybrid approach in terms of classification accuracy, computational complexity, and interpretability. However, it is essential to acknowledge the limitations of the research, such as the sensitivity to grid size and the interpretability-performance trade-off. Future research can focus on refining the approach by exploring optimal grid size selection methods and mitigating the interpretability-performance trade-off.The findings of this research contribute to the advancement of accurate dataset classification techniques. The proposed hybrid approach offers improved classification accuracy, handles complex datasets effectively, and enhances interpretability through fuzzy rules. The practical implications of the research span domains such as bioinformatics, IoT, and financial analysis. Overall, this research provides a foundation for further exploration, refinement, and real-world applications of the hybrid approach in accurate dataset classification scenarios.","PeriodicalId":262913,"journal":{"name":"International Journal of Enterprise Modelling","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Enterprise Modelling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35335/emod.v13i3.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate dataset classification is a fundamental task in various domains such as machine learning, pattern recognition, and data mining. This research proposes a novel hybrid approach that combines grid partitioning, rough set-based feature reduction, and fuzzy rule generation to enhance classification accuracy and interpretability. The approach begins with the partitioning of the dataset into a grid of cells, enabling localized analysis and capturing intricate patterns. Next, rough set-based feature reduction is applied to identify essential features and reduce dimensionality. This process helps overcome the curse of dimensionality commonly associated with complex datasets. Subsequently, fuzzy rule generation is employed, leveraging linguistic variables and membership functions to represent imprecise and uncertain information. This enhances interpretability by providing transparent decision-making rules. To evaluate the effectiveness of the proposed approach, comparative analysis with traditional classification methods, including decision trees, support vector machines, and neural networks, is conducted. The results demonstrate the superiority or at least comparability of the hybrid approach in terms of classification accuracy, computational complexity, and interpretability. However, it is essential to acknowledge the limitations of the research, such as the sensitivity to grid size and the interpretability-performance trade-off. Future research can focus on refining the approach by exploring optimal grid size selection methods and mitigating the interpretability-performance trade-off.The findings of this research contribute to the advancement of accurate dataset classification techniques. The proposed hybrid approach offers improved classification accuracy, handles complex datasets effectively, and enhances interpretability through fuzzy rules. The practical implications of the research span domains such as bioinformatics, IoT, and financial analysis. Overall, this research provides a foundation for further exploration, refinement, and real-world applications of the hybrid approach in accurate dataset classification scenarios.

查看原文本刊更多论文

一种新的混合方法:网格划分和基于粗糙集的模糊规则生成用于数据集的精确分类

准确的数据集分类是机器学习、模式识别和数据挖掘等各个领域的基本任务。本研究提出了一种结合网格划分、粗糙集特征约简和模糊规则生成的新型混合方法，以提高分类精度和可解释性。该方法首先将数据集划分为网格单元，从而实现本地化分析和捕获复杂的模式。其次，采用基于粗糙集的特征约简来识别基本特征并进行降维。这个过程有助于克服通常与复杂数据集相关的维度问题。随后，采用模糊规则生成，利用语言变量和隶属函数来表示不精确和不确定的信息。这通过提供透明的决策规则来增强可解释性。为了评估该方法的有效性，将其与传统的分类方法(包括决策树、支持向量机和神经网络)进行了比较分析。结果证明了混合方法在分类精度、计算复杂度和可解释性方面的优越性或至少可比性。然而，必须承认研究的局限性，例如对网格大小的敏感性和可解释性-性能权衡。未来的研究可以通过探索最佳网格大小选择方法和减轻可解释性与性能之间的权衡来改进该方法。本研究结果有助于数据集准确分类技术的发展。提出的混合方法提高了分类精度，有效地处理复杂数据集，并通过模糊规则增强了可解释性。该研究的实际意义涉及生物信息学、物联网和金融分析等领域。总的来说，本研究为进一步探索、改进和混合方法在准确数据集分类场景中的实际应用奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Enterprise Modelling

自引率

0.00%

发文量