提高数据集分类的准确性和可解释性:用于模糊规则生成的混合网格划分和粗糙集方法的进展

Josea Moreno Chawla, Herrera Rocío
{"title":"提高数据集分类的准确性和可解释性:用于模糊规则生成的混合网格划分和粗糙集方法的进展","authors":"Josea Moreno Chawla, Herrera Rocío","doi":"10.35335/emod.v13i1.5","DOIUrl":null,"url":null,"abstract":"Accurate and interpretable classification of datasets plays a crucial role in various domains, including healthcare, finance, and image recognition. This research focuses on enhancing accuracy and interpretability in dataset classification through the integration of hybrid grid partition and rough set methods for fuzzy rule generation. The proposed mathematical model leverages the grid partition approach to handle the curse of dimensionality and reduce dataset complexity, while the rough set method identifies essential features and generates meaningful fuzzy rules. The assigned membership values to linguistic terms further enhance interpretability. The model's accuracy and interpretability were evaluated using a diabetes dataset, achieving an accuracy rate of 85% on the validation dataset and 83% on the testing dataset. Comparative analysis demonstrated competitive performance against existing methods. The iterative refinement process contributed to the model's optimization. However, limitations include dataset dependency, parameter sensitivity, and scalability. Future research directions include advanced rule pruning techniques, optimization of model parameters, handling imbalanced datasets, incorporating feature selection, robustness and scalability evaluation, comparative studies, and real-world application validation. The proposed model presents a promising approach to enhance accuracy and interpretability in dataset classification.","PeriodicalId":262913,"journal":{"name":"International Journal of Enterprise Modelling","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Accuracy and Interpretability in Dataset Classification: Advancements in Hybrid Grid Partition and Rough Set Methods for Fuzzy Rule Generation\",\"authors\":\"Josea Moreno Chawla, Herrera Rocío\",\"doi\":\"10.35335/emod.v13i1.5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate and interpretable classification of datasets plays a crucial role in various domains, including healthcare, finance, and image recognition. This research focuses on enhancing accuracy and interpretability in dataset classification through the integration of hybrid grid partition and rough set methods for fuzzy rule generation. The proposed mathematical model leverages the grid partition approach to handle the curse of dimensionality and reduce dataset complexity, while the rough set method identifies essential features and generates meaningful fuzzy rules. The assigned membership values to linguistic terms further enhance interpretability. The model's accuracy and interpretability were evaluated using a diabetes dataset, achieving an accuracy rate of 85% on the validation dataset and 83% on the testing dataset. Comparative analysis demonstrated competitive performance against existing methods. The iterative refinement process contributed to the model's optimization. However, limitations include dataset dependency, parameter sensitivity, and scalability. Future research directions include advanced rule pruning techniques, optimization of model parameters, handling imbalanced datasets, incorporating feature selection, robustness and scalability evaluation, comparative studies, and real-world application validation. The proposed model presents a promising approach to enhance accuracy and interpretability in dataset classification.\",\"PeriodicalId\":262913,\"journal\":{\"name\":\"International Journal of Enterprise Modelling\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Enterprise Modelling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35335/emod.v13i1.5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Enterprise Modelling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35335/emod.v13i1.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

准确和可解释的数据集分类在各个领域(包括医疗保健、金融和图像识别)中起着至关重要的作用。本研究将混合网格划分和粗糙集方法相结合,用于模糊规则生成,提高数据集分类的准确性和可解释性。该数学模型利用网格划分方法处理数据的维数问题,降低数据集的复杂度,而粗糙集方法识别基本特征,生成有意义的模糊规则。为语言术语指定的隶属值进一步增强了可解释性。使用糖尿病数据集评估模型的准确性和可解释性,验证数据集的准确率为85%,测试数据集的准确率为83%。对比分析显示了与现有方法相比的竞争表现。迭代细化过程有助于模型的优化。然而,限制包括数据集依赖性、参数敏感性和可伸缩性。未来的研究方向包括先进的规则修剪技术、模型参数优化、不平衡数据集处理、融合特征选择、鲁棒性和可扩展性评估、比较研究以及实际应用验证。该模型为提高数据集分类的准确性和可解释性提供了一种有前途的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing Accuracy and Interpretability in Dataset Classification: Advancements in Hybrid Grid Partition and Rough Set Methods for Fuzzy Rule Generation
Accurate and interpretable classification of datasets plays a crucial role in various domains, including healthcare, finance, and image recognition. This research focuses on enhancing accuracy and interpretability in dataset classification through the integration of hybrid grid partition and rough set methods for fuzzy rule generation. The proposed mathematical model leverages the grid partition approach to handle the curse of dimensionality and reduce dataset complexity, while the rough set method identifies essential features and generates meaningful fuzzy rules. The assigned membership values to linguistic terms further enhance interpretability. The model's accuracy and interpretability were evaluated using a diabetes dataset, achieving an accuracy rate of 85% on the validation dataset and 83% on the testing dataset. Comparative analysis demonstrated competitive performance against existing methods. The iterative refinement process contributed to the model's optimization. However, limitations include dataset dependency, parameter sensitivity, and scalability. Future research directions include advanced rule pruning techniques, optimization of model parameters, handling imbalanced datasets, incorporating feature selection, robustness and scalability evaluation, comparative studies, and real-world application validation. The proposed model presents a promising approach to enhance accuracy and interpretability in dataset classification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信