Ran Li, Hongchang Chen, Shuxin Liu, Haocong Jiang, Biao Wang
{"title":"基于邻域信息系统的不完全混合数据属性约简","authors":"Ran Li, Hongchang Chen, Shuxin Liu, Haocong Jiang, Biao Wang","doi":"10.1080/03081079.2023.2256464","DOIUrl":null,"url":null,"abstract":"In an era of data-based and information-centric Industry 4.0, extracting potential knowledge and valuable information from data is central to data mining tasks. Yet, the ambiguity, imprecision, incompleteness, and hybrid in real-world data pose tremendous challenges to critical information mining. Accordingly, we propose a new Max-Correlation Min-Redundant (MCMR) attribute reduction model from the uncertainty relation of attributes to avoid information loss in incomplete mixed data. Specifically, the neighbor relations are primarily developed based on the soft computing approach of the neighborhood information system, which divides the objects into neighborhood covers to maximize the utilization of the information in the incomplete mixed data. Then, we detailly analyze the internal and external consistency relationships of the four main uncertainty functions. Based on this, a new MCMR uncertain function is designed with maximum relevance and minimum redundancy. Experiments on nine real-world datasets validate the proposed model can improve data quality by mining critical information in classification tasks and achieving optimal performance with a minimum number of attributes.","PeriodicalId":50322,"journal":{"name":"International Journal of General Systems","volume":"26 1","pages":"0"},"PeriodicalIF":2.4000,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attribute reduction for incomplete mixed data based on neighborhood information system\",\"authors\":\"Ran Li, Hongchang Chen, Shuxin Liu, Haocong Jiang, Biao Wang\",\"doi\":\"10.1080/03081079.2023.2256464\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In an era of data-based and information-centric Industry 4.0, extracting potential knowledge and valuable information from data is central to data mining tasks. Yet, the ambiguity, imprecision, incompleteness, and hybrid in real-world data pose tremendous challenges to critical information mining. Accordingly, we propose a new Max-Correlation Min-Redundant (MCMR) attribute reduction model from the uncertainty relation of attributes to avoid information loss in incomplete mixed data. Specifically, the neighbor relations are primarily developed based on the soft computing approach of the neighborhood information system, which divides the objects into neighborhood covers to maximize the utilization of the information in the incomplete mixed data. Then, we detailly analyze the internal and external consistency relationships of the four main uncertainty functions. Based on this, a new MCMR uncertain function is designed with maximum relevance and minimum redundancy. Experiments on nine real-world datasets validate the proposed model can improve data quality by mining critical information in classification tasks and achieving optimal performance with a minimum number of attributes.\",\"PeriodicalId\":50322,\"journal\":{\"name\":\"International Journal of General Systems\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2023-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of General Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/03081079.2023.2256464\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of General Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/03081079.2023.2256464","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Attribute reduction for incomplete mixed data based on neighborhood information system
In an era of data-based and information-centric Industry 4.0, extracting potential knowledge and valuable information from data is central to data mining tasks. Yet, the ambiguity, imprecision, incompleteness, and hybrid in real-world data pose tremendous challenges to critical information mining. Accordingly, we propose a new Max-Correlation Min-Redundant (MCMR) attribute reduction model from the uncertainty relation of attributes to avoid information loss in incomplete mixed data. Specifically, the neighbor relations are primarily developed based on the soft computing approach of the neighborhood information system, which divides the objects into neighborhood covers to maximize the utilization of the information in the incomplete mixed data. Then, we detailly analyze the internal and external consistency relationships of the four main uncertainty functions. Based on this, a new MCMR uncertain function is designed with maximum relevance and minimum redundancy. Experiments on nine real-world datasets validate the proposed model can improve data quality by mining critical information in classification tasks and achieving optimal performance with a minimum number of attributes.
期刊介绍:
International Journal of General Systems is a periodical devoted primarily to the publication of original research contributions to system science, basic as well as applied. However, relevant survey articles, invited book reviews, bibliographies, and letters to the editor are also published.
The principal aim of the journal is to promote original systems ideas (concepts, principles, methods, theoretical or experimental results, etc.) that are broadly applicable to various kinds of systems. The term “general system” in the name of the journal is intended to indicate this aim–the orientation to systems ideas that have a general applicability. Typical subject areas covered by the journal include: uncertainty and randomness; fuzziness and imprecision; information; complexity; inductive and deductive reasoning about systems; learning; systems analysis and design; and theoretical as well as experimental knowledge regarding various categories of systems. Submitted research must be well presented and must clearly state the contribution and novelty. Manuscripts dealing with particular kinds of systems which lack general applicability across a broad range of systems should be sent to journals specializing in the respective topics.