{"title":"优化特征选择:mRMR-Boruta/RFE混合方法的比较研究","authors":"Manu Sharma, D. Sharma","doi":"10.1109/ISCON57294.2023.10112125","DOIUrl":null,"url":null,"abstract":"Feature selection is an essential component in the data preprocessing pipeline, particularly when dealing with datasets that possess a vast array of dimensions. In this paper, we present a time efficient wrapper technique Boruta to improve the overall complexity of our feature selection process. We have combined this wrapper technique with the filter class Minimum Redundancy Maximum Relevance (mRMR) to enhance the selection of relevant features. Additionally, our scope includes refining a previously proposed hybrid model that combines filter class Minimum Redundancy Maximum Relevance (mRMR) known for faster processing speed with wrapper class Recursive Feature Elimination (RFE) known for its high classification accuracy. We demonstrated the effectiveness of our approach on a variety of datasets and showed that our model is able to identify a smaller and more interpretable subset of features while generating better performance. Our results suggest that the combination of preprocessing and hybrid feature selection model is a promising approach to process a dataset with high dimensions.","PeriodicalId":280183,"journal":{"name":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimising Feature Selection: A Comparative Study of mRMR-Boruta/RFE Hybrid Approach\",\"authors\":\"Manu Sharma, D. Sharma\",\"doi\":\"10.1109/ISCON57294.2023.10112125\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection is an essential component in the data preprocessing pipeline, particularly when dealing with datasets that possess a vast array of dimensions. In this paper, we present a time efficient wrapper technique Boruta to improve the overall complexity of our feature selection process. We have combined this wrapper technique with the filter class Minimum Redundancy Maximum Relevance (mRMR) to enhance the selection of relevant features. Additionally, our scope includes refining a previously proposed hybrid model that combines filter class Minimum Redundancy Maximum Relevance (mRMR) known for faster processing speed with wrapper class Recursive Feature Elimination (RFE) known for its high classification accuracy. We demonstrated the effectiveness of our approach on a variety of datasets and showed that our model is able to identify a smaller and more interpretable subset of features while generating better performance. Our results suggest that the combination of preprocessing and hybrid feature selection model is a promising approach to process a dataset with high dimensions.\",\"PeriodicalId\":280183,\"journal\":{\"name\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCON57294.2023.10112125\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON57294.2023.10112125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimising Feature Selection: A Comparative Study of mRMR-Boruta/RFE Hybrid Approach
Feature selection is an essential component in the data preprocessing pipeline, particularly when dealing with datasets that possess a vast array of dimensions. In this paper, we present a time efficient wrapper technique Boruta to improve the overall complexity of our feature selection process. We have combined this wrapper technique with the filter class Minimum Redundancy Maximum Relevance (mRMR) to enhance the selection of relevant features. Additionally, our scope includes refining a previously proposed hybrid model that combines filter class Minimum Redundancy Maximum Relevance (mRMR) known for faster processing speed with wrapper class Recursive Feature Elimination (RFE) known for its high classification accuracy. We demonstrated the effectiveness of our approach on a variety of datasets and showed that our model is able to identify a smaller and more interpretable subset of features while generating better performance. Our results suggest that the combination of preprocessing and hybrid feature selection model is a promising approach to process a dataset with high dimensions.