{"title":"基于干预相关比的新型随机森林变体","authors":"Tao Zhang;Tao Li;Zaifa Xue;Xin Lu;Le Gao","doi":"10.1109/TETCI.2024.3369320","DOIUrl":null,"url":null,"abstract":"Random forest (RF) is a classical machine learning model, and many variants have been proposed to improve the performance or interpretability in recent years. To improve the classification performance and interpretability of RF under the premise of consistency, a novel RF variant named intervention correlation ratio random forest (ICR\n<sup>2</sup>\nF) is proposed. First, intervention correlation ratio (ICR) is proposed as a novel causality evaluation method by the ratio of pre- and post intervention on features which is used to select features and thresholds to divide a non-leaf node when building a decision tree. And then, decision trees are built based on ICR to construct ICR\n<sup>2</sup>\nF through ensemble learning. In addition, ICR\n<sup>2</sup>\nF is proven to satisfy consistency in exploring random forest in theory. Finally, experimental results on 20 UCI datasets have shown that ICR\n<sup>2</sup>\nF surpasses classical classifiers and the latest RF variants in classification performance under the premise of consistency and interpretability.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 3","pages":"2541-2553"},"PeriodicalIF":5.3000,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Random Forest Variant Based on Intervention Correlation Ratio\",\"authors\":\"Tao Zhang;Tao Li;Zaifa Xue;Xin Lu;Le Gao\",\"doi\":\"10.1109/TETCI.2024.3369320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Random forest (RF) is a classical machine learning model, and many variants have been proposed to improve the performance or interpretability in recent years. To improve the classification performance and interpretability of RF under the premise of consistency, a novel RF variant named intervention correlation ratio random forest (ICR\\n<sup>2</sup>\\nF) is proposed. First, intervention correlation ratio (ICR) is proposed as a novel causality evaluation method by the ratio of pre- and post intervention on features which is used to select features and thresholds to divide a non-leaf node when building a decision tree. And then, decision trees are built based on ICR to construct ICR\\n<sup>2</sup>\\nF through ensemble learning. In addition, ICR\\n<sup>2</sup>\\nF is proven to satisfy consistency in exploring random forest in theory. Finally, experimental results on 20 UCI datasets have shown that ICR\\n<sup>2</sup>\\nF surpasses classical classifiers and the latest RF variants in classification performance under the premise of consistency and interpretability.\",\"PeriodicalId\":13135,\"journal\":{\"name\":\"IEEE Transactions on Emerging Topics in Computational Intelligence\",\"volume\":\"8 3\",\"pages\":\"2541-2553\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Emerging Topics in Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10466613/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10466613/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A Novel Random Forest Variant Based on Intervention Correlation Ratio
Random forest (RF) is a classical machine learning model, and many variants have been proposed to improve the performance or interpretability in recent years. To improve the classification performance and interpretability of RF under the premise of consistency, a novel RF variant named intervention correlation ratio random forest (ICR
2
F) is proposed. First, intervention correlation ratio (ICR) is proposed as a novel causality evaluation method by the ratio of pre- and post intervention on features which is used to select features and thresholds to divide a non-leaf node when building a decision tree. And then, decision trees are built based on ICR to construct ICR
2
F through ensemble learning. In addition, ICR
2
F is proven to satisfy consistency in exploring random forest in theory. Finally, experimental results on 20 UCI datasets have shown that ICR
2
F surpasses classical classifiers and the latest RF variants in classification performance under the premise of consistency and interpretability.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.