Investigating Tree-Based Classifiers and Selected Ensemble Learning on Iris Flower Species Classification

Ramoni Tirimisiyu Amosa, Adekiigbe Adebanjo, Fabiyi Aderanti Alifat, Olorunlomerue Adam Biodun, Oni Esther Kemi, Adejola Aanu Adeyinka, Adigun Olajide Israel, Joseph Babatunde Isaac
{"title":"Investigating Tree-Based Classifiers and Selected Ensemble Learning on Iris Flower Species Classification","authors":"Ramoni Tirimisiyu Amosa, Adekiigbe Adebanjo, Fabiyi Aderanti Alifat, Olorunlomerue Adam Biodun, Oni Esther Kemi, Adejola Aanu Adeyinka, Adigun Olajide Israel, Joseph Babatunde Isaac","doi":"10.14445/23488387/ijcse-v10i5p105","DOIUrl":null,"url":null,"abstract":"- Eloquence, hope, knowledge, the ability to communicate effectively, and faith are some of the meanings associated with the iris flower in the language of flowers. Iris has different species types, and each type has its own medicinal purpose. Classifying the flower has become a serious task for researchers due to the high volume of datasets (big data), hence the introduction of machine learning algorithms for accurate and reliable classification. This paper focuses on the classification of the Iris flower using five tree-based algorithms; Best First Tree (BFTree), Least Absolute deviation Tree (LADTree), Cost-Sensitive Decision Forest (CSForest), Functional Tree (FT) and Random Tree (RT). Three selected ensemble learning (Bagging, Dagging and cascade generalisation) were equally implemented in the algorithm. The dataset that was utilised in this investigation is open source and may be downloaded without cost from a public repository (kaggle.com). The result of the classification showed that the FT classifiers outperform other tree-based classifiers with an accuracy of 96.67% and an AUC of 1.00. The ensemble algorithm has a significant impact on the performance of single classifiers (tree-based). Outperform tree based. AUC/ROC (Area Under Curve/Receiver Operating Characteristics) was used to evaluate the algorithm's performance. Bagging ensemble outperforms other ensembles (Dagging and Cascade) with an accuracy of 96.00% and AUC of 1.00.","PeriodicalId":186366,"journal":{"name":"International Journal of Computer Science and Engineering","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14445/23488387/ijcse-v10i5p105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

- Eloquence, hope, knowledge, the ability to communicate effectively, and faith are some of the meanings associated with the iris flower in the language of flowers. Iris has different species types, and each type has its own medicinal purpose. Classifying the flower has become a serious task for researchers due to the high volume of datasets (big data), hence the introduction of machine learning algorithms for accurate and reliable classification. This paper focuses on the classification of the Iris flower using five tree-based algorithms; Best First Tree (BFTree), Least Absolute deviation Tree (LADTree), Cost-Sensitive Decision Forest (CSForest), Functional Tree (FT) and Random Tree (RT). Three selected ensemble learning (Bagging, Dagging and cascade generalisation) were equally implemented in the algorithm. The dataset that was utilised in this investigation is open source and may be downloaded without cost from a public repository (kaggle.com). The result of the classification showed that the FT classifiers outperform other tree-based classifiers with an accuracy of 96.67% and an AUC of 1.00. The ensemble algorithm has a significant impact on the performance of single classifiers (tree-based). Outperform tree based. AUC/ROC (Area Under Curve/Receiver Operating Characteristics) was used to evaluate the algorithm's performance. Bagging ensemble outperforms other ensembles (Dagging and Cascade) with an accuracy of 96.00% and AUC of 1.00.
基于树的分类器和选择集成学习在鸢尾花分类中的应用研究
-雄辩、希望、知识、有效沟通的能力和信仰是鸢尾花在花的语言中的一些含义。鸢尾有不同的种类类型,每种类型都有自己的药用目的。由于大量的数据集(大数据),对花进行分类已经成为研究人员的一项严肃的任务,因此引入机器学习算法来进行准确可靠的分类。本文重点研究了鸢尾花的五种树分类算法;最佳第一树(BFTree)、最小绝对偏差树(LADTree)、成本敏感决策树(CSForest)、功能树(FT)和随机树(RT)。三种选择的集成学习(Bagging、Dagging和级联泛化)在算法中被平等地实现。本调查中使用的数据集是开源的,可以从公共存储库(kaggle.com)免费下载。分类结果表明,FT分类器的准确率为96.67%,AUC为1.00,优于其他基于树的分类器。集成算法对单分类器(基于树)的性能有显著影响。优于基于树的。采用AUC/ROC(曲线下面积/接收者工作特征)来评价算法的性能。Bagging集合优于其他集合(Dagging和Cascade),准确率为96.00%,AUC为1.00。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信