{"title":"Performance analysis and optimization of decision tree classifiers on embedded devices: work-in-progress","authors":"A. Krishnakumar, Ümit Y. Ogras","doi":"10.1145/3477244.3477618","DOIUrl":null,"url":null,"abstract":"Decision trees (DTs) offer a popular implementation choice for machine learning classifiers since they are highly interpretable and easy to use. Resource management decision overheads must be minimal in embedded systems to meet latency targets and deadline constraints. While the literature has preferred hardware architectures for DTs to meet latency targets, they are not suitable for ultra-low latency applications due to their data movement overheads despite the parallelism they offer. Therefore, we propose software optimization techniques for decision trees. The proposed DTs achieve lower than 50 ns latencies for depth 12, making them highly suitable for classification in embedded resource management.","PeriodicalId":354206,"journal":{"name":"Proceedings of the 2021 International Conference on Embedded Software","volume":"179 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 International Conference on Embedded Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3477244.3477618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Decision trees (DTs) offer a popular implementation choice for machine learning classifiers since they are highly interpretable and easy to use. Resource management decision overheads must be minimal in embedded systems to meet latency targets and deadline constraints. While the literature has preferred hardware architectures for DTs to meet latency targets, they are not suitable for ultra-low latency applications due to their data movement overheads despite the parallelism they offer. Therefore, we propose software optimization techniques for decision trees. The proposed DTs achieve lower than 50 ns latencies for depth 12, making them highly suitable for classification in embedded resource management.