基于FPGA的多值决策图随机森林

2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL) Pub Date : 2017-05-22 DOI:10.1109/ISMVL.2017.40

Hiroki Nakahara, Akira Jinguji, S. Sato, Tsutomu Sasao

{"title":"基于FPGA的多值决策图随机森林","authors":"Hiroki Nakahara, Akira Jinguji, S. Sato, Tsutomu Sasao","doi":"10.1109/ISMVL.2017.40","DOIUrl":null,"url":null,"abstract":"A random forest (RF) is a kind of an ensemblemachine learning algorithm used for a classification and aregression. It consists of multiple decision trees that are built fromrandomly sampled data. The RF has a simple, fast learning, andidentification capability compared with other machine learningalgorithms. It is widely used for various recognition systems. Theconventional RF consisted of binary decision trees (BDTs), whilein this paper, we used a multi-valued decision diagrams (MDDs). In the MDD, each variable appears only once on a path, however, in the BDT, some variable may appear multiple times. Sincethe path length is short in the MDD, it can be evaluated at ahigh speed. The disadvantage is that the number of nodes inthe MDD increases with O(2N), where N denotes the numberof input variables. Fortunately, random forests encourage to usethe small number of N for each tree in order to avoid overfitting. Therefore, in several data sets used in the experimental, the number of nodes did not increase even if the MDD wasused. To reduce the development time, the Altera SDK forOpenCL (AOCL), a kind of a high-level synthesis tool, was used. To accelerate the RF classification using the AOCL, we proposethe fully pipelined architecture to increase the memory bandwidthusing on-chip memories on the FPGA. Also, we apply optimalprecision fixed point representation instead of 32 bit floating pointone. We compared the performance with the CPU and the GPUimplementations. As for the LPS (lookups per second), the FPGArealization was 10.7 times faster than the GPU one, and it was14.0 times faster than the CPU one. As for the LPS per powerconsumption, the FPGA realization was 61.3 times better thanthe GPU one, and it was 12.1 times better than the CPU one.","PeriodicalId":393724,"journal":{"name":"2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"A Random Forest Using a Multi-valued Decision Diagram on an FPGA\",\"authors\":\"Hiroki Nakahara, Akira Jinguji, S. Sato, Tsutomu Sasao\",\"doi\":\"10.1109/ISMVL.2017.40\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A random forest (RF) is a kind of an ensemblemachine learning algorithm used for a classification and aregression. It consists of multiple decision trees that are built fromrandomly sampled data. The RF has a simple, fast learning, andidentification capability compared with other machine learningalgorithms. It is widely used for various recognition systems. Theconventional RF consisted of binary decision trees (BDTs), whilein this paper, we used a multi-valued decision diagrams (MDDs). In the MDD, each variable appears only once on a path, however, in the BDT, some variable may appear multiple times. Sincethe path length is short in the MDD, it can be evaluated at ahigh speed. The disadvantage is that the number of nodes inthe MDD increases with O(2N), where N denotes the numberof input variables. Fortunately, random forests encourage to usethe small number of N for each tree in order to avoid overfitting. Therefore, in several data sets used in the experimental, the number of nodes did not increase even if the MDD wasused. To reduce the development time, the Altera SDK forOpenCL (AOCL), a kind of a high-level synthesis tool, was used. To accelerate the RF classification using the AOCL, we proposethe fully pipelined architecture to increase the memory bandwidthusing on-chip memories on the FPGA. Also, we apply optimalprecision fixed point representation instead of 32 bit floating pointone. We compared the performance with the CPU and the GPUimplementations. As for the LPS (lookups per second), the FPGArealization was 10.7 times faster than the GPU one, and it was14.0 times faster than the CPU one. As for the LPS per powerconsumption, the FPGA realization was 61.3 times better thanthe GPU one, and it was 12.1 times better than the CPU one.\",\"PeriodicalId\":393724,\"journal\":{\"name\":\"2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISMVL.2017.40\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISMVL.2017.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

摘要

随机森林(RF)是一种用于分类和回归的集成机器学习算法。它由随机采样数据构建的多个决策树组成。与其他机器学习算法相比，RF具有简单、快速的学习和识别能力。它被广泛应用于各种识别系统。传统的决策树是由二叉决策树(bdt)组成的，而本文使用了多值决策图(mdd)。在MDD中，每个变量只在路径上出现一次，然而在BDT中，有些变量可能出现多次。由于MDD中的路径长度很短，因此可以高速计算。缺点是MDD中的节点数量随着O(2N)的增加而增加，其中N表示输入变量的数量。幸运的是，随机森林鼓励对每棵树使用少量的N，以避免过拟合。因此，在实验中使用的几个数据集中，即使使用了MDD，节点数也没有增加。为了缩短开发时间，使用了Altera SDK forOpenCL (AOCL)这一高级综合工具。为了加速使用AOCL的射频分类，我们提出了全流水线架构，以增加FPGA上的片上存储器的内存带宽。此外，我们采用最佳精度的定点表示代替32位浮点数。我们将性能与CPU和gpu实现进行了比较。在LPS(每秒查找次数)方面，fpga比GPU快10.7倍，比CPU快14.0倍。在每功耗LPS方面，FPGA实现比GPU高61.3倍，比CPU高12.1倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Random Forest Using a Multi-valued Decision Diagram on an FPGA

A random forest (RF) is a kind of an ensemblemachine learning algorithm used for a classification and aregression. It consists of multiple decision trees that are built fromrandomly sampled data. The RF has a simple, fast learning, andidentification capability compared with other machine learningalgorithms. It is widely used for various recognition systems. Theconventional RF consisted of binary decision trees (BDTs), whilein this paper, we used a multi-valued decision diagrams (MDDs). In the MDD, each variable appears only once on a path, however, in the BDT, some variable may appear multiple times. Sincethe path length is short in the MDD, it can be evaluated at ahigh speed. The disadvantage is that the number of nodes inthe MDD increases with O(2N), where N denotes the numberof input variables. Fortunately, random forests encourage to usethe small number of N for each tree in order to avoid overfitting. Therefore, in several data sets used in the experimental, the number of nodes did not increase even if the MDD wasused. To reduce the development time, the Altera SDK forOpenCL (AOCL), a kind of a high-level synthesis tool, was used. To accelerate the RF classification using the AOCL, we proposethe fully pipelined architecture to increase the memory bandwidthusing on-chip memories on the FPGA. Also, we apply optimalprecision fixed point representation instead of 32 bit floating pointone. We compared the performance with the CPU and the GPUimplementations. As for the LPS (lookups per second), the FPGArealization was 10.7 times faster than the GPU one, and it was14.0 times faster than the CPU one. As for the LPS per powerconsumption, the FPGA realization was 61.3 times better thanthe GPU one, and it was 12.1 times better than the CPU one.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE 47th International Symposium on Multiple-Valued Logic (ISMVL)

自引率

0.00%

发文量