Zishen Wan, Yuyang Zhang, A. Raychowdhury, Bo Yu, Yanjun Zhang, Shaoshan Liu
{"title":"基于FPGA平台的自主机器节能四摄像头视觉系统","authors":"Zishen Wan, Yuyang Zhang, A. Raychowdhury, Bo Yu, Yanjun Zhang, Shaoshan Liu","doi":"10.1109/AICAS51828.2021.9458486","DOIUrl":null,"url":null,"abstract":"In our past few years’ of commercial deployment experiences, we identify localization as a critical task in autonomous machine applications, and a great acceleration target. In this paper, based on the observation that the visual frontend is a major performance and energy consumption bottleneck, we present our design and implementation of an energy-efficient hardware architecture for ORB (Oriented-Fast and Rotated-BRIEF) based localization system on FPGAs. To support our multi-sensor autonomous machine localization system, we present hardware synchronization, frame-multiplexing, and parallelization techniques, which are integrated in our design. Compared to Nvidia TX1 and Intel i7, our FPGA-based implementation achieves $5.6\\times$ and $3.4\\times$ speedup, as well as $3.0\\times$ and $34.6\\times$ power reduction, respectively.","PeriodicalId":173204,"journal":{"name":"2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"An Energy-Efficient Quad-Camera Visual System for Autonomous Machines on FPGA Platform\",\"authors\":\"Zishen Wan, Yuyang Zhang, A. Raychowdhury, Bo Yu, Yanjun Zhang, Shaoshan Liu\",\"doi\":\"10.1109/AICAS51828.2021.9458486\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In our past few years’ of commercial deployment experiences, we identify localization as a critical task in autonomous machine applications, and a great acceleration target. In this paper, based on the observation that the visual frontend is a major performance and energy consumption bottleneck, we present our design and implementation of an energy-efficient hardware architecture for ORB (Oriented-Fast and Rotated-BRIEF) based localization system on FPGAs. To support our multi-sensor autonomous machine localization system, we present hardware synchronization, frame-multiplexing, and parallelization techniques, which are integrated in our design. Compared to Nvidia TX1 and Intel i7, our FPGA-based implementation achieves $5.6\\\\times$ and $3.4\\\\times$ speedup, as well as $3.0\\\\times$ and $34.6\\\\times$ power reduction, respectively.\",\"PeriodicalId\":173204,\"journal\":{\"name\":\"2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICAS51828.2021.9458486\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICAS51828.2021.9458486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
摘要
在我们过去几年的商业部署经验中,我们认为本地化是自主机器应用中的关键任务,也是一个伟大的加速目标。在本文中,基于观察到视觉前端是主要的性能和能耗瓶颈,我们提出了基于fpga的ORB (Oriented-Fast and roted - brief)定位系统的节能硬件架构的设计和实现。为了支持我们的多传感器自主机器定位系统,我们提出了硬件同步,帧多路复用和并行化技术,这些技术集成在我们的设计中。与Nvidia TX1和Intel i7相比,我们基于fpga的实现分别实现了5.6倍和3.4倍的加速,以及3.0倍和34.6倍的功耗降低。
An Energy-Efficient Quad-Camera Visual System for Autonomous Machines on FPGA Platform
In our past few years’ of commercial deployment experiences, we identify localization as a critical task in autonomous machine applications, and a great acceleration target. In this paper, based on the observation that the visual frontend is a major performance and energy consumption bottleneck, we present our design and implementation of an energy-efficient hardware architecture for ORB (Oriented-Fast and Rotated-BRIEF) based localization system on FPGAs. To support our multi-sensor autonomous machine localization system, we present hardware synchronization, frame-multiplexing, and parallelization techniques, which are integrated in our design. Compared to Nvidia TX1 and Intel i7, our FPGA-based implementation achieves $5.6\times$ and $3.4\times$ speedup, as well as $3.0\times$ and $34.6\times$ power reduction, respectively.