{"title":"Design of Agilev4 and Its Real-time Implementation on an Economic Mobile Platform","authors":"Kuan-Hung Chen, Chun-Wei Su","doi":"10.1109/ICCE53296.2022.9730566","DOIUrl":null,"url":null,"abstract":"AI visual algorithms have obtained obvious improvements in detection performance. However, the resulting extremely high computational complexity hinders the progress of AI hardware design for real-time and portable applications. In this paper, we have developed a dedicated AI detection neural network along with the necessary AI model speedup techniques. For the development goal, we propose our own backbone neural network, which we named it Agilev4. Besides, we introduce a so-called GoP (Group of Pictures) technology to accelerate the computation by removing temporal redundancy. The test with a dedicated hyper-market scenario data illustrates that the presented system can track each object well. Finally, the frame rate can be accelerated to 35.4 frames per second when we deployed Agilev4 along with the GoP mode acceleration on Jetson Nano. Compared with the original existing neural network, the speedup ratio is more than 1200%.","PeriodicalId":350644,"journal":{"name":"2022 IEEE International Conference on Consumer Electronics (ICCE)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Consumer Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE53296.2022.9730566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
AI visual algorithms have obtained obvious improvements in detection performance. However, the resulting extremely high computational complexity hinders the progress of AI hardware design for real-time and portable applications. In this paper, we have developed a dedicated AI detection neural network along with the necessary AI model speedup techniques. For the development goal, we propose our own backbone neural network, which we named it Agilev4. Besides, we introduce a so-called GoP (Group of Pictures) technology to accelerate the computation by removing temporal redundancy. The test with a dedicated hyper-market scenario data illustrates that the presented system can track each object well. Finally, the frame rate can be accelerated to 35.4 frames per second when we deployed Agilev4 along with the GoP mode acceleration on Jetson Nano. Compared with the original existing neural network, the speedup ratio is more than 1200%.
人工智能视觉算法在检测性能上取得了明显的进步。然而,由此产生的极高的计算复杂性阻碍了实时和便携式应用的人工智能硬件设计的进展。在本文中,我们开发了一个专用的人工智能检测神经网络以及必要的人工智能模型加速技术。为了实现开发目标,我们提出了自己的骨干神经网络,我们将其命名为Agilev4。此外,我们引入了所谓的GoP (Group of Pictures)技术,通过消除时间冗余来加快计算速度。用一个专门的超级市场场景数据进行了测试,结果表明所提出的系统可以很好地跟踪每个对象。最后,当我们在Jetson Nano上部署Agilev4和GoP模式加速时,帧率可以加速到每秒35.4帧。与原有的现有神经网络相比,加速比超过1200%。