Yang Xiao, Chuanjun Zhang, K. Inck, N. Vijaykrishnan
{"title":"Dynamic bandwidth adaptation using recognition accuracy prediction through pre-classification for embedded vision systems","authors":"Yang Xiao, Chuanjun Zhang, K. Inck, N. Vijaykrishnan","doi":"10.1109/ICCD.2013.6657020","DOIUrl":null,"url":null,"abstract":"Empowered by the massive growth of camera enabled mobile devices; mobile applications that allow users to perceive and experience the world in richer and more engaging ways have emerged at tremendous pace. As more complex perception algorithms are developed to take advantage of higher resolution imagery, future mobile applications will require application specific accelerators to maintain performance required for interactive user experiences. A key challenge in these accelerator-rich mobile platforms will be guaranteeing the off-chip memory bandwidth required by each accelerator. Device integration techniques such as Package on Package and Wide-IO seek to tackle the memory wall problem by reducing bottlenecks at the I/O interfaces. However, less effort has been focused on solving the bandwidth problem by dynamically leveraging the individual and collective bandwidth characteristics of accelerators operating concurrently. This work investigates the off-chip bandwidth characteristics of accelerators in the context of embedded perceptual computing applications. A bandwidth aware feedback system is proposed that dynamically partitions available bandwidth among a set of accelerators at the expense of application accuracy. As a case study, the proposed adaption policy is applied to a biologically-inspired scene understanding application. Results indicate that the system maintains good accuracy while requiring only 25% of the original bandwidth.","PeriodicalId":398811,"journal":{"name":"2013 IEEE 31st International Conference on Computer Design (ICCD)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 31st International Conference on Computer Design (ICCD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.2013.6657020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Empowered by the massive growth of camera enabled mobile devices; mobile applications that allow users to perceive and experience the world in richer and more engaging ways have emerged at tremendous pace. As more complex perception algorithms are developed to take advantage of higher resolution imagery, future mobile applications will require application specific accelerators to maintain performance required for interactive user experiences. A key challenge in these accelerator-rich mobile platforms will be guaranteeing the off-chip memory bandwidth required by each accelerator. Device integration techniques such as Package on Package and Wide-IO seek to tackle the memory wall problem by reducing bottlenecks at the I/O interfaces. However, less effort has been focused on solving the bandwidth problem by dynamically leveraging the individual and collective bandwidth characteristics of accelerators operating concurrently. This work investigates the off-chip bandwidth characteristics of accelerators in the context of embedded perceptual computing applications. A bandwidth aware feedback system is proposed that dynamically partitions available bandwidth among a set of accelerators at the expense of application accuracy. As a case study, the proposed adaption policy is applied to a biologically-inspired scene understanding application. Results indicate that the system maintains good accuracy while requiring only 25% of the original bandwidth.
由于支持摄像头的移动设备的大量增长;允许用户以更丰富、更吸引人的方式感知和体验世界的移动应用程序正以惊人的速度出现。随着更复杂的感知算法被开发出来,以利用更高分辨率的图像,未来的移动应用程序将需要特定于应用程序的加速器来维持交互式用户体验所需的性能。在这些加速器丰富的移动平台中,一个关键的挑战将是保证每个加速器所需的片外内存带宽。器件集成技术(如Package on Package和Wide-IO)试图通过减少I/O接口的瓶颈来解决内存墙问题。然而,通过动态地利用并行运行的加速器的单个和集体带宽特性来解决带宽问题的努力较少。本研究研究了嵌入式感知计算应用中加速器的片外带宽特性。提出了一种带宽感知反馈系统,该系统以牺牲应用精度为代价,在一组加速器之间动态划分可用带宽。作为一个案例研究,提出的适应策略应用于一个受生物学启发的场景理解应用。结果表明,该系统在只需要原带宽的25%的情况下保持了较好的精度。