将算法参数整合到三维场景理解的基准测试和设计空间探索中

Sreekar Shenoy, Bruno Bodin, Luigi Nardi, M. Zia, Harry Wagstaff, Govind Sreekar Shenoy, M. Emani, John Mawer, Christos Kotselidis, A. Nisbet, M. Luján, Björn Franke, P. Kelly, M. O’Boyle
{"title":"将算法参数整合到三维场景理解的基准测试和设计空间探索中","authors":"Sreekar Shenoy, Bruno Bodin, Luigi Nardi, M. Zia, Harry Wagstaff, Govind Sreekar Shenoy, M. Emani, John Mawer, Christos Kotselidis, A. Nisbet, M. Luján, Björn Franke, P. Kelly, M. O’Boyle","doi":"10.1145/2967938.2967963","DOIUrl":null,"url":null,"abstract":"System designers typically use well-studied benchmarks to evaluate and improve new architectures and compilers. We design tomorrow's systems based on yesterday's applications. In this paper we investigate an emerging application, 3D scene understanding, likely to be significant in the mobile space in the near future. Until now, this application could only run in real-time on desktop GPUs. In this work, we examine how it can be mapped to power constrained embedded systems. Key to our approach is the idea of incremental co-design exploration, where optimization choices that concern the domain layer are incrementally explored together with low-level compiler and architecture choices. The goal of this exploration is to reduce execution time while minimizing power and meeting our quality of result objective. As the design space is too large to exhaustively evaluate, we use active learning based on a random forest predictor to find good designs. We show that our approach can, for the first time, achieve dense 3D mapping and tracking in the real-time range within a 1W power budget on a popular embedded device. This is a 4.8× execution time improvement and a 2.8× power reduction compared to the state-of-the-art.","PeriodicalId":407717,"journal":{"name":"2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":"{\"title\":\"Integrating algorithmic parameters into benchmarking and design space exploration in 3D scene understanding\",\"authors\":\"Sreekar Shenoy, Bruno Bodin, Luigi Nardi, M. Zia, Harry Wagstaff, Govind Sreekar Shenoy, M. Emani, John Mawer, Christos Kotselidis, A. Nisbet, M. Luján, Björn Franke, P. Kelly, M. O’Boyle\",\"doi\":\"10.1145/2967938.2967963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"System designers typically use well-studied benchmarks to evaluate and improve new architectures and compilers. We design tomorrow's systems based on yesterday's applications. In this paper we investigate an emerging application, 3D scene understanding, likely to be significant in the mobile space in the near future. Until now, this application could only run in real-time on desktop GPUs. In this work, we examine how it can be mapped to power constrained embedded systems. Key to our approach is the idea of incremental co-design exploration, where optimization choices that concern the domain layer are incrementally explored together with low-level compiler and architecture choices. The goal of this exploration is to reduce execution time while minimizing power and meeting our quality of result objective. As the design space is too large to exhaustively evaluate, we use active learning based on a random forest predictor to find good designs. We show that our approach can, for the first time, achieve dense 3D mapping and tracking in the real-time range within a 1W power budget on a popular embedded device. This is a 4.8× execution time improvement and a 2.8× power reduction compared to the state-of-the-art.\",\"PeriodicalId\":407717,\"journal\":{\"name\":\"2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"45\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2967938.2967963\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2967938.2967963","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 45

摘要

系统设计人员通常使用经过充分研究的基准来评估和改进新的体系结构和编译器。我们根据昨天的应用设计明天的系统。在本文中,我们研究了一个新兴的应用,3D场景理解,可能在不久的将来在移动空间中很重要。到目前为止,这个应用程序只能在桌面gpu上实时运行。在这项工作中,我们研究如何将其映射到功率受限的嵌入式系统。我们方法的关键是增量协同设计探索的思想,其中涉及领域层的优化选择与低级编译器和体系结构选择一起被增量地探索。这一探索的目标是减少执行时间,同时最小化功率并满足我们的结果质量目标。由于设计空间太大,无法进行详尽的评估,我们使用基于随机森林预测器的主动学习来找到好的设计。我们表明,我们的方法可以首次在流行的嵌入式设备上以1W的功率预算在实时范围内实现密集的3D映射和跟踪。与最先进的技术相比,这是4.8倍的执行时间改进和2.8倍的功耗降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Integrating algorithmic parameters into benchmarking and design space exploration in 3D scene understanding
System designers typically use well-studied benchmarks to evaluate and improve new architectures and compilers. We design tomorrow's systems based on yesterday's applications. In this paper we investigate an emerging application, 3D scene understanding, likely to be significant in the mobile space in the near future. Until now, this application could only run in real-time on desktop GPUs. In this work, we examine how it can be mapped to power constrained embedded systems. Key to our approach is the idea of incremental co-design exploration, where optimization choices that concern the domain layer are incrementally explored together with low-level compiler and architecture choices. The goal of this exploration is to reduce execution time while minimizing power and meeting our quality of result objective. As the design space is too large to exhaustively evaluate, we use active learning based on a random forest predictor to find good designs. We show that our approach can, for the first time, achieve dense 3D mapping and tracking in the real-time range within a 1W power budget on a popular embedded device. This is a 4.8× execution time improvement and a 2.8× power reduction compared to the state-of-the-art.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信