2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia最新文献

筛选
英文 中文
A novel low-power embedded object recognition system working at multi-frames per second (Extended abstract) 一种新型的低功耗嵌入式多帧每秒目标识别系统(扩展摘要)
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1145/2435227.2435229
A. Nikitakis, Savvas Papaioannou, I. Papaefstathiou
{"title":"A novel low-power embedded object recognition system working at multi-frames per second (Extended abstract)","authors":"A. Nikitakis, Savvas Papaioannou, I. Papaefstathiou","doi":"10.1145/2435227.2435229","DOIUrl":"https://doi.org/10.1145/2435227.2435229","url":null,"abstract":"One very important challenge in the field of multimedia is the implementation of fast and detailed Object Detection and Recognition systems. In particular, in the current state-of-the-art mobile multimedia systems, it is highly desirable to detect and locate certain objects within a video frame in real time. In this paper, we present a novel FPGA-based embedded implementation of a very efficient object recognition algorithm called Receptive Field Cooccurrence Histograms Algorithm(RFCH). Our main focus was to increase its performance so as to be able to handle the object recognition task of today's highly sophisticated embedded multimedia systems while keeping its energy consumption at very low levels. Our low-power embedded reconfigurable system is at least 15 times faster than the software implementation on a low-voltage high-end CPU, while consuming at least 60 times less energy. Our novel system is also 88 times more energy efficient than the recently introduced low-power multi-core Intel devices which are optimized for embedded systems.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122844958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Static prediction of recursion frequency using machine learning to enable hot spot optimizations 使用机器学习实现热点优化的递归频率静态预测
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507027
D. Tetzlaff, S. Glesner
{"title":"Static prediction of recursion frequency using machine learning to enable hot spot optimizations","authors":"D. Tetzlaff, S. Glesner","doi":"10.1109/ESTIMedia.2012.6507027","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507027","url":null,"abstract":"Recursion poses a severe problem for static optimizations because its execution frequency usually depends upon runtime values, hence being rarely predictable at compile time. As a consequence, optimization potential of programs is sacrificed since possible hot paths where most of the execution time is spent and where optimization would be beneficial might be undiscovered. In this paper, we propose a sophisticated machine learning based approach to statically predict the recursion frequency of functions for programs in real-world application domains, which can be used to guide various hot spot optimizations. Our experiments with 369 programs of 25 benchmark suites from different domains demonstrate that our approach is applicable to a wide range of programs with different behavior and yields more precise heuristics than those generated by pure static analyses. Moreover, our results provide valuable insights into recursive structures in general, when they appear and how deep they are.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120933792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
TEACA: Thread ProgrEss Aware Coherence Adaption for hybrid coherence protocols 混合相干协议的线程进程感知相干自适应
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507024
Jianhua Li, Liang Shi, Qing'an Li, C. Xue, Yinlong Xu
{"title":"TEACA: Thread ProgrEss Aware Coherence Adaption for hybrid coherence protocols","authors":"Jianhua Li, Liang Shi, Qing'an Li, C. Xue, Yinlong Xu","doi":"10.1109/ESTIMedia.2012.6507024","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507024","url":null,"abstract":"Hybrid coherence protocols can provide the scalability of directory protocols and low latency sharing miss handling in snooping protocols simultaneously. Unfortunately, how to adapt the hybrid protocols at runtime is not well studied. This paper proposes Thread ProgrEss Aware Coherence Adaption (TEACA) which utilizes the thread progress information as the hints to adapt hybrid coherence protocols. Specifically, TEACA fuses the memory system statistics to estimate the progress of threads. Based on the estimated thread progress information, TEACA dynamically categorizes threads into leader threads and laggard threads. The thread categorization decisions are then leveraged for efficient coherence adaption in hybrid coherence protocols. A case study on a recently proposed hybrid protocol (PATCH [29]) shows that, with the hints from TEACA, the enhanced hybrid protocol outperforms its baseline in both application execution time and energy dissipation.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133007499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhancing user experiences by exploiting energy and launch delay tradeoff of mobile multimedia applications (Extended abstract) 利用移动多媒体应用的能量和启动延迟权衡来增强用户体验(扩展摘要)
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507034
Yi-Fan Chung, Yin-Tsung Lo, C. King
{"title":"Enhancing user experiences by exploiting energy and launch delay tradeoff of mobile multimedia applications (Extended abstract)","authors":"Yi-Fan Chung, Yin-Tsung Lo, C. King","doi":"10.1109/ESTIMedia.2012.6507034","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507034","url":null,"abstract":"The growing multimedia applications on smart phones place ever more stringent demands on user experiences. A key factor affecting user experiences is the delay in launching applications. It affects a user's perception of the responsiveness of the phone and the multimedia applications.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125254908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping of streaming applications considering alternative application specifications (Extended abstract) 考虑可选应用程序规范的流应用程序映射(扩展抽象)
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1145/2435227.2435230
J. Zhai, Hristo Nikolov, T. Stefanov
{"title":"Mapping of streaming applications considering alternative application specifications (Extended abstract)","authors":"J. Zhai, Hristo Nikolov, T. Stefanov","doi":"10.1145/2435227.2435230","DOIUrl":"https://doi.org/10.1145/2435227.2435230","url":null,"abstract":"Streaming applications often require a parallel Model of Computation (MoC) to specify their application behavior and to facilitate mapping onto Multi-Processor System-on-Chip (MPSoC) platforms. Various performance requirements and resource budgets of embedded systems ask for an efficient design space exploration (DSE) approach to select the best design from a design space consisting of a large number of design choices. However, existing DSE approaches explore the design space that includes only architecture and mapping alternatives for an initial application specification given by the application designer. In this paper, we first show that a design often might not be optimal if alternative specifications of a given application are not taken into account. We further argue that the best alternative specification consists of only independent and load-balanced application tasks. Based on the Polyhedral Process Network (PPN) MoC, we present an approach to analyze and transform an initial PPN to an alternative one that contains only independent processes if possible. Finally, by prototyping real-life applications on both FPGA-based MPSoCs and desktop multi-core platforms, we demonstrate that mapping the alternative application specification results in a large performance gain compared to those approaches, in which alternative application specifications are not taken into account.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126992714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Loop instruction caching for energy-efficient embedded multitasking processors 循环指令缓存节能嵌入式多任务处理器
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507036
Ji Gu, T. Ishihara, Kyungsoo Lee
{"title":"Loop instruction caching for energy-efficient embedded multitasking processors","authors":"Ji Gu, T. Ishihara, Kyungsoo Lee","doi":"10.1109/ESTIMedia.2012.6507036","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507036","url":null,"abstract":"With the exponential increase of power consumption in processor generations, energy dissipation has become one of the most critical constraints in system design. Cache memories are usually the most energy consuming components on the processor chip due to their large die size occupation and frequent access operations. Furthermore, in step with the increased complexity of modern embedded applications, microprocessors are increasingly executing multitasking applications. In multitasking processors, the conventional L1 instruction cache (I-cache) is usually shared by multiple tasks and thereby suffering a highly intensive read/write operations, which can be even more energy-consuming than used in a single-task based system. This paper presents an energy-efficient shared multitasking loop instruction cache (SMLIC), which is designed to address the tasks sharing and context switch issues so that it can be efficiently utilized to reduce the I-cache accesses for energy savings in multitasking processors. Experiments on a set of multitasking applications demonstrate that the proposed SMLIC design scheme can reduce I-cache accesses by 12∼86% and energy consumption in instruction supply by 11∼79% for multitasking system, depending on various frequencies of context switch.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129925830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Keynote: “Design space exploration and run-time resource management in the embedded multi-core era” 主题演讲:嵌入式多核时代的设计空间探索与运行时资源管理
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507016
S. Bampi
{"title":"Keynote: “Design space exploration and run-time resource management in the embedded multi-core era”","authors":"S. Bampi","doi":"10.1109/ESTIMedia.2012.6507016","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507016","url":null,"abstract":"Increasingly demanding complex algorithms for multimedia systems and higher resolutions for multiview videos hit power and memory walls in portable hardware. Silicon IC technology scaling is reaching two-dimensional limitations that accompany escalating technology cost wall. In this scenario the severe costs of power density, circuit performance variability and energy constraints call for new algorithms-to-architecture approaches. This talk will highlight the architectures and circuits techniques that will influence multimedia systems architectures in the future. Design challenges and specific solutions that deal with energy dissipation in the case of multiview video are addressed. In this presentation the technology-design-architecture-algorithms interactions are pointed as drivers for new cross-layer optimizations in energy-constrained multimedia systems.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133361485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
O2render: An OpenCL-to-Renderscript translator for porting across various GPUs or CPUs O2render:用于跨各种gpu或cpu移植的OpenCL-to-Renderscript转换器
2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia Pub Date : 2012-10-01 DOI: 10.1109/ESTIMedia.2012.6507031
Cheng-yan Yang, Yi-jui Wu, S. Liao
{"title":"O2render: An OpenCL-to-Renderscript translator for porting across various GPUs or CPUs","authors":"Cheng-yan Yang, Yi-jui Wu, S. Liao","doi":"10.1109/ESTIMedia.2012.6507031","DOIUrl":"https://doi.org/10.1109/ESTIMedia.2012.6507031","url":null,"abstract":"More than half-a-billion Android devices are world's most impactful real-time, interactive multimedia systems that are open-sourced. Google introduced Renderscript language and runtime in Android releases starting in 2011. Renderscript delivers performance and portability without losing usability. However, it is difficult to reuse software written in existing compute languages such as OpenCL. Thus, we develop the O2render system to enable OpenCL programs on Android devices. We analyze fundamental differences between OpenCL and Renderscript, and present our design of a translator between them using low-level virtual machine (LLVM). We extend LLVMs frontend, Clang, and show that we achieve about the same performance in Renderscript with minimal translation overhead.","PeriodicalId":431615,"journal":{"name":"2012 IEEE 10th Symposium on Embedded Systems for Real-time Multimedia","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126959824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信