International Conference on Compilers, Architecture, and Synthesis for Embedded Systems最新文献

筛选
英文 中文
MNEMEE: a framework for memory management and optimization of static and dynamic data in MPSoCs MNEMEE:用于mpsoc中静态和动态数据的内存管理和优化的框架
A. Mallik, P. Marwedel, D. Soudris, S. Stuijk
{"title":"MNEMEE: a framework for memory management and optimization of static and dynamic data in MPSoCs","authors":"A. Mallik, P. Marwedel, D. Soudris, S. Stuijk","doi":"10.1145/1878921.1878959","DOIUrl":"https://doi.org/10.1145/1878921.1878959","url":null,"abstract":"As embedded systems are becoming the center of our digital life, system design becomes progressively harder. The integration of multiple features on devices with limited resources requires careful and exhaustive exploration of the design search space in order to efficiently map modern applications to an embedded multi-processor platform. The MNEMEE project [1] addresses this challenge by offering a unique integrated tool flow that performs source-to-source transformations to automatically optimize the original source code and map it on the target platform. The optimizations aim at reducing the number of memory accesses and the required memory storage of both dynamically and statically allocated data. Furthermore, the MNEMEE tool flow performs optimal assignment of all data on the memory hierarchy of the target platform. Overall, the MNEMEE techniques embedded in it will lead to more cost efficient systems that offer a better performance and lower energy consumption.\u0000 This tutorial gives an overview of the MNEMEE tool flow. The objective of the tutorial is to familiarize the audience with the tool framework and the optimizations used in the individual tools. The tutorial also features a demonstration of the tool flow. This demonstration shows that the tools developed in the MNEMEE project provide a user-friendly and efficient framework for MPSoC programming and memory managemen","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114328196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The virtual hospital: the emergence of telemedicine 虚拟医院:远程医疗的出现
D. Petrasek, A. Barr, K. Palem
{"title":"The virtual hospital: the emergence of telemedicine","authors":"D. Petrasek, A. Barr, K. Palem","doi":"10.1145/1878921.1878930","DOIUrl":"https://doi.org/10.1145/1878921.1878930","url":null,"abstract":"The current practice of medicine, while utilizing the advances in biological and physical science, still takes place in the physician office or hospital. Unfortunately, traditional practice as integrated into the current Healthcare system is unsustainable. Accommodating the increase demand for medical services with the attendant rising costs has caused a crisis in healthcare.\u0000 Telemedicine, the practice of medicine by means of mobile/internet is a transformative process that will impact healthcare globally. Already, teleradiology (diagnostic radiology remotely by means of digital images that are electronically exported) and electronic medical records are gaining wide acceptance. The ability to distribute medical services by means of mobile and internet technology is a natural and almost irresistible direction for the field of Medicine.\u0000 The healthcare crisis has created an opportunity for new solutions and mobile/Internet technology has laid the infrastructure upon which one can build a powerful, innovative and badly needed platform for health services: The Global Virtual Hospital (GVH). The GVH will be a group of connected centers around the world that overlap (in time zones) throughout the working day. Patients will have access through the Internet or mobile phones. Medical records will be electronically stored, shared among authorized personal and updated with each medical interaction. The GVH, will serve as a platform and laboratory for the creation of innovative devices and technology that will improve the remote interaction.The Global Virtual Hospital System will exemplify the convergence of technology and medicine and will be integrated into standard practice in the next 5-10 year.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121782013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Instruction selection by graph transformation 图变换指令选择
Sebastian Buchwald, Andreas Zwinkau
{"title":"Instruction selection by graph transformation","authors":"Sebastian Buchwald, Andreas Zwinkau","doi":"10.1145/1878921.1878926","DOIUrl":"https://doi.org/10.1145/1878921.1878926","url":null,"abstract":"Common generated instruction selections are based on tree pattern matching, but modern and custom architectures feature instructions, which cannot be covered by trees. To overcome this limitation, we are the first to employ graph transformation, the natural generalization of tree rewriting. Currently, the only approach allowing us to pair graph-based instruction selection with linear time complexity is the mapping to the Partitioned Boolean Quadratic Problem (PBQP). We present formal foundations to verify this approach and therewith identify two problems of the common method and resolve them. We confirm the capabilities of PBQP-based instruction selection by a comparison with a finely-tuned hand-written instruction selection.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"149 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124167841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Towards minimizing execution delays on dynamically reconfigurable processors: a case study on REDEFINE 在动态可重构处理器上最小化执行延迟:一个重新定义的案例研究
R. Krishnamoorthy, Keshavan Varadarajan, Ganesh Garga, M. Alle, S. Nandy, R. Narayan, M. Fujita
{"title":"Towards minimizing execution delays on dynamically reconfigurable processors: a case study on REDEFINE","authors":"R. Krishnamoorthy, Keshavan Varadarajan, Ganesh Garga, M. Alle, S. Nandy, R. Narayan, M. Fujita","doi":"10.1145/1878921.1878935","DOIUrl":"https://doi.org/10.1145/1878921.1878935","url":null,"abstract":"In Dynamically Reconfigurable Processors (DRPs), compilation involves breaking an application into sub-tasks for piecewise execution on the fabric. These sub-tasks are sequenced based on data and control dependences. In DRPs, sub-task prefetching is used to hide the reconfiguration time while another sub-task executes. In REDEFINE, our target DRP, subtasks are referred to as HyperOps. Determining the successor for a HyperOp requires merging information from the control flow graph and the HyperOp dataflow graph. Succession in many cases is data dependent. Since hardware branch predictors cannot be applied due to the non-binary branches, we employ a speculative prefetch unit together with a profile based prediction scheme. Simulation results show around 7-33% reduction in overall execution time, when compared to the execution time without prefetching. We observe better performance when fewer resources on the fabric are used to execute prefetched HyperOps.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"434 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126098488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mighty-morphing power-SIMD Mighty-morphing power-SIMD
Ganesh S. Dasika, M. Woh, Sangwon Seo, Nathan Clark, T. Mudge, S. Mahlke
{"title":"Mighty-morphing power-SIMD","authors":"Ganesh S. Dasika, M. Woh, Sangwon Seo, Nathan Clark, T. Mudge, S. Mahlke","doi":"10.1145/1878921.1878934","DOIUrl":"https://doi.org/10.1145/1878921.1878934","url":null,"abstract":"In modern wireless devices, two broad classes of compute-intensive applications are common: those with high amounts of data-level parallelism, such as signal processing used in wireless baseband applications, and those that have little data-level parallelism, such as encryption. Wide single-instruction multiple-data (SIMD) processors have become popular for providing high performance, yet power efficient data engines for applications with abundant data parallelism. However, the non-data-parallel applications are relegated to a low-performance scalar datapath on these data engines while the SIMD resources are left idle. To accelerate both types of applications, we propose the design of a more flexible SIMD datapath called SIMD-Morph. In SIMD-Morph, code with data-level parallelism can be executed across the lanes in the traditional manner, but the lanes can be morphed into a feed-forward subgraph accelerator to execute scalar applications more efficiently. The morphed SIMD lanes form an accelerator that exploits both instruction-level parallelism as well as operation chaining to improve the performance of scalar code by exploiting the available resources in the SIMD lanes. Experimental results show that the performance impact is a 2.6X improvement for purely non-SIMD applications and a 1.4X improvement for the non-SIMD-ized portions of applications with data parallelism.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129107523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Real-time unobtrusive program execution trace compression using branch predictor events 使用分支预测器事件的实时、不显眼的程序执行跟踪压缩
Vladimir Uzelac, A. Milenković, Martin Burtscher, M. Milenkovic
{"title":"Real-time unobtrusive program execution trace compression using branch predictor events","authors":"Vladimir Uzelac, A. Milenković, Martin Burtscher, M. Milenkovic","doi":"10.1145/1878921.1878938","DOIUrl":"https://doi.org/10.1145/1878921.1878938","url":null,"abstract":"Unobtrusive capturing of program execution traces in real-time is crucial in debugging cyber-physical systems. However, tracing even limited program segments is often cost-prohibitive, requiring wide trace ports and large on-chip trace buffers. This paper introduces a new cost-effective technique for capturing and compressing program execution traces in real time. It uses branch predictor-like structures in the trace module to losslessly compress the traces. This approach results in high compression ratios because it only has to transmit misprediction events to the software debugger. Coupled with an effective variable encoding scheme, our technique requires merely 0.036 bits/instruction of trace port bandwidth (a 28-fold improvement over the commercial state-of-the-art) at a cost of roughly 5,200 logic gates.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115317738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Minimizing inter-task interferences in scratch-pad memory usage for reducing the energy consumption of multi-task systems 最小化刮刮板内存使用中的任务间干扰,以降低多任务系统的能耗
L. Gauthier, T. Ishihara, Hideki Takase, H. Tomiyama, H. Takada
{"title":"Minimizing inter-task interferences in scratch-pad memory usage for reducing the energy consumption of multi-task systems","authors":"L. Gauthier, T. Ishihara, Hideki Takase, H. Tomiyama, H. Takada","doi":"10.1145/1878921.1878945","DOIUrl":"https://doi.org/10.1145/1878921.1878945","url":null,"abstract":"This paper presents a new technique for reducing the energy consumption of a multi-task system by sharing its scratchpad memory (SPM) space among the tasks. With this technique, tasks can interfere by using common areas of the SPM. However, this requires to update these areas during context switches, which involves considerable overheads. Hence, an integer linear programming formulation is used at compile time for finding the best assignment of memory objects to the SPM and their respective locations inside it. Experiments show that the technique achieves up to 85% energy reduction with 8Kb of SPM and surpasses other sharing approaches.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116090257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A memory interface for multi-purpose multi-stream accelerators 多用途多流加速器的内存接口
Sylvain Girbal, O. Temam, S. Yehia, H. Berry, Zheng Li
{"title":"A memory interface for multi-purpose multi-stream accelerators","authors":"Sylvain Girbal, O. Temam, S. Yehia, H. Berry, Zheng Li","doi":"10.1145/1878921.1878939","DOIUrl":"https://doi.org/10.1145/1878921.1878939","url":null,"abstract":"Power and programming challenges make heterogeneous multi-cores composed of cores and ASICs an attractive alternative to homogeneous multi-cores. Recently, multi-purpose loop-based generated accelerators have emerged as an especially attractive accelerator option. They have several assets: short design time (automatic generation), flexibility (multi-purpose) but low configuration and routing overhead (unlike FPGAs), computational performance (operations are directly mapped to hardware), and a focus on memory throughput by leveraging loop constructs. However, with multiple streams, the memory behavior of such accelerators can become at least as complex as that of superscalar processors, while they still need to retain the memory ordering predictability and throughput efficiency of DMAs. In this article, we show how to design a memory interface for multi-purpose accelerators which combines the ordering predictability of DMAs, retains key efficiency features of memory systems for complex processors, and requires only a fraction of their cost by leveraging the properties of streams references. We evaluate the approach with a synthesizable version of the memory interface for an example 9-task generated loop-based accelerator","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"36 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132835699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Practical aggregation of semantical program properties for machine learning based optimization 基于机器学习优化的实用语义程序属性聚合
Mircea Namolaru, Albert Cohen, G. Fursin, A. Zaks, Ari Freund
{"title":"Practical aggregation of semantical program properties for machine learning based optimization","authors":"Mircea Namolaru, Albert Cohen, G. Fursin, A. Zaks, Ari Freund","doi":"10.1145/1878921.1878951","DOIUrl":"https://doi.org/10.1145/1878921.1878951","url":null,"abstract":"Iterative search combined with machine learning is a promising approach to design optimizing compilers harnessing the complexity of modern computing systems. While traversing a program optimization space, we collect characteristic feature vectors of the program, and use them to discover correlations across programs, target architectures, data sets, and performance. Predictive models can be derived from such correlations, effectively hiding the time-consuming feedback-directed optimization process from the application programmer.\u0000 One key task of this approach, naturally assigned to compiler experts, is to design relevant features and implement scalable feature extractors, including statistical models that filter the most relevant information from millions of lines of code. This new task turns out to be a very challenging and tedious one from a compiler construction perspective. So far, only a limited set of ad-hoc, largely syntactical features have been devised. Yet machine learning is only able to discover correlations from information it is fed with: it is critical to select topical program features for a given optimization problem in order for this approach to succeed.\u0000 We propose a general method for systematically generating numerical features from a program. This method puts no restrictions on how to logically and algebraically aggregate semantical properties into numerical features. We illustrate our method on the difficult problem of selecting the best possible combination of 88 available optimizations in GCC. We achieve 74% of the potential speedup obtained through iterative compilation on a wide range of benchmarks and four different general-purpose and embedded architectures. Our work is particularly relevant to embedded system designers willing to quickly adapt the optimization heuristics of a mainstream compiler to their custom ISA, microarchitecture, benchmark suite and workload. Our method has been integrated with the publicly released MILEPOST GCC [14].","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129292390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Compilers, architectures and synthesis for embedded computing: retrospect and prospect 嵌入式计算的编译器、体系结构和综合:回顾与展望
K. Palem
{"title":"Compilers, architectures and synthesis for embedded computing: retrospect and prospect","authors":"K. Palem","doi":"10.1145/1878921.1878947","DOIUrl":"https://doi.org/10.1145/1878921.1878947","url":null,"abstract":"I am humbled and honored to be included in the ranks of my many distinguished predecessors who have been recognized with the W. Wallace McDowell Award. Beyond the personal dimension, it is especially satisfying to see the field of embedded computing being recognized through this award. Over the past fifteen years or so, the synergy of computer science and engineering, and electrical engineering principles, have led to major advances in this field. While embedded computing might be largely viewed as being based on electrical engineering innovation, I want to draw attention to the importance of computer science principles and practice, and their influence on the recent developments in this field. Notably, algorithms guided by complexity theory have provided an important basis at the foundational level. In particular, the judicious use of algorithms within the framework of compiler optimizations, have played a central role. Finally and perhaps most significantly, the notion of an instruction set architecture or isa, has provided a significant lens through which some of the interesting challenges in embedded computing have been tackled. With this as a backdrop, I will","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129017658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信