International Conference on Compilers, Architecture, and Synthesis for Embedded Systems最新文献_第5页

MNEMEE: a framework for memory management and optimization of static and dynamic data in MPSoCs MNEMEE:用于mpsoc中静态和动态数据的内存管理和优化的框架

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878959

A. Mallik, P. Marwedel, D. Soudris, S. Stuijk

{"title":"MNEMEE: a framework for memory management and optimization of static and dynamic data in MPSoCs","authors":"A. Mallik, P. Marwedel, D. Soudris, S. Stuijk","doi":"10.1145/1878921.1878959","DOIUrl":"https://doi.org/10.1145/1878921.1878959","url":null,"abstract":"As embedded systems are becoming the center of our digital life, system design becomes progressively harder. The integration of multiple features on devices with limited resources requires careful and exhaustive exploration of the design search space in order to efficiently map modern applications to an embedded multi-processor platform. The MNEMEE project [1] addresses this challenge by offering a unique integrated tool flow that performs source-to-source transformations to automatically optimize the original source code and map it on the target platform. The optimizations aim at reducing the number of memory accesses and the required memory storage of both dynamically and statically allocated data. Furthermore, the MNEMEE tool flow performs optimal assignment of all data on the memory hierarchy of the target platform. Overall, the MNEMEE techniques embedded in it will lead to more cost efficient systems that offer a better performance and lower energy consumption.\u0000 This tutorial gives an overview of the MNEMEE tool flow. The objective of the tutorial is to familiarize the audience with the tool framework and the optimizations used in the individual tools. The tutorial also features a demonstration of the tool flow. This demonstration shows that the tools developed in the MNEMEE project provide a user-friendly and efficient framework for MPSoC programming and memory managemen","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114328196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

The virtual hospital: the emergence of telemedicine 虚拟医院:远程医疗的出现

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878930

D. Petrasek, A. Barr, K. Palem

{"title":"The virtual hospital: the emergence of telemedicine","authors":"D. Petrasek, A. Barr, K. Palem","doi":"10.1145/1878921.1878930","DOIUrl":"https://doi.org/10.1145/1878921.1878930","url":null,"abstract":"The current practice of medicine, while utilizing the advances in biological and physical science, still takes place in the physician office or hospital. Unfortunately, traditional practice as integrated into the current Healthcare system is unsustainable. Accommodating the increase demand for medical services with the attendant rising costs has caused a crisis in healthcare.\u0000 Telemedicine, the practice of medicine by means of mobile/internet is a transformative process that will impact healthcare globally. Already, teleradiology (diagnostic radiology remotely by means of digital images that are electronically exported) and electronic medical records are gaining wide acceptance. The ability to distribute medical services by means of mobile and internet technology is a natural and almost irresistible direction for the field of Medicine.\u0000 The healthcare crisis has created an opportunity for new solutions and mobile/Internet technology has laid the infrastructure upon which one can build a powerful, innovative and badly needed platform for health services: The Global Virtual Hospital (GVH). The GVH will be a group of connected centers around the world that overlap (in time zones) throughout the working day. Patients will have access through the Internet or mobile phones. Medical records will be electronically stored, shared among authorized personal and updated with each medical interaction. The GVH, will serve as a platform and laboratory for the creation of innovative devices and technology that will improve the remote interaction.The Global Virtual Hospital System will exemplify the convergence of technology and medicine and will be integrated into standard practice in the next 5-10 year.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121782013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Instruction selection by graph transformation 图变换指令选择

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878926

Sebastian Buchwald, Andreas Zwinkau

引用次数: 12

Towards minimizing execution delays on dynamically reconfigurable processors: a case study on REDEFINE 在动态可重构处理器上最小化执行延迟:一个重新定义的案例研究

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878935

R. Krishnamoorthy, Keshavan Varadarajan, Ganesh Garga, M. Alle, S. Nandy, R. Narayan, M. Fujita

引用次数: 2

Mighty-morphing power-SIMD Mighty-morphing power-SIMD

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878934

Ganesh S. Dasika, M. Woh, Sangwon Seo, Nathan Clark, T. Mudge, S. Mahlke

{"title":"Mighty-morphing power-SIMD","authors":"Ganesh S. Dasika, M. Woh, Sangwon Seo, Nathan Clark, T. Mudge, S. Mahlke","doi":"10.1145/1878921.1878934","DOIUrl":"https://doi.org/10.1145/1878921.1878934","url":null,"abstract":"In modern wireless devices, two broad classes of compute-intensive applications are common: those with high amounts of data-level parallelism, such as signal processing used in wireless baseband applications, and those that have little data-level parallelism, such as encryption. Wide single-instruction multiple-data (SIMD) processors have become popular for providing high performance, yet power efficient data engines for applications with abundant data parallelism. However, the non-data-parallel applications are relegated to a low-performance scalar datapath on these data engines while the SIMD resources are left idle. To accelerate both types of applications, we propose the design of a more flexible SIMD datapath called SIMD-Morph. In SIMD-Morph, code with data-level parallelism can be executed across the lanes in the traditional manner, but the lanes can be morphed into a feed-forward subgraph accelerator to execute scalar applications more efficiently. The morphed SIMD lanes form an accelerator that exploits both instruction-level parallelism as well as operation chaining to improve the performance of scalar code by exploiting the available resources in the SIMD lanes. Experimental results show that the performance impact is a 2.6X improvement for purely non-SIMD applications and a 1.4X improvement for the non-SIMD-ized portions of applications with data parallelism.","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129107523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Real-time unobtrusive program execution trace compression using branch predictor events 使用分支预测器事件的实时、不显眼的程序执行跟踪压缩

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878938

Vladimir Uzelac, A. Milenković, Martin Burtscher, M. Milenkovic

引用次数: 17

Minimizing inter-task interferences in scratch-pad memory usage for reducing the energy consumption of multi-task systems 最小化刮刮板内存使用中的任务间干扰，以降低多任务系统的能耗

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878945

L. Gauthier, T. Ishihara, Hideki Takase, H. Tomiyama, H. Takada

引用次数: 23

A memory interface for multi-purpose multi-stream accelerators 多用途多流加速器的内存接口

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878939

Sylvain Girbal, O. Temam, S. Yehia, H. Berry, Zheng Li

{"title":"A memory interface for multi-purpose multi-stream accelerators","authors":"Sylvain Girbal, O. Temam, S. Yehia, H. Berry, Zheng Li","doi":"10.1145/1878921.1878939","DOIUrl":"https://doi.org/10.1145/1878921.1878939","url":null,"abstract":"Power and programming challenges make heterogeneous multi-cores composed of cores and ASICs an attractive alternative to homogeneous multi-cores. Recently, multi-purpose loop-based generated accelerators have emerged as an especially attractive accelerator option. They have several assets: short design time (automatic generation), flexibility (multi-purpose) but low configuration and routing overhead (unlike FPGAs), computational performance (operations are directly mapped to hardware), and a focus on memory throughput by leveraging loop constructs. However, with multiple streams, the memory behavior of such accelerators can become at least as complex as that of superscalar processors, while they still need to retain the memory ordering predictability and throughput efficiency of DMAs. In this article, we show how to design a memory interface for multi-purpose accelerators which combines the ordering predictability of DMAs, retains key efficiency features of memory systems for complex processors, and requires only a fraction of their cost by leveraging the properties of streams references. We evaluate the approach with a synthesizable version of the memory interface for an example 9-task generated loop-based accelerator","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"36 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132835699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Practical aggregation of semantical program properties for machine learning based optimization 基于机器学习优化的实用语义程序属性聚合

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878951

Mircea Namolaru, Albert Cohen, G. Fursin, A. Zaks, Ari Freund

{"title":"Practical aggregation of semantical program properties for machine learning based optimization","authors":"Mircea Namolaru, Albert Cohen, G. Fursin, A. Zaks, Ari Freund","doi":"10.1145/1878921.1878951","DOIUrl":"https://doi.org/10.1145/1878921.1878951","url":null,"abstract":"Iterative search combined with machine learning is a promising approach to design optimizing compilers harnessing the complexity of modern computing systems. While traversing a program optimization space, we collect characteristic feature vectors of the program, and use them to discover correlations across programs, target architectures, data sets, and performance. Predictive models can be derived from such correlations, effectively hiding the time-consuming feedback-directed optimization process from the application programmer.\u0000 One key task of this approach, naturally assigned to compiler experts, is to design relevant features and implement scalable feature extractors, including statistical models that filter the most relevant information from millions of lines of code. This new task turns out to be a very challenging and tedious one from a compiler construction perspective. So far, only a limited set of ad-hoc, largely syntactical features have been devised. Yet machine learning is only able to discover correlations from information it is fed with: it is critical to select topical program features for a given optimization problem in order for this approach to succeed.\u0000 We propose a general method for systematically generating numerical features from a program. This method puts no restrictions on how to logically and algebraically aggregate semantical properties into numerical features. We illustrate our method on the difficult problem of selecting the best possible combination of 88 available optimizations in GCC. We achieve 74% of the potential speedup obtained through iterative compilation on a wide range of benchmarks and four different general-purpose and embedded architectures. Our work is particularly relevant to embedded system designers willing to quickly adapt the optimization heuristics of a mainstream compiler to their custom ISA, microarchitecture, benchmark suite and workload. Our method has been integrated with the publicly released MILEPOST GCC [14].","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129292390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 45

Compilers, architectures and synthesis for embedded computing: retrospect and prospect 嵌入式计算的编译器、体系结构和综合:回顾与展望

International Conference on Compilers, Architecture, and Synthesis for Embedded Systems Pub Date : 2010-10-24 DOI: 10.1145/1878921.1878947

K. Palem

{"title":"Compilers, architectures and synthesis for embedded computing: retrospect and prospect","authors":"K. Palem","doi":"10.1145/1878921.1878947","DOIUrl":"https://doi.org/10.1145/1878921.1878947","url":null,"abstract":"I am humbled and honored to be included in the ranks of my many distinguished predecessors who have been recognized with the W. Wallace McDowell Award. Beyond the personal dimension, it is especially satisfying to see the field of embedded computing being recognized through this award. Over the past fifteen years or so, the synergy of computer science and engineering, and electrical engineering principles, have led to major advances in this field. While embedded computing might be largely viewed as being based on electrical engineering innovation, I want to draw attention to the importance of computer science principles and practice, and their influence on the recent developments in this field. Notably, algorithms guided by complexity theory have provided an important basis at the foundational level. In particular, the judicious use of algorithms within the framework of compiler optimizations, have played a central role. Finally and perhaps most significantly, the notion of an instruction set architecture or isa, has provided a significant lens through which some of the interesting challenges in embedded computing have been tackled. With this as a backdrop, I will","PeriodicalId":136293,"journal":{"name":"International Conference on Compilers, Architecture, and Synthesis for Embedded Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129017658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3