Software and Compilers for Embedded Systems最新文献_第5页

The PROMPT design principles for predictable multi-core architectures 可预测的多核架构的PROMPT设计原则

Software and Compilers for Embedded Systems Pub Date : 2009-04-23 DOI: 10.1145/1543820.1543826

R. Wilhelm

引用次数: 4

Communication between nested loop programs via circular buffers in an embedded multiprocessor system 嵌入式多处理器系统中通过循环缓冲区的嵌套循环程序之间的通信

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361104

T. Bijlsma, M. Bekooij, P. Jansen, G. Smit

引用次数: 26

Optimal vs. heuristic integrated code generation for clustered VLIW architectures 集群VLIW体系结构的最优与启发式集成代码生成

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361099

Mattias V. Eriksson, Oskar Skoog, C. Kessler

引用次数: 22

Fast source-level data assignment to dual memory banks 快速源级数据分配到双内存库

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361105

A. Murray, Björn Franke

{"title":"Fast source-level data assignment to dual memory banks","authors":"A. Murray, Björn Franke","doi":"10.1145/1361096.1361105","DOIUrl":"https://doi.org/10.1145/1361096.1361105","url":null,"abstract":"Due to their streaming nature memory bandwidth is critical for most digital signal processing applications. To accommodate for these bandwidth requirements digital signal processors are typically equipped with dual memory banks that enable simultaneous access to two operands if the data is partitioned appropriately. Fully automated and compiler integrated approaches to data partitioning and memory bank assignment, however, have found little acceptance by DSP software developers. This is partly due to their inflexibility and inability to cope with certain manual data pre-assignments, e.g. due to I/O constraints. In this paper we present a different and more flexible approach, namely source-level dual memory assignment where code generation targets DSP-C, a standardised C language extension widely supported by industrial C compilers for DSPs. Additionally, we present a novel partitioning algorithm based on soft colouring that is more efficient and scalable than the currently known best integer linear programming algorithm, whilst achieving competitive code quality. We have evaluated our scheme on an Analog Devices TigerSHARC DSP and achieved speedups of up to 1.57 on 13 UTDSP benchmarks.","PeriodicalId":375451,"journal":{"name":"Software and Compilers for Embedded Systems","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123504181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

Memory footprint reduction for embedded systems 减少嵌入式系统的内存占用

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361102

K. D. Bosschere

{"title":"Memory footprint reduction for embedded systems","authors":"K. D. Bosschere","doi":"10.1145/1361096.1361102","DOIUrl":"https://doi.org/10.1145/1361096.1361102","url":null,"abstract":"The memory footprint is considered an important constraint for embedded systems. This is especially important in the context of increasing sophistication of embedded software, and the increasing use of modern software engineering techniques like component-based design. Since reusability is the major motivation for using components, most components are not optimized for the (limited) functionality they have to realize in an embedded system. All this leads to an increasing amount of code and data that might not be needed for a given functionality. The memory footprint of an embedded system consists of 2 parts: the footprint of the application and the footprint of the operating system. In this keynote talk, I will focus on the memory footprint reduction of application as well as the Linux kernel. I will report memory footprint reductions that have been obtained by the Diablo binary rewriter, which has been used to substantially reduce the memory footprint of both applications and of the system software. For the applications, the optimizer is capable of reducing the code size of programs compiled with two proprietary ARM tool chains (ADS 1.1 and RVCT 2.1) with on average 16% for statically linked ARM programs, while making them 12.8% faster. Execution of the rewritten programs also consumes on average 10.7% less energy. For the system software, we specialize the kernel both for the system calls that are actually occurring in the application program, and for the boot parameters of the kernel. We also assume that the hardware is fixed so that part of the bootstrap process is completely deterministic and can be optimized based on actual trace information. Finally, we compress frozen code, and we swap cold code to flash memory. All combined, these compaction techniques on the kernel can reduce the kernel's RAM footprint with up to 48% for the Linux kernel. The slowdown was limited to 1--2%. This proves that binary rewriting can help in substantially reducing the memory footprint of both the application and the system software. The nice thing is that it can be done automatically, and that it also reduces the execution time and the power consumption.","PeriodicalId":375451,"journal":{"name":"Software and Compilers for Embedded Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133343628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A new heuristic for SOA problem based on effective tie break function 一种基于有效断接函数的SOA问题启发式算法

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361106

H. Shokry, H. M. El-Boghdadi, S. Shaheen

{"title":"A new heuristic for SOA problem based on effective tie break function","authors":"H. Shokry, H. M. El-Boghdadi, S. Shaheen","doi":"10.1145/1361096.1361106","DOIUrl":"https://doi.org/10.1145/1361096.1361106","url":null,"abstract":"Producing efficient and compact code for embedded DSP processors is very important for nowadays faster and smaller size devices. Because such processors have highly irregular data-path, conventional code generation techniques typically result in inefficient code. Embedded software compilers are expected to make use of the Address Generation Unit (AGU); a feature commonly found in modern embedded DSP processors. This helps in generating optimized offset assignments to program variables in memory, and consequently minimize the overhead instructions dedicated for addresses computations. This paper addresses one of the problems of code optimizations; namely Simple Offset Assignment (SOA) problem.\u0000 In this paper, we study the tie break function introduced by Leupers and Marwedel [1] and show that this function does not represent the actual tie break that could happen in the graph. Then we introduce the notion of Effective Tie Break Function (ETBF) and use it in proposing a new algorithm for solving the SOA problem. We apply the algorithm to randomly generated graphs. Our results show improvement in offset assignment cost of up to 7% over well known offset assignment algorithms [1,2,3].","PeriodicalId":375451,"journal":{"name":"Software and Compilers for Embedded Systems","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129740099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Fast cycle-approximate instruction set simulation 快速周期近似指令集仿真

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361109

Björn Franke

引用次数: 39

WCET-driven, code-size critical procedure cloning wcet驱动，代码大小关键过程克隆

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361100

Paul Lokuciejewski, H. Falk, P. Marwedel, Henrik Theiling

{"title":"WCET-driven, code-size critical procedure cloning","authors":"Paul Lokuciejewski, H. Falk, P. Marwedel, Henrik Theiling","doi":"10.1145/1361096.1361100","DOIUrl":"https://doi.org/10.1145/1361096.1361100","url":null,"abstract":"In the domain of the worst-case execution time (WCET) analysis, loops are an inherent source of unpredictability and loss of precision since the determination of tight and safe information on the number of loop iterations is a difficult task. In particular, data-dependent loops whose iteration counts depend on function parameters can not be precisely handled by a timing analysis. Procedure Cloning can be exploited to make these loops explicit within the source code allowing a highly precise WCET analysis.\u0000 In this paper we extend the standard Procedure Cloning optimization by WCET-aware concepts with the objective to improve the tightness of the WCET estimation. Our novel approach is driven by WCET information which successively eliminates code structures leading to overestimated timing results, thus making the code more suitable for the analysis. In addition, the code size increase during the optimization is monitored and large increases are avoided.\u0000 The effectiveness of our optimization is shown by tests on real-world benchmarks. After performing our optimization, the estimated WCET is reduced by up to 64.2% while the employed code transformations yield an additional code size increase of 22.6% on average. In contrast, the average-case performance being the original objective of Procedure Cloning showed a slight decrease.","PeriodicalId":375451,"journal":{"name":"Software and Compilers for Embedded Systems","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124081880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

A fully-non-transparent approach to the code location problem 一个完全不透明的方法来解决代码位置问题

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361108

Hugo Venturini, F. Riss, Jean-Claude Fernandez, M. Santana

{"title":"A fully-non-transparent approach to the code location problem","authors":"Hugo Venturini, F. Riss, Jean-Claude Fernandez, M. Santana","doi":"10.1145/1361096.1361108","DOIUrl":"https://doi.org/10.1145/1361096.1361108","url":null,"abstract":"In the context of embedded systems such as cell-phones, PDA or cars and planes software, optimizations of code are required because of timing and memory constraints imposed. Many problems arise when trying to debug optimized code. One of them is the irrelevance of the mapping between the source code and the optimized target program: the Code Location Problem. This paper proposes a solution to this problem in the case of highly optimized code in the context of embedded systems.\u0000 Two approaches exist: non-transparent and transparent debugging. Our approach is non-transparent. The idea is to reveal the execution of the optimized program to the user so the latter understands the mapping to the source code in spite of transformations applied to the program. We do not emulate the execution of the unoptimized program. We make good use of the programmer's knowledge of its development platform. Standard debuggers do not provide the required mechanisms while compilers do not provide the relevant debug information. We propose a novel method to maintain accurate debug information when optimizing at compilation and we experiment this method on the MMDSP+ C compiler and the IDBug debugger.","PeriodicalId":375451,"journal":{"name":"Software and Compilers for Embedded Systems","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126106257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Integrated code generation by using fuzzy control system 利用模糊控制系统集成代码生成

Software and Compilers for Embedded Systems Pub Date : 2008-03-13 DOI: 10.1145/1361096.1361098

Xiaoyan Jia, Jie Guo, G. Fettweis

引用次数: 1