{"title":"Translating Java for resource constrained embedded systems","authors":"G. Plumbridge, N. Audsley","doi":"10.1109/ReCoSoC.2012.6322868","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322868","url":null,"abstract":"This paper discusses a strategy for translating the Java programming language to a form that is suitable for execution on resource limited embedded systems such as softcore processors in FPGAs, Network-on-Chip nodes and microcontrollers. The translation strategy prioritises the minimisation of runtime memory usage, generated code size, and suitability for a wide range of small architectures over other desirable goals such as execution speed and strict adherence to the Java standard. The translation procedure, or Concrete Hardware Implementation of a software application first converts the application's compiled Java class files to a self-contained intermediate representation conducive to optimisation and refactoring. The intermediate format is then serialised into a programming language compilable to the target architecture. This paper presents techniques for analysing whole Java applications, translating Java methods and building a stand-alone translated application with the same functional behaviour as the original Java. An example C-code generator is described and evaluated against similar previous approaches. An existing benchmark application, JavaBenchEmbedded, is demonstrated to require less than 30KiB of program code and 16KiB of runtime heap memory when executing on a Xilinx MicroBlaze Processor.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"91 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123116498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pascal Cotret, Florian Devic, G. Gogniat, Benoît Badrignans, L. Torres
{"title":"Security enhancements for FPGA-based MPSoCs: A boot-to-runtime protection flow for an embedded Linux-based system","authors":"Pascal Cotret, Florian Devic, G. Gogniat, Benoît Badrignans, L. Torres","doi":"10.1109/ReCoSoC.2012.6322896","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322896","url":null,"abstract":"Nowadays, embedded systems become more and more complex: the hardware/software codesign approach is a method to create such systems in a single chip which can be based on reconfigurable technologies such as FPGAs (Field-Programmable Gate Arrays). In such systems, data exchanges are a key point as they convey critical and confidential information and data are transmitted between several hardware modules and software layers. In case of an FPGA development life cycle, OS (Operating System) / data updates as runtime communications can be done through an insecure link: attackers can use this medium to make the system misbehave (malicious injection) or retrieve bitstream-related information (eavesdropping). Recent works propose solutions to securely boot a bitstream and the associated OS while runtime transactions are not protected. This work proposes a full boot-to-runtime protection flow of an embedded Linux kernel during boot and confidentiality/integrity protection of the external memory containing the kernel and the main application code/data. This work shows that such a solution with hardware components induces an area occupancy of 10% of a xc6vlx240t Virtex-6 FPGA while having an improved throughput for Linux booting and low-latency security for runtime protection.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128278240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dominic Hillenbrand, Christian Brugger, J. Tao, Shufan Yang, M. Balzer
{"title":"RIVER architecture: Reconfigurable flow and fabric for parallel stream processing on FPGAs","authors":"Dominic Hillenbrand, Christian Brugger, J. Tao, Shufan Yang, M. Balzer","doi":"10.1109/ReCoSoC.2012.6322876","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322876","url":null,"abstract":"The RIVER architecture is a run-time configurable and programmable fabric for parallel stream processing on FPGAs. RIVER's memory architecture has been designed to support non-trivial data flows efficiently and in real-time. The individual data processing cores are called Dynamic Streaming Engines (DSE). Our cloud computing supported design flow generates hundreds of thousands different DSE cores ahead in time. At run-time users may download pre-synthesized DSE cores according to their requirements - for example area and power consumption. Furthermore, our design flow shields users from traditional HDL design flows and tedious design optimizations. However, we do not impede architectural changes but provide support for them through custom instructions and numerous design time options. Our results suggest that our architecture performs well for computational- and memory-intensive kernels such as 2-dimensional convolution. By comparison to recently published, highly specialized architectures we achieve higher clock speeds and offer additional, non-trivial features thanks to our sophisticated memory architecture.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123614194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Timo Stripf, Oliver Oey, Thomas Bruckschlögl, Ralf König, M. Hübner, J. Becker, G. Rauwerda, K. Sunesen, N. Kavvadias, G. Dimitroulakos, K. Masselos, D. Kritharidis, N. Mitas, G. Goulas, P. Alefragis, N. Voros, Steven Derrien, D. Ménard, O. Sentieys, D. Göhringer, T. Perschke
{"title":"A flexible approach for compiling scilab to reconfigurable multi-core embedded systems","authors":"Timo Stripf, Oliver Oey, Thomas Bruckschlögl, Ralf König, M. Hübner, J. Becker, G. Rauwerda, K. Sunesen, N. Kavvadias, G. Dimitroulakos, K. Masselos, D. Kritharidis, N. Mitas, G. Goulas, P. Alefragis, N. Voros, Steven Derrien, D. Ménard, O. Sentieys, D. Göhringer, T. Perschke","doi":"10.1109/ReCoSoC.2012.6322879","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322879","url":null,"abstract":"The mapping process of high performance embedded applications to today's reconfigurable multiprocessor System-on-Chip devices suffers from a complex toolchain and programming process. Thus, the efficient programming of such architectures in terms of achievable performance and power consumption is limited to experts only. Enabling them to nonexperts requires a simplified programming process that hides the complexity of the underlying hardware - introduced by software parallelism of multiple cores and the flexibility of reconfigurable architectures - to the end user. The Architecture oriented paraLlelization for high performance embedded Multi-core systems using scilAb (ALMA) European project aims to bridge these hurdles through the introduction and exploitation of a Scilab- and architecture-description-language-based toolchain which enables the efficient mapping of applications on multiprocessor platforms from high level of abstraction. This holistic solution of the toolchain allows the complexity of both the application and the architecture to be hidden, which leads to a better acceptance, reduced development costs, and shorter time-to-market.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123926297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distributed control for reconfigurable FPGA systems: A high-level design approach","authors":"C. Trabelsi, S. Meftali, J. Dekeyser","doi":"10.1109/ReCoSoC.2012.6322871","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322871","url":null,"abstract":"Due to their exponential complexity, designing adaptation control for Reconfigurable Systems-on-Chip (RSoC) is becoming one of the most challenging tasks, resulting in longer design cycles and increased time-to-market. This paper addresses this issue and proposes a novel adaptation control design approach for FPGA-based reconfigurable systems aiming to increase design productivity. This approach combines control distribution and high-level modeling in order to decrease design complexity and enhance design reuse and scalability. Control distribution is based on allocating local control aspects (monitoring, decision and reconfiguration) to distributed controllers, while respecting global system constraints/objectives using a coordinator. High-level modeling makes use of Model-Driven Engineering and the MARTE (Modeling and Analysis of Real-Time and Embedded Systems) standard in order to move from high level models to automatic code generation, which significantly simplifies the control design. The proposed design approach is integrated in a model-driven RSoC design flow and allows to model adaptation aspects at different design levels: application, architecture, allocation and deployment, which allows to target a wide range of control requirements. In order to validate our approach, a video processing application was implemented on a reconfigurable system that contained four distributed hardware controllers.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132340880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Invited paper: On-chip monitoring for adaptive heterogeneous multicore systems","authors":"D. Göhringer, Mounir Chemaou, M. Hübner","doi":"10.1109/ReCoSoC.2012.6322872","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322872","url":null,"abstract":"Ad-hoc and dynamic adaptation to the requirements of an application enables to increase the energy efficiency of a processor. This method is especially for novel heterogeneous multicore systems of high interest since the various cores can be adapted individually. For this purpose, the current status of the system has to be monitored on the chip. Parameters of interest are the number of communications between cores as well as the current program sequence which is currently processed on an individual processor core. A program sequence is e.g. a loop, control flow or even the termination of the program. All this information has to be extracted from the standard processor cores through a set of on-chip monitors. The concept, as well as the physical realization of two different on-chip monitors for extracting information about the communication effort and for detecting program sequences, is presented in this paper.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"246 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123022161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Iakymchuk, A. R. Muñoz, J. V. Francés, M. Bataller-Mompeán
{"title":"Fast spiking neural network architecture for low-cost FPGA devices","authors":"T. Iakymchuk, A. R. Muñoz, J. V. Francés, M. Bataller-Mompeán","doi":"10.1109/ReCoSoC.2012.6322906","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322906","url":null,"abstract":"Spiking Neural Networks (SNN) consist of fully interconnected computation units (neurons) based on spike processing. This type of networks resembles those found in biological systems studied by neuroscientists. This paper shows a hardware implementation for SNN. First, SNN require the inputs to be spikes, being necessary a conversion system (encoding) from digital values into spikes. For travelling spikes, each neuron interconnection is characterized by weights and delays, requiring an internal neuron processing by a Postsynaptic Potential (PSP) function and membrane potential threshold evaluation for a postsynaptic output spike generation. In order to model a real biological system by artificial SNN, the number of required neurons is very high (thousands). In this work, we propose a SNN architecture able to adapt big size networks using reduced hardware resources. While spikes are processed at 1ms time, inter spike time is used for internal calculations, a mixed serial-parallel structure allows optimized computation of all neuron output values. Results show that SNN can be accommodated using a medium-size FPGA device such as Xilinx Spartan 3 with processing speed comparable to fully parallel implementations with up to 70% resource reduction.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125202335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Bekiaris, Efstathios Sotiriou-Xanthopoulos, G. Economakos, D. Soudris
{"title":"Systematic design and evaluation of a scalable reconfigurable multiplier scheme for HLS environments","authors":"D. Bekiaris, Efstathios Sotiriou-Xanthopoulos, G. Economakos, D. Soudris","doi":"10.1109/ReCoSoC.2012.6322890","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322890","url":null,"abstract":"Modern digital design has been greatly forced to offer More-Moore integration densities and very high operation frequencies for demanding applications. In this search-for-performance race, alternative and less radical More-than-Moore solutions are emerging, like reconfigurable computing. Reconfigurable computing stands between hardware and software and promises to offer the former's performance alongside with the latter's flexibility. Research in the field deals with fine or coarse grain reconfigurable components and efficient ways to map applications onto them. In this paper, a systematic design methodology and evaluation of a coarse grain reconfigurable component targeting the ASIC domain is presented. The specific component is a morphable architecture, that works in mutually exclusive modes, offering different functionality in each mode. The novelty presented in this paper is a systematic evaluation of the scalability of the morphable component. Continuously functionally improved modes are evaluated for performance, area and power, in order to choose the best architecture for a number of widely used DSP applications. Overall, a power* performance improvement of up to 24% is reported and a power* area of up to 13% compared to conventional, non-reconfigurable component architectures.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124669658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A framework for self-reconfigurable DCTs based on multiobjective optimization of the Power-Performance-Accuracy space","authors":"D. Llamocca, M. Pattichis, Cesar Carranza","doi":"10.1109/RECOSOC.2012.6322903","DOIUrl":"https://doi.org/10.1109/RECOSOC.2012.6322903","url":null,"abstract":"We present a framework for the implementation of self-reconfigurable 2D Discrete Cosine Transforms (DCTs). Dynamic Partial Reconfiguration (DPR) and Dynamic Frequency Control lead to a multi-objective optimization scheme that generates Pareto-optimal realizations from the Power-Performance-Accuracy (PPA) space. The PPA space is created by evaluating the 2D DCTs realizations in terms of their power consumption, performance, and accuracy. Dynamic PPA management can then carried out by selecting Pareto-optimal realizations that meet time-varying PPA constraints.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128704335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using genetic algorithms to map hard real-time on NoC-based systems","authors":"Adrian Racu, L. Indrusiak","doi":"10.1109/ReCoSoC.2012.6322893","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2012.6322893","url":null,"abstract":"This paper investigates the effectiveness of genetic algorithms (GAs) for static task scheduling in wormhole Network-on-Chip-based systems. The overall objective was to get the application model mapped onto the architecture so that all tasks and communication meet their deadlines. Inter-task communication is accounted for by using analytical methods. The GA explores both the mapping of tasks as well as the priority ordering of the task set. A novel fitness function was developed and found to perform better than existing functions.","PeriodicalId":263746,"journal":{"name":"7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128708593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}