M. Dreschmann, O. Sander, Alexander Klimm, Christoph Roth, J. Becker
{"title":"Addiguration: Exploring configuration behavior of Spartan-3 devices","authors":"M. Dreschmann, O. Sander, Alexander Klimm, Christoph Roth, J. Becker","doi":"10.1109/ReCoSoC.2013.6581543","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581543","url":null,"abstract":"Reconfigurability of FPGAs is an enabler for many applications. In recent years a lot of different reconfiguration approaches and methodologies were presented, including full and partial reconfiguration of devices. In this paper we present the novel methodology of addiguration for Xilinx Spartan-3 devices. Our proposed methodology exploits special properties of the configuration logic and enables to incrementally configure on top of an existing design already running on an FPGA. This work demonstrates a basic technology that might be exploited for testing scenarios or as an entry vector for attacks on FPGA designs. Prototypical experiments have shown that the addiguration can easily be carried out during runtime of the device and over a standard configuration interface.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116028223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SoC performance evaluation with ArchC and TLM-2.0","authors":"Jörg Walter, Jörg Lenhardt, W. Schiffmann","doi":"10.1109/ReCoSoC.2013.6581521","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581521","url":null,"abstract":"ArchC is an architecture description language that provides instruction set level simulation and binary tool chain generation. It is based on SystemC and can communicate with other SystemC components using transaction level modeling (TLM). In this article we present an upgrade of ArchC that allows TLM-2.0 usage and makes it available in timed simulations. These extensions enable performance evaluation of complete System-on-Chip designs built around an ArchC processor model. As a proof-of-concept, we examine various TLM-connected memory hierarchies. We outline how model designers can use a combination of fast functional simulation and slow timed simulation to determine an optimal system architecture for a given workload.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117028158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Eduardo Cuevas-Farfan, M. Morales-Sandoval, R. Cumplido, C. F. Uribe, I. Algredo-Badillo
{"title":"A programmable FPGA-based cryptoprocessor for bilinear pairings over F2m","authors":"Eduardo Cuevas-Farfan, M. Morales-Sandoval, R. Cumplido, C. F. Uribe, I. Algredo-Badillo","doi":"10.1109/ReCoSoC.2013.6581528","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581528","url":null,"abstract":"Bilinear pairings over elliptic curves are an emerging research field in cryptography. First cryptographic protocols based on bilinear pairings were proposed by the year 2000 and currently they are not standardized yet. The computation of bilinear pairings relies on arithmetic over finite fields. In the literature, several works have focused in the design of custom hardware architectures for efficient implementation of this arithmetic, but in a non-standardized environment a flexible design is prefered in order to support changes in the specifications. This paper presents the design and implementation of a novel programmable cryptoprocessor for computing bilinear pairings over binary fields in FPGA, which is able to support different algorithms and corresponding parameters as the elliptic curve, the tower field and the distortion map. The results show that high flexibility is achieved by the proposed cryptoprocessor at a competitive timing and area usage, when it is compared to custom designs for pairings defined over singular/supersingular elliptic curves at a 128-bit security level.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127215673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qian Zhao, M. Amagasaki, M. Iida, M. Kuga, T. Sueyoshi
{"title":"An FPGA design and implementation framework combined with commercial VLSI CADs","authors":"Qian Zhao, M. Amagasaki, M. Iida, M. Kuga, T. Sueyoshi","doi":"10.1109/ReCoSoC.2013.6581534","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581534","url":null,"abstract":"Conventional full-custom reconfigurable logic device design and implementation are time consuming processes. In this research, we propose a design framework in order to improve FPGA IP core design efficiency by link academic FPGA design flow and commercial VLSI CADs based on the synthesizable method. A novel FPGA routing tool is developed in this framework, namely the EasyRouter. By using simple templates, EasyRouter can automatically generate the HDL codes and the configuration bitstream for an FPGA. With this design flow, accurate physical information can be reported when a new FPGA architecture is evaluated with reliable commercial VLSI CADs. For FPGA architectures that cannot be easily implemented with present VLSI process, EasyRouter provides a fast performance analysis flow, which improved delay accuracy 5.1 times than VPR on average.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127237605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vianney Lapôtre, M. Hübner, G. Gogniat, Purushotham Murugappa, A. Baghdadi, J. Diguet
{"title":"An efficient on-chip configuration infrastructure for a flexible multi-ASIP turbo decoder architecture","authors":"Vianney Lapôtre, M. Hübner, G. Gogniat, Purushotham Murugappa, A. Baghdadi, J. Diguet","doi":"10.1109/ReCoSoC.2013.6581518","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581518","url":null,"abstract":"Dynamic reconfiguration of multiprocessor platforms is an important challenge for System-on-Chip designers. Addressing this issue is mandatory in order to manage the increasing number of applications and execution conditions that multiprocessor platforms have to face. In this paper, a novel configuration infrastructure for the UDec multi-ASIP turbo decoder architecture is presented. Our approach leads to split the interconnection architecture in two subsets, one dedicated for data and another dedicated for configuration. Indeed both types of communication do not have the same requirements. Our novel configuration infrastructure, which proposes an area efficient and low latency solution, has been validated through a two-step approach. First a SystemC/VHDL mixed simulation model has been developed to perform an early performance evaluation, second a hardware FPGA prototype has been built. Results show that up to 64 processing elements can be dynamically configured in 5.352 μs.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132171510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marco Ramírez, M. Daneshtalab, P. Liljeberg, J. Plosila
{"title":"Towards a Configurable Many-core Accelerator for FPGA-based embedded systems","authors":"Marco Ramírez, M. Daneshtalab, P. Liljeberg, J. Plosila","doi":"10.1109/ReCoSoC.2013.6581548","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581548","url":null,"abstract":"Hardware accelerators release the general purpose processor of a system from very compute-demanding tasks. This work presents a Configurable Many-core Accelerator for FPGA-based systems, named CoMA. Its architecture combines an array of processing cores interconnected by an NoC, with an I/O interface based on the AXI protocol. CoMA provides the designer with a system abstraction layer that facilitates task partitioning and peripheral access. The implementation of the I/O interface was verified through simulation, and synthesized for an FPGA.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114631076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dominic Hillenbrand, Yuuki Furuyama, Akihiro Hayashi, Hiroki Mikami, K. Kimura, H. Kasahara
{"title":"Reconciling application power control and operating systems for optimal power and performance","authors":"Dominic Hillenbrand, Yuuki Furuyama, Akihiro Hayashi, Hiroki Mikami, K. Kimura, H. Kasahara","doi":"10.1109/ReCoSoC.2013.6581539","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581539","url":null,"abstract":"In the age of dark silicon on-chip power control is a necessity. Upcoming and state of the art embedded- and cloud computer system-on-chips (SoCs) already provide interfaces for fine grained power control. Sometimes both: core- and interconnect-voltage and frequency can be scaled for example. To further reduce power consumption SoCs often have specialized accelerators. Due to the rising specialization of hard- and software general purpose operating systems require changes to exploit the power saving opportunities provided by the hardware. However, they lack detailed hardware- and application-level-information. Application-level power control in turn is still very uncommon and difficult to realize. Now a days vendors of mobile devices are forced to tweak and patch system-level software to enhance the power efficiency of each individual product. This manual process is time consuming and must be re-iterated for each new product. In this paper we explore the opportunities and challenges of automatic application- level power control using compilers.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"1998 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128239487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On a FPGA-based method for authentication using Edwards curves","authors":"André Himmighofen, Bernhard Jungk, S. Reith","doi":"10.1109/ReCoSoC.2013.6581530","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581530","url":null,"abstract":"Modern cryptographic authentication protocols have a wide range of real world applications to gain assurance about the identity of a potential communication partner. For example they are used as wireless keys to gain access to an otherwise restricted place, e.g. a closed door or a car. Widely used challenge-response authentication protocols have very often a shared secret, such that the a prover can convince a verifier about his identity by using this secret. The major drawback of this scheme is, that the prover and the verifier both know the shared secret, and thus the distribution of the secret is problematic. This problem can be solved by using asymmetric cryptography and adapting suitable existing protocols to the used cryptographic primitive. This paper evaluates the Schnorr protocol for FPGA implementations and adopts twisted Edwards curves, a variant of elliptic curves, for this study. Both a theoretical introduction and a thorough practical part is provided and in this way the suitability of this combination for hardware implementations is shown.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130752428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new model for estimating bit error probabilities of Ring-Oscillator PUFs","authors":"Matthias Hiller, G. Sigl, Michael Pehl","doi":"10.1109/ReCoSoC.2013.6581531","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581531","url":null,"abstract":"Embedded systems carry and process more and more sensitive information in untrusted environments, where an attacker can wiretap the external communication and also has unlimited physical access to the device. Cryptography protects systems against many of the threats and relies on the security of the cryptographic keys inside the system. Physical Unclonable Functions (PUFs) measure manufacturing variations inside integrated circuits, for example FPGAs, to generate a unique secret PUF response inside each device. Similar to deriving a biometric pattern from human features, the individual pattern inside an FPGA differs slightly from measurement to measurement. From these measurements, the PUF response is generated to derive a secure and reliable cryptographic key. The Ring-Oscillator (RO) PUF is a popular PUF type because of its high randomness and reliability. Frequencies of ROs are compared pairwise to derive one secret bit. So far, the reliability of RO PUFs was evaluated by counting bit flips in measured PUF responses. This work analyzes the distribution of frequency measurements to derive the behavior of the PUF. Analyzing the frequency distributions gives a more precise estimation of the PUF bit error rates than measuring the bit errors after the comparison of two oscillator frequencies. The evaluation of publicly available real world empirical FPGA data has shown that most error probabilities of RO PUF responses are so low that they cannot be measured in feasible time. For almost 200 evaluated FPGAs, more than 70% of the PUF outputs on every FPGA have bit error probabilities under 10-20. We can even ensure this error probability for over 60% of the PUF outputs after a practicable number of frequency measurements with a confidence of 99.9%. Index Terms-Physical Unclonable Functions (PUFs), Ring Oscillator PUF, FPGA, Statistics.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121795116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Centralized traffic monitoring for online-resizable clusters in Networks-on-Chip","authors":"Philipp Gorski, D. Timmermann","doi":"10.1109/ReCoSoC.2013.6581523","DOIUrl":"https://doi.org/10.1109/ReCoSoC.2013.6581523","url":null,"abstract":"Runtime-based traffic monitoring is one of the most essential system services for modern many-core platforms. It ensures self-awareness of the current system load and enables other runtime mechanisms, like application mapping and adaptive routing, to optimize workload operations. While Networks-on-Chip introduced a scalable and massively parallel communication infrastructure for the growing number of on-chip resources, scalable traffic monitoring becomes a key concern. A high degree of runtime adaptivity at different system layers comes with the costs that monitored system states need to be available at different locations, scopes and resolutions. Furthermore, many-core systems with hundreds of resources will be less single purpose and run workloads composed of various application domains, traffic and timing characteristics. Thus, the monitoring operations should be adaptive to consider this variability and provide customizable behavior. In the work at hand, a runtime configurable and cluster-based traffic monitoring is proposed. Our experiments show that in maximum 2% hardware overhead per tile in an 8×8 NoC is needed to enable online resizable clustering of up to 64 IP cores, centralized traffic monitoring of all path and link loads inside these clusters at resolutions of 1%, and configurable monitoring timing adjustment in a range of 103 to 105 clock cycles.","PeriodicalId":354964,"journal":{"name":"2013 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117263565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}