R. Oliveira, Hélio M. de Oliveira, R. Souza, E. J. P. Santos
{"title":"A flexible implementation of a matrix Laurent series-based 16-point fast Fourier and Hartley transforms","authors":"R. Oliveira, Hélio M. de Oliveira, R. Souza, E. J. P. Santos","doi":"10.1109/SPL.2010.5483017","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483017","url":null,"abstract":"This paper describes a flexible architecture for implementing a new fast computation of the discrete Fourier and Hartley transforms, which is based on a matrix Laurent series. The device calculates the transforms based on a single bit selection operator. The hardware structure and synthesis are presented, which handled a 16-point fast transform in 65 nsec, with a Xilinx SPARTAN 3E device.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133680632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Muñoz, Diego F. Sánchez, C. Llanos, M. Ayala-Rincón
{"title":"FPGA based floating-point library for CORDIC algorithms","authors":"D. Muñoz, Diego F. Sánchez, C. Llanos, M. Ayala-Rincón","doi":"10.1109/SPL.2010.5483002","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483002","url":null,"abstract":"Computation of floating-point transcendental functions has a relevant importance in a wide variety of scientific applications, where the area cost, error and latency are important requirements to be attended. This paper describes a flexible FPGA implementation of a parameterizable floating-point library for computing sine, cosine, arctangent and exponential functions using the CORDIC algorithm. The novelty of the proposed architecture is that by sharing the same resources the CORDIC algorithm can be used in two operation modes, allowing it to compute the sine, cosine or arctangent functions. Additionally, in case of the exponential function, the architectures change automatically between the CORDIC or a Taylor approach, which helps to improve the precision characteristics of the circuit, specifically for small input values after the argument reduction. Synthesis of the circuits and an experimental analysis of the errors have demonstrated the correctness and effectiveness of the implemented cores and allow the designer to choose, for general-purpose applications, a suitable bit-width representation and number of iterations of the CORDIC algorithm.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126674183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Freijedo, M. Valdés, M. J. Moure, L. Costas, J. Rodríguez-Andina, J. Semião, F. Vargas, I. Teixeira, J. P. Teixeira
{"title":"Delay modeling for power noise-aware design in Spartan-3A FPGAs","authors":"J. Freijedo, M. Valdés, M. J. Moure, L. Costas, J. Rodríguez-Andina, J. Semião, F. Vargas, I. Teixeira, J. P. Teixeira","doi":"10.1109/SPL.2010.5483026","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483026","url":null,"abstract":"There is a continuously increasing demand for lower power consumption and higher operating frequencies in digital systems. In addition, external or operation-induced disturbances may significantly affect circuit functionality or performance. This paper analyzes the effect of power supply disturbances on the propagation delays of digital circuits implemented in Spartan-3A FPGAs and demonstrates that a previously proposed time management methodology can successfully be applied to the design of circuits with increased robustness to these disturbances. Experimental results are presented that support the claimed contributions of the work.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123479885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A general-purpose dynamically reconfigurable SVM","authors":"J. Gomes Filho, M. Raffo, M. Strum, W. Chau","doi":"10.1109/SPL.2010.5483031","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483031","url":null,"abstract":"This paper presents an hardware implementation of the Sequential Minimal Optimization (SMO) for the Support Vector Machine (SVM) training phase. A general-purpose reconfigurable architecture, aimed to partial reconfiguration FPGAs, is developed, i.e., it supports different sizes of training sets, with wide-range number of samples and elements. The effects of fixed-point implementation are analyzed and data on area and frequency targeting the Xilinx Virtex-IV XC4VLX25 FPGA are provided. The architecture was able to perform the training in different learning benchmarks and the reconfigurable architecture was able to save 22.38% of FPGA's area.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124515234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Parrilla, Encarnación Castillo, Antonio García, D. González, A. Lloris, E. Todorovich, E. Boemo
{"title":"Protection of microprocessor-based cores for FPL devices","authors":"L. Parrilla, Encarnación Castillo, Antonio García, D. González, A. Lloris, E. Todorovich, E. Boemo","doi":"10.1109/SPL.2010.5483008","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483008","url":null,"abstract":"Microprocessor cores are widely used in the development of complex digital systems. In this paper, a new scheme for the IP protection of microprocessor cores is presented. The proposed framework can perform this task in two ways: the hosting of a digital signature using watermarking techniques that allows claiming authorship rights; and the introduction of additional hardware limiting the functionality of the core if it is not activated. This last feature enables the distribution of cores in “demo” mode. The protection method, named μIPP@HDL provides a robust protection system, while maintaining low overhead and a reasonable area increase, as experimental results show.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130152073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and implementation of packet switching capabilities on 10GbE MAC core","authors":"Roman Arenas, J. Finochietto, Leonardo M Rocha","doi":"10.1109/SPL.2010.5483024","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483024","url":null,"abstract":"This paper proposes the integration of packet switching capabilities to 10 Gigabit Ethernet (10 GbE) Medium Access Control (MAC) devices. For this purpose, the architecture of a MAC core is first analyzed to evaluate where these capabilities can be best placed. Next, a general packet switching architecture is proposed which comprises classification, queueing and scheduling stages. The proposed classification stage exploits the intrinsic latency of the MAC processing to simultaneously inspect the packet header and determine the destination queue where the packet is to be stored. The proposed architecture was implemented and integrated inside a 10 GbE MAC core, and validated on a FPGA development board. The main contribution of this work is the analysis, design and verification of an advance MAC core implementation which integrates switching capabilities. This architecture is evaluated in terms of resource usage and scalability.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"225 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122250701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A placement tool for a NOC-based dynamically reconfigurable system","authors":"M. Raffo, J. Gomes Filho, M. Strum, W. Chau","doi":"10.1109/SPL.2010.5483005","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483005","url":null,"abstract":"In the last years, Field programmable gate-arrays (FPGAs) with partial reconfiguration capabilities have raised interest in the implementation of dynamically reconfigurable systems. It has not become a mainstream activity though, due to the lack of solid design methodologies and associated tools. One of the approaches aimed to free the designer of lower level implementation details is to use structured communication resources to provide the interaction between reconfigurable partitions (modules). The architecture of a network-on-chip (NoC) based dynamically reconfigurable system and a placement tool, which automatically places all of its modules, is presented. The tool takes the partitioned design information and the restrictions imposed by the device family architecture into consideration. The basics of the placement algorithm and a study-case as an example are presented.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126018620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallel decimal multipliers using binary multipliers","authors":"M. Véstias, H. Neto","doi":"10.1109/SPL.2010.5483001","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483001","url":null,"abstract":"Human-centric applications, like financial and commercial, depend on decimal arithmetic since the results must match exactly those obtained by human calculations. The IEEE-754 2008 standard for floating point arithmetic has definitely recognized the importance of decimal for computer arithmetic. A number of hardware approaches have already been proposed for decimal arithmetic operations, including addition, subtraction, multiplication and division. However, few efforts have been done to develop decimal IP cores able to take advantage of the binary multipliers available in most reconfigurable computing architectures. In this paper, we analyze the tradeoffs involved in the design of a parallel decimal multiplier, for decimal operands with 8 and 16 digits, using existent coarse-grained embedded binary arithmetic blocks. The proposed circuits were implemented in a Xilinx Virtex 4 FPGA. The results indicate that the proposed parallel multipliers are very competitive when compared to decimal multipliers implemented with direct manipulation of BCD numbers.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126649025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Genetic Programming based approach for efficiently exploring architectural communication design space of MPSoCs","authors":"Guilherme Esmeraldo, E. Barros","doi":"10.1109/SPL.2010.5483006","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483006","url":null,"abstract":"New integrated circuits technologies and the demand for more complex applications have created Multi-Processor System-on-Chip (MPSoC). MPSoC is a complex integrated circuit, which can be composed of microprocessors, buses, memories and others computational system components. As the number and variety of components of today's MPSoC is increasing, its communication architecture is becoming a limiting factor for applications performance and power consumption. Thus, techniques have been created for exploring the design space in order to find out the best communication architecture for a given application. Such techniques, however, are either inaccurate (by using static analysis based approaches) or very time consuming since each communication configuration of the design space must be simulated (by using simulation models) or estimated (using mixed approaches). This paper presents a new approach to explore the design space of bus-based communication architectures of MPSoCs using Generalized Linear Models and Genetic Programming. By using the proposed approach, some experiments show that it was possible to explore a subset of the design space and to identify the best communication configuration for a given application reducing 90% of the exploration time with less of 3,8% mean global error.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127774962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Decimal division: Algorithms and FPGA implementations","authors":"J. Deschamps, G. Sutter","doi":"10.1109/SPL.2010.5483000","DOIUrl":"https://doi.org/10.1109/SPL.2010.5483000","url":null,"abstract":"The work reported in this paper is devoted to the FPGA implementation of decimal dividers. Two types of dividers are described. The first one implements a decimal non-restoring like algorithm and uses ripple-carry operators. For medium size operators it gives a good compromise between cost and latency. The second one implements an SRT-like algorithm and uses carry-free operators. Their latencies are close to that of a binary radix-16 divider with the same range, implemented in the same FPGA.","PeriodicalId":372692,"journal":{"name":"2010 VI Southern Programmable Logic Conference (SPL)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116851457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}