{"title":"用于延迟不敏感FPGA设计的高性能宏块的自动时序表征","authors":"Roberto Sierra, C. Carreras, G. Caffarena","doi":"10.1109/PATMOS.2018.8464170","DOIUrl":null,"url":null,"abstract":"Traditional FPGA design workflow requires the complete system in order to perform timing characterization. However, high-performance digital systems implementing scientific applications frequently occupy a large area and are suitably developed following a latency insensitive design approach to achieve multi-level parallelism. The produced circuits are composed of several deeply pipelined specialized computational macroblocks and try to maximize both the operating frequency and the usage of FPGA device resources to full capacity. These goals are most appropriately attainable through a decentralized control strategy and the optimization of performance for individual blocks independently from each other. This work proposes a synthesizable virtual wrapper architecture which does not add any functionality but simulates complete system conditions for the timing characterization of individual blocks using standard low-level synthesis tools. It further presents a high-level tool to generate both the computational blocks and the wrapper from a functional specification in C language, and shows that large blocks with inputs and outputs exceeding the available number of FPGA pins can be automatically generated and characterized. In particular, results for the blocks generated for a real computational fluid dynamics application are provided.","PeriodicalId":234100,"journal":{"name":"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automated Timing Characterization of High-Performance Macroblocks for Latency Insensitive FPGA Designs\",\"authors\":\"Roberto Sierra, C. Carreras, G. Caffarena\",\"doi\":\"10.1109/PATMOS.2018.8464170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional FPGA design workflow requires the complete system in order to perform timing characterization. However, high-performance digital systems implementing scientific applications frequently occupy a large area and are suitably developed following a latency insensitive design approach to achieve multi-level parallelism. The produced circuits are composed of several deeply pipelined specialized computational macroblocks and try to maximize both the operating frequency and the usage of FPGA device resources to full capacity. These goals are most appropriately attainable through a decentralized control strategy and the optimization of performance for individual blocks independently from each other. This work proposes a synthesizable virtual wrapper architecture which does not add any functionality but simulates complete system conditions for the timing characterization of individual blocks using standard low-level synthesis tools. It further presents a high-level tool to generate both the computational blocks and the wrapper from a functional specification in C language, and shows that large blocks with inputs and outputs exceeding the available number of FPGA pins can be automatically generated and characterized. In particular, results for the blocks generated for a real computational fluid dynamics application are provided.\",\"PeriodicalId\":234100,\"journal\":{\"name\":\"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PATMOS.2018.8464170\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 28th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PATMOS.2018.8464170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automated Timing Characterization of High-Performance Macroblocks for Latency Insensitive FPGA Designs
Traditional FPGA design workflow requires the complete system in order to perform timing characterization. However, high-performance digital systems implementing scientific applications frequently occupy a large area and are suitably developed following a latency insensitive design approach to achieve multi-level parallelism. The produced circuits are composed of several deeply pipelined specialized computational macroblocks and try to maximize both the operating frequency and the usage of FPGA device resources to full capacity. These goals are most appropriately attainable through a decentralized control strategy and the optimization of performance for individual blocks independently from each other. This work proposes a synthesizable virtual wrapper architecture which does not add any functionality but simulates complete system conditions for the timing characterization of individual blocks using standard low-level synthesis tools. It further presents a high-level tool to generate both the computational blocks and the wrapper from a functional specification in C language, and shows that large blocks with inputs and outputs exceeding the available number of FPGA pins can be automatically generated and characterized. In particular, results for the blocks generated for a real computational fluid dynamics application are provided.