2016 International Symposium on Rapid System Prototyping (RSP)最新文献

筛选
英文 中文
A HW/SW embedded system for accelerating diagnosis of glaucoma from eye fundus images 基于眼底图像加速青光眼诊断的硬件/软件嵌入式系统
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990303
P. Dantas, Andrea Sarmento, Adriano Sarmento
{"title":"A HW/SW embedded system for accelerating diagnosis of glaucoma from eye fundus images","authors":"P. Dantas, Andrea Sarmento, Adriano Sarmento","doi":"10.1145/2990299.2990303","DOIUrl":"https://doi.org/10.1145/2990299.2990303","url":null,"abstract":"Glaucoma is an irreversible eye disease in which the optic nerve is progressively damaged leading to blindness. However, it is manageable if diagnosed early. The most common screening exam for glaucoma diagnosis is the eye fundus evaluation in which the ophthalmologist examines the optic nerve and estimates the Vertical Cup to Disc Ratio (VCDR). Currently, VCDR evaluation is performed by ophthalmologists based on their visual perception and experience. This paper explores different embedded architectures based on low power processors and describes a HW/SW embedded system that automatically calculates VCDR from eye fundus images using image processing techniques. Optic disc diameter is calculated by a HW accelerator, while optic cup diameter is calculated by SW. This resulted in an embedded system that reduces at least 30% of the execution time of a SW-only implementation and that is significantly faster than other related works based on desktop computers. The proposed system was tested on 70 eye fundus images and achieved a 97.72% accuracy rate.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125073675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
HAMEX: heterogeneous architecture and memory exploration framework HAMEX:异构架构和内存探索框架
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990316
Kasra Moazzemi, Chen-Ying Hsieh, N. Dutt
{"title":"HAMEX: heterogeneous architecture and memory exploration framework","authors":"Kasra Moazzemi, Chen-Ying Hsieh, N. Dutt","doi":"10.1145/2990299.2990316","DOIUrl":"https://doi.org/10.1145/2990299.2990316","url":null,"abstract":"The increasing amount of computation in heterogeneous architectures (including CPU and GPU cores) puts a big burden on memory subsystem. With the gap between compute units and the memory performance getting wider, designing a platform with a responsive memory system becomes more challenging. This issue is exacerbated when memory systems have to satisfy a high volume of traffic generated from heterogeneous compute units. Furthermore, as emerging memory technologies are being introduced to address these issues, a rapid and flexible mechanism is needed to evaluate these technologies in the context of heterogeneous architectures. This paper proposes HAMEX, a framework that enables early design space exploration of heterogeneous systems with a focus on resolving memory access bottlenecks. This framework first allows system designers to easily model heterogeneous architectures that can run both CPU and GPU workloads. Next, given a set of workloads partitioned on various compute units, traffic generated by these units are captured in order to explore different memory systems. We show the feasibility of design space exploration using HAMEX by simulating a contemporary commercial heterogeneous platform and explore the opportunities for power and performance improvements by adopting different memory technologies.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125904562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Automatic detection and elision of reset sub-circuits 自动检测和省略复位子电路
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990305
Panagiotis Patros, K. Kent
{"title":"Automatic detection and elision of reset sub-circuits","authors":"Panagiotis Patros, K. Kent","doi":"10.1145/2990299.2990305","DOIUrl":"https://doi.org/10.1145/2990299.2990305","url":null,"abstract":"Electronic circuits are too complex to be designed by hand so hardware languages, like Verilog, and Computer Aided Design (CAD) tools are used for these purposes. Two main types of circuits are Application-Specific Integrated Circuits (ASICs) and Field Programmable Gate Arrays (FPGAs). ASICs require a reset sub-circuit to initialize their state; however, such a procedure is not necessary for FPGAs that support power-on reset. We propose and evaluate a tool that automatically detects and elides reset sub-circuits as part of the Verilog-to-Routing (VTR) CAD flow and in particular Odin II. Also, our tool can be used to decide if a reset sub-circuit has been properly implemented and can point towards memory components that have not been initialized. Such a tool is the first to our knowledge. Our tests with the VTR Verilog benchmarks and other Verilog circuits showed significant reductions in resource consumption on the target FPGAs as much as 25.9% shorter critical path, 90.39% shorter maximum net, 62.84% fewer used logic blocks and 30.87% fewer used routing elements. Also, our approach yielded significant reductions in the execution time of the placement-and-routing algorithm for the elided circuits that were as high as 4.5 times faster VPR execution and 3.35 times faster overall VTR flow execution.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126140704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
RapidSoC: short turnaround creation of FPGA based SoCs RapidSoC:基于FPGA的soc的短周期创建
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990314
Jakob Wenzel, C. Hochberger
{"title":"RapidSoC: short turnaround creation of FPGA based SoCs","authors":"Jakob Wenzel, C. Hochberger","doi":"10.1145/2990299.2990314","DOIUrl":"https://doi.org/10.1145/2990299.2990314","url":null,"abstract":"Field Programmable Gate Arrays (FPGA) offer the opportunity to build individual hardware solutions even for applications which are produced in small quantity. Quite often, these customized Systems-on-Chip (SoC) contain soft-core processors and a selection of standard peripherals. Synthesizing such systems can be time consuming and thus, design space exploration can become a rather long process. In this contribution, we show an approach to substantially speed up the time to create such system implementations. The price for this improved synthesis time is a slightly reduced operating frequency, which is acceptable in many cases. Using a set of benchmark system configurations, we evaluate our approach against state of the art commercial synthesis tools in terms of tool runtime, resource utilization and achieved system clock frequency.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123384011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Inter-FPGA routing environment for performance exploration of multi-FPGA systems 用于多fpga系统性能探索的fpga间路由环境
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990317
U. Farooq, R. Chotin-Avot, M. Azeem, Maminionja Ravoson, M. Turki, H. Mehrez
{"title":"Inter-FPGA routing environment for performance exploration of multi-FPGA systems","authors":"U. Farooq, R. Chotin-Avot, M. Azeem, Maminionja Ravoson, M. Turki, H. Mehrez","doi":"10.1145/2990299.2990317","DOIUrl":"https://doi.org/10.1145/2990299.2990317","url":null,"abstract":"Multi-FPGA platforms are a popular choice today for complex system prototyping because they offer high execution speed, low cost, and real world testing experience. However, performance of multi-FPGA based systems is severely affected by widening logic to I/O gap in FPGAs. In order to address the performance issue, in this work, we propose an exploration and optimization flow for multi-FPGA based prototyping that gives an end-to-end experience starting from benchmark generation to optimized inter- FPGA routing. Using generic tools of the flow, ten large benchmarks are generated. Then, through a generic novel inter-FPGA routing environment, effect of variation of number of FPGAs as well as number of inter-FPGA tracks on the performance of a target design is explored. For performance exploration and optimization, five different FPGA boards are utilized where number of FPGAs on board are varied from two to six. Moreover, for each board four different inter-FPGA track combinations are used. Experimental results reveal that multi-FPGA boards with inter-FPGA tracks corresponding optimally to the cut net requirements of benchmarks under consideration give best frequency results. Furthermore, frequency comparison between different boards shows that FPGA board with six FPGAs gives, on average, best frequency results. Finally, we also perform frequency-price analysis which shows that board with four FPGAs gives better frequency-price tradeoff as compared to other FPGA boards under consideration.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121424575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Schedulability-guided exploration of multi-core systems 多核系统的可调度性引导探索
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990319
Rabeh Ayari, Imane Hafnaoui, G. Beltrame, G. Nicolescu
{"title":"Schedulability-guided exploration of multi-core systems","authors":"Rabeh Ayari, Imane Hafnaoui, G. Beltrame, G. Nicolescu","doi":"10.1145/2990299.2990319","DOIUrl":"https://doi.org/10.1145/2990299.2990319","url":null,"abstract":"Efficient mapping of tasks onto heterogeneous multi-core systems is very challenging especially under hard timing constraints. Assigning tasks to processors is an NP-hard problem and solving it requires the use of meta-heuristics. Relevantly, genetic algorithms have already proven to be one of the most powerful and widely used stochastic tools to solve this problem. Conventional genetic algorithms were initially defined as a general evolutionary algorithm based on blind operators. It is commonly admitted that the use of these operators is quite poor for an efficient exploration. Likewise, since exhaustive exploration of the solution space is unrealistic, a potent option is often to guide the exploration process by hints, derived by problem structure. This guided exploration prioritizes fitter solutions to be part of next generations and avoids exploring unpromising configurations by transmitting a set of predefined criteria from parents to children. Consequently, genetic operators, such as crossover, must incorporate specific domain knowledge to intelligently guide the exploration of the solution space. In this paper, we illustrate and evaluate the impact of crossover operators and we propose a hybrid genetic algorithm based on a novel schedulability-guided operator that easily outperforms the classical operators by offering at least 21% improvement in terms of ratio of certainly schedulable tasks.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116609976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Efficient parallel multi-objective optimization for real-time systems software design exploration 面向实时系统软件设计探索的高效并行多目标优化
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990310
Rahma Bouaziz, L. Lemarchand, Frank Singhoff, Bechir Zalila, M. Jmaiel
{"title":"Efficient parallel multi-objective optimization for real-time systems software design exploration","authors":"Rahma Bouaziz, L. Lemarchand, Frank Singhoff, Bechir Zalila, M. Jmaiel","doi":"10.1145/2990299.2990310","DOIUrl":"https://doi.org/10.1145/2990299.2990310","url":null,"abstract":"Real-time embedded systems may be composed of a large number of time constrained functions. During software architecture design, these functions must be assigned to tasks that will run the functions on the top of a real-time operating systems (RTOS). This is a challenging work due to the large number of valid candidate functions to tasks assignment solutions. Moreover, the impact of the assignment on the system performance criteria (often conflicting) should be taken into account in the architecture exploration. The automation of the design exploration by the use of metaheuristics such as multi-objective evolutionary algorithm (MOEA) is a suitable way to help the designers. MOEAs approximate near-optimal alternatives at a reasonable time when compared to an exact search method. However, for large-scale systems even a MOEA method is impractical due to the increased time required to solve a problem instance. To tackle this problem, we present in this article a parallel implementation of the Pareto Archived Evolution Strategy (PAES) algorithm used as a MOEA for the design exploration. The proposed parallelization method is based on the well-known Master-Slave paradigm. Additionally, it involves a new selection scheme in the PAES algorithm. Results of experimentations provide evidence that, on one hand, the parallel approach can considerably speed up the design exploration and the optimization processes. On the other hand, the proposed selection strategy improves the quality of obtained solutions as compared to the original PAES selection schema.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116563663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Rapid SOC prototyping utilizing quilt packaging technology for modular functional IC partitioning 快速SOC原型利用被子封装技术的模块化功能IC划分
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990313
Tian Lu, Carlos Ortega, J. Kulick, G. Bernstein, Scott Ardisson, Rob Engelhardt
{"title":"Rapid SOC prototyping utilizing quilt packaging technology for modular functional IC partitioning","authors":"Tian Lu, Carlos Ortega, J. Kulick, G. Bernstein, Scott Ardisson, Rob Engelhardt","doi":"10.1145/2990299.2990313","DOIUrl":"https://doi.org/10.1145/2990299.2990313","url":null,"abstract":"A microchip integration technology called Quilt Packaging (QP) enables rapid prototyping of complex SoCs and microwave/RF systems, as well as optical, power, and DSP applications. QP is a direct edge-to-edge chip-level interconnect technology that can be implemented in a variety of materials and/or process technologies, and has been demonstrated in both planar and non-planar 3D architectures. Quilt Packaging technology can be applied to create a “Lego-like” design kit for ultra-fast prototyping and proof-ofconcept chip-level system verification. Partitioning subcomponents into small, inexpensive “chiplets” can allow for much faster design turns and greatly reduced first-pass prototype verification. In addition, QP enables low-loss, high-throughput chip-to-chip I/O interconnects while reducing size, weight, and power requirements, lessening the burden of design trade-offs for hardware system designers developing the next generation of microelectronic systems.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125768404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Architecture exploration of intelligent robot system using ros-compliant FPGA component 基于ros-compliant FPGA组件的智能机器人系统架构探索
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990312
Takeshi Ohkawa, Kazushi Yamashina, Takuya Matsumoto, K. Ootsu, T. Yokota
{"title":"Architecture exploration of intelligent robot system using ros-compliant FPGA component","authors":"Takeshi Ohkawa, Kazushi Yamashina, Takuya Matsumoto, K. Ootsu, T. Yokota","doi":"10.1145/2990299.2990312","DOIUrl":"https://doi.org/10.1145/2990299.2990312","url":null,"abstract":"This paper presents a novel method for architecture exploration of an intelligent robot system while satisfying high processing performance at low power by utilizing FPGA and remote computing resources. In order to ease development complexity in the conventional architecture exploration, ROS-compliant FPGA component technology is employed. As a case study, Visual SLAM (Self Localization and Mapping) processing is studied, which is important for realizing intelligent autonomous robots. Some part of Visual SLAM processing is to be off-loaded onto a remote server outside a robot and to be processed parallel in the server. At the same time, the essential part of front-end of SLAM processing stays in the robot itself to reduce communication traffic between the robot and the remote computing resources. We have studied SLAM processing to find optimum function partitioning. In order to distribute and parallelize this processing, we explored processing architecture for trade-offs of power and performance.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132171068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Transforming VHDL descriptions into formal component-based models 将VHDL描述转换为正式的基于组件的模型
2016 International Symposium on Rapid System Prototyping (RSP) Pub Date : 2016-10-01 DOI: 10.1145/2990299.2990320
Ayoub Nouri, Rahma Ben Atitallah, A. Molnos, Christian Fabre, Frédéric Heitzmann, O. Debicki
{"title":"Transforming VHDL descriptions into formal component-based models","authors":"Ayoub Nouri, Rahma Ben Atitallah, A. Molnos, Christian Fabre, Frédéric Heitzmann, O. Debicki","doi":"10.1145/2990299.2990320","DOIUrl":"https://doi.org/10.1145/2990299.2990320","url":null,"abstract":"In this work, we investigate a transformation of VHDL descriptions into equivalent formal models. The targeted equivalence is at the level of the functional behavior. That is, we aim at producing formal models that have the same functional simulation behavior as the original VHDL implementation. We rely on the BIP component-based modeling language as the underlying formalism for this transformation. The expected benefits of such a transformation are: enabling the formal verification of hardware designs, allowing for software/hardware system modeling within the same formal framework, and, potentially, accelerating VHDL designs functional simulation by producing distributed BIP models. We show, through a case study, that the transformation is feasible and worth to develop.","PeriodicalId":407053,"journal":{"name":"2016 International Symposium on Rapid System Prototyping (RSP)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114479607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信