A. Nomura, Hiroki Matsutani, T. Kuroda, J. Kadomoto, Yusuke Matsushita, H. Amano
{"title":"Vertical Packet Switching Elevator Network Using Inductive Coupling ThruChip Interface","authors":"A. Nomura, Hiroki Matsutani, T. Kuroda, J. Kadomoto, Yusuke Matsushita, H. Amano","doi":"10.1109/CANDAR.2016.0043","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0043","url":null,"abstract":"A wireless inductive coupling ThruChip Interface (TCI) is a flexible system-in-package (SiP) technique which makes it possible to build a powerful interconnection network between stacked chips. For easy use of TCI, intellectual properties (IPs) are developed, and interconnection networks which can make use of the IPs are proposed and evaluated based on the real chip implementation. The new proposed elevator network which uses piggyback of the credit packets outperforms the ring network used in the first prototype by 28%-59% in terms of throughput. The performance overhead by the piggyback control is less than 3%-4%.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128383884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Keisuke Fujimoto, Shinya Takamaeda-Yamazaki, Y. Nakashima
{"title":"Stop the World: A Lightweight Runtime Power-Capping Mechanism for FPGAs","authors":"Keisuke Fujimoto, Shinya Takamaeda-Yamazaki, Y. Nakashima","doi":"10.1109/CANDAR.2016.0070","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0070","url":null,"abstract":"Power-constrained computing is now becoming an essential paradigm in both high performance computing and embedded systems. Power budget is dynamically assigned to each computing resource for improving energy efficiency and system throughput. Modern computer systems have accelerator devices, such as GPUs and FPGAs, for higher energy efficiency and performance. Therefore, power management mechanisms of such accelerator devices are required. In this paper, we present a lightweight mechanism of runtime power capping on FPGA systems. According to the amount of a given power budget, instead of the frequency scaling, the proposed mechanism controls the execution speed by throttling off-chip memory accesses from the computing logic, so that the power consumption is accordingly controlled. We evaluated the power controllability of the proposed mechanism by using an FPGA board with an embedded power meter. The result shows that the proposed approach has a high linearity of power control. The result also indicates that the accuracy of the power control depends on throttling interval granularities, and the control accuracy is improved by utilizing a longer throttling interval. Additionally, we compared the power control accuracy with a design-time fixed frequency scaling approach. The result shows that the proposed approach achieves the same accuracy as the static approach, even though the proposed approach allows the runtime power control.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127056486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Takuma Nakajima, Masato Yoshimi, Celimuge Wu, T. Yoshinaga
{"title":"A Light-Weight Content Distribution Scheme for Cooperative Caching in Telco-CDNs","authors":"Takuma Nakajima, Masato Yoshimi, Celimuge Wu, T. Yoshinaga","doi":"10.1109/CANDAR.2016.0033","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0033","url":null,"abstract":"A key technique to reduce the rapid growing of video-on-demand's traffic is a cooperative caching strategy aggregating multiple cache storages. Many internet service providers have considered the use of cache servers on their networks as a solution to reduce the traffic. Existing schemes often periodically calculate a sub-optimal allocation of the content caches in the network. However, such approaches require a large computational overhead that cannot be amortized in a presence of frequent changes of the contents' popularities. This paper proposes a light-weight scheme for a cooperative caching that obtains a sub-optimal distribution of the contents by focusing on their popularities. This was made possible by adding color tags to both cache servers and contents. In addition, we propose a hybrid caching strategy based on Least Frequently Used (LFU) and Least Recently Used (LRU) schemes, which efficiently manages the contents even with a frequent change in the popularity. Evaluation results showed that our light-weight scheme could considerably reduce the traffic, reaching a sub-optimal result. In addition, the performance gain is obtained with a computation overhead of just a few seconds. The evaluation results also showed that the hybrid caching strategy could follow the rapid variation of the popularity. While a single LFU strategy drops the hit ratio by 13.9%, affected by rapid popularity changes, our proposed hybrid strategy could limit the degradation to only 2.3%.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"430 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123275963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ryuta Kawano, Hiroshige Nakahara, I. Fujiwara, Hiroki Matsutani, M. Koibuchi, H. Amano
{"title":"LOREN: A Scalable Routing Method for Layout-Conscious Random Topologies","authors":"Ryuta Kawano, Hiroshige Nakahara, I. Fujiwara, Hiroki Matsutani, M. Koibuchi, H. Amano","doi":"10.1109/CANDAR.2016.0019","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0019","url":null,"abstract":"End-to-end network latency has become an important issue for parallel application on large-scale high performance computing (HPC) systems. It has been reported that randomly-connected inter-switch networks can lower the end-to-end network latency. The trade-off is a large amount of routing information. For irregular networks, minimal routing is achieved by using routing tables for all destinations in the network. In this work, a novel distributed routing method called LOREN (Layout-Oriented Routing with Entries for Neighbors) to achieve low-latency with a small routing table is proposed for irregular networks whose link length is limited. The routing tables contain both physically and topologically nearby neighbor nodes to ensure livelock-freedom and a small number of hops between nodes. Experimental results show that LOREN reduces the average latencies by 2.8% and improves the network throughput by up to 39% compared with a conventional compact routing method. Moreover, the required routing table size is reduced by up to 67%, which improves scalability and flexibility for implementation.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114834280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Effective Page Padding Method for RAM Buffer Algorithms to Enhance the SSD Endurance","authors":"Eri Ogawa, Kenji Kise","doi":"10.1109/CANDAR.2016.0034","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0034","url":null,"abstract":"Solid State Drives (SSDs) have become popular for mobile devices and PCs nowadays due to their advantages compared with conventional hard disk drives. However, they suffer from endurance problems. An effective approach to increase the endurance of an SSD is to improve the management algorithm of the RAM buffer, a cache-like structure inside the SSD. There are several previous studies of RAM buffer management algorithms which improve the I/O performance and endurance of SSDs by reordering the write operations and exploiting the temporal locality of the write stream. We observe that the sequential write patterns of the write stream can be exploited to reduce the number of expensive erase operations. Using this insight, we propose a new effective method for RAM buffer management algorithms to enhance the endurance of SSDs. We evaluate our proposed method by software simulation and the result shows that the proposed method reduces the number of erasures and average response time compared to the best existing method by up to 17.3% and 16.9% respectively.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122617157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Particle Complexity of Universal Finite Number-Conserving Cellular Automata","authors":"A. Alhazov, Katsunobu Imai","doi":"10.1109/CANDAR.2016.0045","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0045","url":null,"abstract":"A number-conserving cellular automaton (NCCA) is a cellular automaton whose states are integers and whose transition function keeps the sum of all cells constant throughout its evolution. It can be seen as a kind of particle-based modeling of the physical conservation law of mass. In this paper we focus on the case we call finite NCCA when states are non-negative integers, and the total sum is finite. In spite of the strong constraint, we constructed a radius 1 universal FNCCA by simulating register machines with two registers. We also consider the particle complexity in the case of large (but finite) radius, and constructed a universal FNCCA with only five particles.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123468117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Workflow Support Based on Mobile Data Stream Management System","authors":"Jiwang Xu, Y. Nakamoto, Shingo Akiyama","doi":"10.1109/CANDAR.2016.0065","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0065","url":null,"abstract":"This paper presents a mobile workflow support system based on mobile stream data management system. A stream data management system is a management system for data generated continually and is used for financial applications, network traffic systems, sensor networks. We developed a mobile data stream management system for Android devices (mDSMS) and applied vehicle data services and information sharing in collaborative works. DSMS not only matches execution of workflow but also facilitates the workflow descriptions to respond to external events in an organization. We present implementation of workflow support using mDSMS and its evaluation based on our development experience, comparison with a workflow pattern, and its performance.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126584905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cold Boot Attack Methods for the Discrete Logarithm Problem","authors":"Hwei-Ming Ying, N. Kunihiro","doi":"10.1109/CANDAR.2016.0037","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0037","url":null,"abstract":"This paper proposes improvements to existing cold boot attack algorithms which greatly reduce the number of correlate tests required at the expense of some memory in discrete logarithm based cryptosystems. In practical key recovery settings, the excess memory incurred is shown to be insignificant when the variable parameter in the algorithms is optimized. The results show that improvements of up to 2.2 times for wNAF encodings and up to 5.5 times in PolarSSL encodings can be achieved based on the number of correlate tests when such optimal values of the variable parameter are applied.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127280202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the Ergodicity of One-Dimensional Linear Cellular Automata with Additive Error","authors":"Ryouta Kouduma, Masato Takei","doi":"10.1109/CANDAR.2016.0047","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0047","url":null,"abstract":"We consider one-dimensional two-state elementary linear cellular automata, perturbed by an additive noise: Each cell is updated by a linear function of its finite neighborhood, and the value is erroneously reversed with a probability which may depend on the time step and the cell, independently of each other. The state given by fair coin tossing is invariant for this system. The aim of this paper is to give a necessary and sufficient condition for ergodicity, which means that irrespective of the choice of the initial state, the system approaches the state given by fair coin tossing as time goes on. When error probabilities are space-homogeneous but time-inhomogeneous, the condition for ergodicity is the same provided that the linear cellular automaton is 'non-trivial', i.e. it is not an identity map or a shift map. On the other hand, when error probabilities are time-homogeneous but space-inhomogeneous, the condition for ergodicity heavily depends on the choice of a rule of linear cellular automata.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133119761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Koibuchi, I. Fujiwara, Fabien Chaix, H. Casanova
{"title":"Towards Ideal Hop Counts in Interconnection Networks with Arbitrary Size","authors":"M. Koibuchi, I. Fujiwara, Fabien Chaix, H. Casanova","doi":"10.1109/CANDAR.2016.0042","DOIUrl":"https://doi.org/10.1109/CANDAR.2016.0042","url":null,"abstract":"Designing low-latency network topologies of switches is a key objective for next-generation parallel computing platforms. Low latency is preconditioned on low hop counts, but existing network topologies have hop counts much larger than theoretical lower bounds. The degree diameter problem (DDP) has been studied for decades and consists in generating the largest possible graph given degree and diameter constraints, striving to approach theoretical upper bounds. To generate network topologies with low hop counts we propose using best known DDP solutions as starting points for generating topologies of arbitrary size. Using discrete-event simulation, we quantify the performance of representative parallel applications when executed on our proposed topologies, on previously proposed fully random topologies, and on a classical non-random topology.","PeriodicalId":322499,"journal":{"name":"2016 Fourth International Symposium on Computing and Networking (CANDAR)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121099851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}