IEEE Transactions on Multi-Scale Computing Systems最新文献_第6页

Stochastic-Based Synapse and Soft-Limiting Neuron with Spintronic Devices for Low Power and Robust Artificial Neural Networks 用于低功耗和鲁棒人工神经网络的基于随机的突触和带自旋电子器件的软限制神经元

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-12-27 DOI: 10.1109/TMSCS.2017.2787109

Yu Bai;Deliang Fan;Mingjie Lin

{"title":"Stochastic-Based Synapse and Soft-Limiting Neuron with Spintronic Devices for Low Power and Robust Artificial Neural Networks","authors":"Yu Bai;Deliang Fan;Mingjie Lin","doi":"10.1109/TMSCS.2017.2787109","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2787109","url":null,"abstract":"We propose an innovative stochastic-based computing architecture to implement low-power and robust artificial neural network (S-ANN) with both magnetic tunneling junction (MTJ) and Domain Wall (DW) devices. Our mixed-model HSPICE simulation results have shown that, for a well-known pattern recognition task, a 34-neuron S-ANN implementation achieves more than 1.5 orders of magnitude lower energy consumption and 2.5 orders of magnitude less hidden layer chip area, when compared with its deterministicbased ANN counterparts which are implemented with digital and analog CMOS circuits. We believe that our S-ANN architecture achieves such a remarkable performance gain by leveraging two key ideas. First, because all neural signals are encoded as random bit streams, the standard weighted-sum synapses can be accomplished by stochastic bit writing and reading procedure. Second, we designed and implemented a novel multiple-phase pumping circuit structure to effectively realize the soft-limiting neural transfer function that is essential to improve the overall ANN capability and reduce its network complexity.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 3","pages":"463-476"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2787109","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68026464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

SpiNNaker: Event-Based Simulation—Quantitative Behavior SpiNNaker：基于事件的模拟——定量行为

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-22 DOI: 10.1109/TMSCS.2017.2748122

Andrew D. Brown;John E. Chad;Raihaan Kamarudin;Kier J. Dugan;Stephen B. Furber

{"title":"SpiNNaker: Event-Based Simulation—Quantitative Behavior","authors":"Andrew D. Brown;John E. Chad;Raihaan Kamarudin;Kier J. Dugan;Stephen B. Furber","doi":"10.1109/TMSCS.2017.2748122","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2748122","url":null,"abstract":"SpiNNaker (Spiking Neural Network Architecture) is a specialized computing engine, intended for real-time simulation of neural systems. It consists of a mesh of 240x240 nodes, each containing 18 ARM9 processors: over a million cores, communicating via a bespoke network. Ultimately, the machine will support the simulation of up to a billion neurons in real time, allowing simulation experiments to be taken to hitherto unattainable scales. The architecture achieves this by ignoring three of the axioms of computer design: the communication fabric is non-deterministic; there is no global core synchronisation, and the system state-held in distributed memory-is not coherent. Time models itself: there is no notion of computed simulation time-wallclock time is simulation time. Whilst these design decisions are orthogonal to conventional wisdom, they bring the engine behavior closer to its intended simulation target-neural systems. We describe how SpiNNaker simulates large neural ensembles; we provide performance figures and outline some failure mechanisms. SpiNNaker simulation time scales 1:1 with wallclock time at least up to nine million synaptic connections on a 768 core subsystem (~1400th of the full system) to accurately produce logically predicted results.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 3","pages":"450-462"},"PeriodicalIF":0.0,"publicationDate":"2017-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2748122","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67861115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

On Approximate Speculative Lock Elision 关于近似推测锁Elision

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-17 DOI: 10.1109/TMSCS.2017.2773488

S. Karen Khatamifard;Ismail Akturk;Ulya R. Karpuzcu

{"title":"On Approximate Speculative Lock Elision","authors":"S. Karen Khatamifard;Ismail Akturk;Ulya R. Karpuzcu","doi":"10.1109/TMSCS.2017.2773488","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2773488","url":null,"abstract":"Each synchronization point represents a point of serialization, and thereby can easily hurt parallel scalability. As demonstrated by recent studies, approximating, i.e., relaxing synchronization by eliminating a subset of synchronization points spatio-temporally can help improve parallel scalability, as long as approximation incurred violations of basic execution semantics remain predictable and controllable. Even if the divergence from fully-synchronized execution renders lower computation accuracy ratherthan catastrophic program termination, for approximation to be viable, the accuracy loss must be bounded. In this paper, we assess the viability of approximate synchronization using Speculative Lock Elision (SLE), which was adopted by hardware transactional memory implementations from industry, as a baseline for comparison. Specifically, we investigate the efficacy of exploiting semantic and temporal characteristics of critical sections in preventing excessive loss in computation accuracy, and devise a light-weight, proof-of-concept Approximate Speculative Lock Elision (ASLE) implementation, which exploits existing hardware support for SLE.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 2","pages":"141-151"},"PeriodicalIF":0.0,"publicationDate":"2017-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2773488","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68025087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

High-Precision Performance Estimation for the Design Space Exploration of Dynamic Dataflow Programs 动态数据流程序设计空间探索的高精度性能估计

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-16 DOI: 10.1109/TMSCS.2017.2774294

Małgorzata Michalska;Simone Casale-Brunet;Endri Bezati;Marco Mattavelli

{"title":"High-Precision Performance Estimation for the Design Space Exploration of Dynamic Dataflow Programs","authors":"Małgorzata Michalska;Simone Casale-Brunet;Endri Bezati;Marco Mattavelli","doi":"10.1109/TMSCS.2017.2774294","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2774294","url":null,"abstract":"The implementation and optimization of dynamic dataflow programs on multi/many-core platforms require solving a very difficult problem: how to partition and schedule the processing elements and dimension their interconnecting buffers according to given optimization functions in terms of throughput, memory usage, and energy consumption. This problem is NP-hard even for two cores. Thus, finding a close-to-optimal solution consists of exploring the design space by appropriate heuristics identifying those design points that maximize or minimize the desired (multiple) objective functions subject to a set of constraints. In general, exploring the design space efficiently is a challenging task due to the massive number of admissible design points. Efficient estimation methodologies are necessary to support an effective search of the design space by reducing to a minimum the cost and the number of measurements on the physical platform. This paper presents a new methodology that provides high-precision estimations of dynamic dataflow programs performances on multi/many-core platforms for any set of design configurations. The estimations rely on the execution trace post-processing obtained by a single profiled execution of the program. Furthermore, the paper describes the estimation methodology, implementation tools, and the type of information that is obtained from many/multi-core dataflow executions and used to drive the optimization heuristics. The results confirm a high level of accuracy achieved on different types of platforms and the effectiveness of the illustrated design space exploration methodology.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 2","pages":"127-140"},"PeriodicalIF":0.0,"publicationDate":"2017-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2774294","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68025090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Multiscaled Simulation Methodology for Neuro-Inspired Circuits Demonstrated with an Organic Memristor 用有机忆阻器演示神经激励电路的多尺度仿真方法

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-14 DOI: 10.1109/TMSCS.2017.2773523

Christopher H. Bennett;Jean-Etienne Lorival;Francois Marc;Théo Cabaret;Bruno Jousselme;Vincent Derycke;Jacques-Olivier Klein;Cristell Maneux

{"title":"Multiscaled Simulation Methodology for Neuro-Inspired Circuits Demonstrated with an Organic Memristor","authors":"Christopher H. Bennett;Jean-Etienne Lorival;Francois Marc;Théo Cabaret;Bruno Jousselme;Vincent Derycke;Jacques-Olivier Klein;Cristell Maneux","doi":"10.1109/TMSCS.2017.2773523","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2773523","url":null,"abstract":"Organic memristors are promising molecular electronic devices for neuro-inspired on-chip learning applications. In this paper, we present a numerically efficient compact model suitable for \u0000<inline-formula><tex-math>$Fe(bpy)_3^{2+}$</tex-math></inline-formula>\u0000 organic memristors operating according to an intramolecular charge transfer switching mechanism. This compact model, being physics-based and relying on electrical characterizations and parametric extractions performed on test structures, is especially efficient in pulsed mode and describes the conductance variations for both SET and RESET regimes. Using this model, a dynamic multiscale simulation approach has been set-up to extend the model from individual devices to larger model systems that learn progressively through time. To verify the soundness and highlight emergent properties of the organic memristors, instances of the compact model have been simulated within a simple neuromorphic design that co-integrates with CMOS neurons. In addition, a larger supervised learning system using the new compact model is demonstrated. These successful tests suggest our model might be of interest to neuromorphic designers.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 4","pages":"822-832"},"PeriodicalIF":0.0,"publicationDate":"2017-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2773523","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67861365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

OpSecure: A Secure Unidirectional Optical Channel for Implantable Medical Devices OpSecure：一种用于植入式医疗设备的安全单向光通道

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-09 DOI: 10.1109/TMSCS.2017.2771347

Arsalan Mosenia;Niraj K. Jha

{"title":"OpSecure: A Secure Unidirectional Optical Channel for Implantable Medical Devices","authors":"Arsalan Mosenia;Niraj K. Jha","doi":"10.1109/TMSCS.2017.2771347","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2771347","url":null,"abstract":"Implantable medical devices (IMDs) are opening up new opportunities for holistic healthcare by enabling continuous monitoring and treatment of various medical conditions, leading to an ever-improving quality of life for patients. Integration of radio frequency (RF) modules in IMDs has provided wireless connectivity and facilitated access to on-device data and post-deployment tuning of essential therapy. However, this has also made IMDs susceptible to various security attacks. Several lightweight encryption mechanisms have been developed to prevent well-known attacks, e.g., integrity attacks that send malicious commands to the device, on IMDs. However, lack of a secure key exchange protocol (that enables the exchange of the encryption key while maintaining its confidentiality) and the immaturity of already-in-use wakeup protocols (that are used to turn on the RF module before an authorized data transmission) are two fundamental challenges that must be addressed to ensure the security of wireless-enabled IMDs. In this paper, we introduce OpSecure, an optical secure communication channel between an IMD and an external device, e.g., a smartphone. OpSecure enables an intrinsically user-perceptible unidirectional data transmission, suitable for physically-secure communication with minimal size and energy overheads. Based on OpSecure, we design and implement two protocols: (i) a low-power wakeup protocol that is resilient against remote battery-draining attacks, and (ii) a secure key exchange protocol to share the encryption key between the IMD and the external device. We evaluate the two protocols using a human body model.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 3","pages":"410-419"},"PeriodicalIF":0.0,"publicationDate":"2017-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2771347","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68023884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

A Hardware/Software Stack for Heterogeneous Systems 异构系统的硬件/软件堆栈

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-09 DOI: 10.1109/TMSCS.2017.2771750

Jeronimo Castrillon;Matthias Lieber;Sascha Klüppelholz;Marcus Völp;Nils Asmussen;Uwe Aßmann;Franz Baader;Christel Baier;Gerhard Fettweis;Jochen Fröhlich;Andrés Goens;Sebastian Haas;Dirk Habich;Hermann Härtig;Mattis Hasler;Immo Huismann;Tomas Karnagel;Sven Karol;Akash Kumar;Wolfgang Lehner;Linda Leuschner;Siqi Ling;Steffen Märcker;Christian Menard;Johannes Mey;Wolfgang Nagel;Benedikt Nöthen;Rafael Peñaloza;Michael Raitza;Jörg Stiller;Annett Ungethüm;Axel Voigt;Sascha Wunderlich

{"title":"A Hardware/Software Stack for Heterogeneous Systems","authors":"Jeronimo Castrillon;Matthias Lieber;Sascha Klüppelholz;Marcus Völp;Nils Asmussen;Uwe Aßmann;Franz Baader;Christel Baier;Gerhard Fettweis;Jochen Fröhlich;Andrés Goens;Sebastian Haas;Dirk Habich;Hermann Härtig;Mattis Hasler;Immo Huismann;Tomas Karnagel;Sven Karol;Akash Kumar;Wolfgang Lehner;Linda Leuschner;Siqi Ling;Steffen Märcker;Christian Menard;Johannes Mey;Wolfgang Nagel;Benedikt Nöthen;Rafael Peñaloza;Michael Raitza;Jörg Stiller;Annett Ungethüm;Axel Voigt;Sascha Wunderlich","doi":"10.1109/TMSCS.2017.2771750","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2771750","url":null,"abstract":"Plenty of novel emerging technologies are being proposed and evaluated today, mostly at the device and circuit levels. It is unclear what the impact of different new technologies at the system level will be. What is clear, however, is that new technologies will make their way into systems and will increase the already high complexity of heterogeneous parallel computing platforms, making it ever so difficult to program them. This paper discusses a programming stack for heterogeneous systems that combines and adapts well-understood principles from different areas, including capability-based operating systems, adaptive application runtimes, dataflow programming models, and model checking. We argue why we think that these principles built into the stack and the interfaces among the layers will also be applicable to future systems that integrate heterogeneous technologies. The programming stack is evaluated on a tiled heterogeneous multicore.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 3","pages":"243-259"},"PeriodicalIF":0.0,"publicationDate":"2017-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2771750","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68026461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

The Quest for Energy-Efficient I$ Design in Ultra-Low-Power Clustered Many-Cores 超低功率集群多核的节能I$设计探索

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-02 DOI: 10.1109/TMSCS.2017.2769046

Igor Loi;Alessandro Capotondi;Davide Rossi;Andrea Marongiu;Luca Benini

{"title":"The Quest for Energy-Efficient I$ Design in Ultra-Low-Power Clustered Many-Cores","authors":"Igor Loi;Alessandro Capotondi;Davide Rossi;Andrea Marongiu;Luca Benini","doi":"10.1109/TMSCS.2017.2769046","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2769046","url":null,"abstract":"High performance and extreme energy efficiency are strong requirements for a fast-growing number of edge-node Internet of Things (IoT) applications. While traditional Ultra-Low-Power designs rely on single-core micro-controllers (MCU), a new generation of architectures leveraging fully programmable tightly-coupled clusters of near-threshold processors is emerging, joining the performance gain of parallel execution over multiple cores with the energy efficiency of low-voltage operation. In this work, we tackle one of the most critical energy-efficiency bottlenecks for these architectures: instruction memory hierarchy. Exploiting the instruction locality typical of data-parallel applications, we explore two different shared instruction cache architectures, based on energy-efficient latch-based memory banks: one leveraging a crossbar between processors and single-port banks (SP), and one leveraging banks with multiple read ports (MP). We evaluate the proposed architectures on a set of signal processing applications with different executable sizes and working-sets. The results show that the shared cache architectures are able to efficiently execute a much wider set of applications (including those featuring large memory footprint and irregular access patterns) with a much smaller area and with much better energy efficiency with respect to the private cache. The multi-port cache is suitable for sizes up to a few kB, improving performance by up to 40 percent, energy efficiency by up to 20 percent, and energy × area efficiency by up to 30 percent with respect to the private cache. The single-port solution is more suitable for larger cache sizes (up to 16 kB), providing up to 20 percent better energy x area efficiency than the multi-port, and up to 30 percent better energy efficiency than private cache.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 2","pages":"99-112"},"PeriodicalIF":0.0,"publicationDate":"2017-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2769046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68025088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

A Systematic and Realistic Network-on-Chip Traffic Modeling and Generation Technique for Emerging Many-Core Systems 面向新兴多核心系统的系统真实的片上网络流量建模与生成技术

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-11-02 DOI: 10.1109/TMSCS.2017.2768362

Weichen Liu;Zhe Wang;Peng Yang;Jiang Xu;Bin Li;Ravi Lyer;Ramesh Illikkal

{"title":"A Systematic and Realistic Network-on-Chip Traffic Modeling and Generation Technique for Emerging Many-Core Systems","authors":"Weichen Liu;Zhe Wang;Peng Yang;Jiang Xu;Bin Li;Ravi Lyer;Ramesh Illikkal","doi":"10.1109/TMSCS.2017.2768362","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2768362","url":null,"abstract":"As programs for microprocessor architectures, network-on-chip (NoC) traffic patterns are essential tools for NoC performance assessment and design exploration. The fidelity of NoC traffic patterns has profound influence on NoC studies. In this paper, we present a systematic traffic modeling and generation methodology and a traffic suite for efficient evaluation of NoC-based many-core systems. The publicly released MCSL (multi-constraint system-level) traffic suite includes a set of realistic traffic patterns for real-world applications and covers popular NoC architectures. It captures both the communication behaviors in NoCs and the temporal dependencies among them. The MCSL traffic suite can be easily incorporated into existing NoC simulators and significantly improve NoC simulation accuracy. The proposed methodology uses formal computational models to capture both communication and computation requirements of applications. It optimizes application memory requirements, mapping, and scheduling to maximize overall system performance and utilization before extracting traffic patterns through cycle level simulations. Experiment results show that the MCSL traffic suite can be used to study NoC characteristics more accurately than traditional random traffic patterns.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 2","pages":"113-126"},"PeriodicalIF":0.0,"publicationDate":"2017-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2768362","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68025089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Finding and Counting Tree-Like Subgraphs Using MapReduce 使用MapReduce查找和计数树状子图

IEEE Transactions on Multi-Scale Computing Systems Pub Date : 2017-10-31 DOI: 10.1109/TMSCS.2017.2768426

Zhao Zhao;Langshi Chen;Mihai Avram;Meng Li;Guanying Wang;Ali Butt;Maleq Khan;Madhav Marathe;Judy Qiu;Anil Vullikanti

{"title":"Finding and Counting Tree-Like Subgraphs Using MapReduce","authors":"Zhao Zhao;Langshi Chen;Mihai Avram;Meng Li;Guanying Wang;Ali Butt;Maleq Khan;Madhav Marathe;Judy Qiu;Anil Vullikanti","doi":"10.1109/TMSCS.2017.2768426","DOIUrl":"https://doi.org/10.1109/TMSCS.2017.2768426","url":null,"abstract":"Several variants of the subgraph isomorphism problem, e.g., finding, counting, and estimating frequencies of subgraphs in networks arise in a number of real world applications, such as web analysis, disease diffusion prediction, and social network analysis. These problems are computationally challenging in having to scale to very large networks with millions of vertices. In this paper, we present SAHAD, a MapReduce algorithm for detecting and counting trees of bounded size using the elegant color coding technique developed by N. Alon et al. SAHAD is a randomized algorithm, and we show rigorous bounds on the approximation quality and the performance of it. SAHAD scales to very large networks comprising of 10\u00007\u0000 - 10\u00008\u0000 vertices and 10\u00008\u0000 - 10\u00009\u0000 edges and tree-like (acyclic) templates with up to 12 vertices. Further, we extend our results by implementing SAHAD in the Harp framework, which is more of a high performance computing environment. The new implementation gives 100x improvement in performance over the standard Hadoop implementation and achieves better performance than state-of-the-art MPI solutions on larger graphs.","PeriodicalId":100643,"journal":{"name":"IEEE Transactions on Multi-Scale Computing Systems","volume":"4 3","pages":"217-230"},"PeriodicalIF":0.0,"publicationDate":"2017-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TMSCS.2017.2768426","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68026459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3