Efficient Distributed Training in Heterogeneous Mobile Networks with Active Sampling

2020 16th International Conference on Mobility, Sensing and Networking (MSN) Pub Date : 2020-12-01 DOI:10.1109/MSN50589.2020.00041

Yunhui Guo, Xiaofan Yu, Kamalika Chaudhuri, T. Simunic

{"title":"Efficient Distributed Training in Heterogeneous Mobile Networks with Active Sampling","authors":"Yunhui Guo, Xiaofan Yu, Kamalika Chaudhuri, T. Simunic","doi":"10.1109/MSN50589.2020.00041","DOIUrl":null,"url":null,"abstract":"Mobile edge computing is an emerging research topic which aims at pushing the computation from the cloud to the edge devices. Most of the current machine learning (ML) algorithms, such as federated learning, are designed for homogeneous mobile networks, that is, all the devices collect the same type of data. In this paper, we address distributed training of ML algorithms in heterogeneous mobile networks where the features, rather than the samples, are distributed across multiple heterogeneous mobile devices. Training ML models in heterogeneous mobile networks incurs a large communication cost due to the necessity to deliver the local data to a central server. Inspired by active learning, which is traditionally used to reduce the labeling cost for training ML models, we propose an active sampling method to reduce the communication cost of learning in heterogeneous mobile networks. Instead of sending all the local data, the proposed active sampling method identifies and sends only informative data from each device to the central server. Extensive experiments on four real datasets, both with numerical simulation and on a networked mobile system, show that the proposed method can reduce the communication cost by up to 53% and energy consumption by up to 67% without accuracy degradation compared with the conventional approaches.","PeriodicalId":447605,"journal":{"name":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","volume":"11 Suppl 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 16th International Conference on Mobility, Sensing and Networking (MSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSN50589.2020.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Mobile edge computing is an emerging research topic which aims at pushing the computation from the cloud to the edge devices. Most of the current machine learning (ML) algorithms, such as federated learning, are designed for homogeneous mobile networks, that is, all the devices collect the same type of data. In this paper, we address distributed training of ML algorithms in heterogeneous mobile networks where the features, rather than the samples, are distributed across multiple heterogeneous mobile devices. Training ML models in heterogeneous mobile networks incurs a large communication cost due to the necessity to deliver the local data to a central server. Inspired by active learning, which is traditionally used to reduce the labeling cost for training ML models, we propose an active sampling method to reduce the communication cost of learning in heterogeneous mobile networks. Instead of sending all the local data, the proposed active sampling method identifies and sends only informative data from each device to the central server. Extensive experiments on four real datasets, both with numerical simulation and on a networked mobile system, show that the proposed method can reduce the communication cost by up to 53% and energy consumption by up to 67% without accuracy degradation compared with the conventional approaches.

查看原文本刊更多论文

基于主动采样的异构移动网络高效分布式训练

移动边缘计算是一个新兴的研究课题，旨在将计算从云端推向边缘设备。目前大多数机器学习(ML)算法，如联邦学习，都是为同构移动网络设计的，也就是说，所有设备收集相同类型的数据。在本文中，我们解决了异构移动网络中ML算法的分布式训练问题，其中特征而不是样本分布在多个异构移动设备上。由于需要将本地数据传递到中心服务器，在异构移动网络中训练ML模型会产生很大的通信成本。受主动学习的启发，我们提出了一种主动采样方法来降低异构移动网络中学习的通信成本。主动学习通常用于降低训练ML模型的标记成本。主动采样方法不是发送所有本地数据，而是识别并仅将来自每个设备的信息数据发送到中心服务器。在四个实际数据集上进行的大量实验，包括数值模拟和网络移动系统，表明与传统方法相比，该方法在不降低精度的情况下，可将通信成本降低53%，能耗降低67%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 16th International Conference on Mobility, Sensing and Networking (MSN)

自引率

0.00%

发文量