Accelerate Multi-view Inference with End-edge Collaborative Computing

IF 2, Zone 3 (Computer Science), Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Wangbing Cheng, MinFeng Zhang, Fang Dong, Shucun Fu
DOI: 10.1109/CSCWD57460.2023.10152842 (https://doi.org/10.1109/CSCWD57460.2023.10152842)
Journal: Computer Supported Cooperative Work-The Journal of Collaborative Computing, volume 42 1, pages 1625-1631
Publication date: 2023-05-24
Publication type: Journal Article
Open access: no
Platform: Semanticscholar
Citations: 0

Abstract

Multi-view inference uses visual information from several views, much as a human does, and can significantly improve accuracy in some scenes, but it inevitably incurs more computing overhead than traditional DNN inference. To meet the low-latency requirements of typical scenarios, we apply the model partition technique from edge computing to speed up multi-view inference, and design a multi-view end-edge co-inference execution framework (MV-IEF) that uses both end and edge resources for multi-view inference tasks. Used naively, however, the framework's efficiency is constrained by network dynamics and by the heterogeneity of the devices that correspond to the different views. To break this constraint, we establish an optimization model based on the framework to minimize the multi-view inference time, and solve it using game theory. We also propose a joint optimization algorithm for multi-view resource allocation and model partition (MV-JRAMP), which makes resource allocation and model partition decisions according to network status and the computing capabilities of the devices. Finally, we build a prototype and evaluate the performance of MV-JRAMP. Experiments show that MV-JRAMP can accelerate multi-view inference by up to 3.71×.
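To make the trade-off in the abstract concrete, here is a minimal Python sketch of joint model partition and edge-resource allocation for two views. It is not the authors' MV-JRAMP and does not use game theory: it assumes a simple latency model (device compute, plus transfer of the intermediate data, plus edge compute scaled by the allocated edge share), assumes the multi-view inference time is the slowest view's latency, and searches exhaustively. The `View` fields, function names, and all numbers are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class View:
    """Hypothetical profile of one view's device and link (illustrative only)."""
    device_ms: List[float]        # per-layer compute time on the end device
    edge_ms: List[float]          # per-layer compute time on the edge at full capacity
    out_kb: List[float]           # out_kb[k]: data sent if the first k layers run locally
    bandwidth_kb_per_ms: float    # uplink bandwidth from this device to the edge


def view_latency(v: View, cut: int, share: float) -> float:
    """Latency when layers [0, cut) run on the device and the rest on the edge."""
    n = len(v.device_ms)
    local = sum(v.device_ms[:cut])
    if cut == n:
        # Whole model runs locally: nothing is transmitted or offloaded.
        return local
    if share <= 0.0:
        # Offloading without any edge share is infeasible in this model.
        return float("inf")
    transfer = v.out_kb[cut] / v.bandwidth_kb_per_ms
    remote = sum(v.edge_ms[cut:]) / share
    return local + transfer + remote


def best_cut(v: View, share: float) -> Tuple[int, float]:
    """Try every partition point and return (cut, latency) with the lowest latency."""
    n = len(v.device_ms)
    return min(((c, view_latency(v, c, share)) for c in range(n + 1)),
               key=lambda t: t[1])


def plan_two_views(a: View, b: View, steps: int = 20) -> Tuple[float, float, int, int]:
    """Grid-search the edge share given to view a; view b gets the remainder.

    Returns (total_latency, share_a, cut_a, cut_b), where total_latency is the
    slower of the two views (assumed here to bound the multi-view result).
    """
    best = None
    for i in range(steps + 1):
        share_a = i / steps
        cut_a, lat_a = best_cut(a, share_a)
        cut_b, lat_b = best_cut(b, 1.0 - share_a)
        total = max(lat_a, lat_b)
        if best is None or total < best[0]:
            best = (total, share_a, cut_a, cut_b)
    return best


if __name__ == "__main__":
    # A slow device on a fast link and a faster device on a slow link.
    cam_a = View([10.0, 20.0], [1.0, 2.0], [100.0, 10.0], 10.0)
    cam_b = View([4.0, 8.0], [1.0, 2.0], [100.0, 10.0], 2.0)
    print(plan_two_views(cam_a, cam_b))
```

Because the slowest view determines the total in this model, giving a larger edge share to the view with the weaker device or poorer link can reduce the overall time, which is the coupling between resource allocation and partition point that the abstract describes.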
Source journal: Computer Supported Cooperative Work-The Journal of Collaborative Computing
Category: COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
CiteScore: 6.40
Self-citation rate: 4.20%
Articles published: 31
Review time: >12 weeks
About the journal: Computer Supported Cooperative Work (CSCW): The Journal of Collaborative Computing and Work Practices is devoted to innovative research in computer-supported cooperative work (CSCW). It provides an interdisciplinary and international forum for the debate and exchange of ideas concerning theoretical, practical, technical, and social issues in CSCW. The CSCW Journal arose in response to the growing interest in the design, implementation and use of technical systems (including computing, information, and communications technologies) which support people working cooperatively, and its scope remains to encompass the multifarious aspects of research within CSCW and related areas. The CSCW Journal focuses on research oriented towards the development of collaborative computing technologies on the basis of studies of actual cooperative work practices (where ‘work’ is used in the wider sense). That is, it welcomes in particular submissions that (a) report on findings from ethnographic or similar kinds of in-depth fieldwork of work practices with a view to their technological implications, (b) report on empirical evaluations of the use of extant or novel technical solutions under real-world conditions, and/or (c) develop technical or conceptual frameworks for practice-oriented computing research based on previous fieldwork and evaluations.