{"title":"基于宏块分区的视频流分析云边缘框架","authors":"Jie Duan, Wei Gu, Shujvan Zhang, Xin Gong","doi":"10.1109/CISCE58541.2023.10142809","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) have become the state-of-the-art solution for video stream analysis, but their high delay poses a challenge in meeting real-time requirements. Existing research focuses on compressing video while retaining enough information for server-side DNNs to perform high-precision inference. However, this approach can only reduce network transmission delay and may not be optimal for massive video analysis requests. This paper proposes a Cloud-Edge collaborative Video Analysis (CEVA) framework that uses depth estimation technology to divide the inference difficulty of different macroblocks in the video and offload them to edge and cloud servers for inference. Experimental results on the VisDrone dataset demonstrate that CEVA outperforms server-side feedback-driven methods in inference accuracy and reduces waiting delay by an average of 264 milliseconds. Compared to traditional offloading methods, CEVA doubles accuracy and increases the average delay by less than 100 milliseconds.","PeriodicalId":145263,"journal":{"name":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Cloud-Edge Framework for Video Stream Analysis using Macroblock Partitioning\",\"authors\":\"Jie Duan, Wei Gu, Shujvan Zhang, Xin Gong\",\"doi\":\"10.1109/CISCE58541.2023.10142809\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep neural networks (DNNs) have become the state-of-the-art solution for video stream analysis, but their high delay poses a challenge in meeting real-time requirements. Existing research focuses on compressing video while retaining enough information for server-side DNNs to perform high-precision inference. However, this approach can only reduce network transmission delay and may not be optimal for massive video analysis requests. This paper proposes a Cloud-Edge collaborative Video Analysis (CEVA) framework that uses depth estimation technology to divide the inference difficulty of different macroblocks in the video and offload them to edge and cloud servers for inference. Experimental results on the VisDrone dataset demonstrate that CEVA outperforms server-side feedback-driven methods in inference accuracy and reduces waiting delay by an average of 264 milliseconds. Compared to traditional offloading methods, CEVA doubles accuracy and increases the average delay by less than 100 milliseconds.\",\"PeriodicalId\":145263,\"journal\":{\"name\":\"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISCE58541.2023.10142809\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Conference on Communications, Information System and Computer Engineering (CISCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISCE58541.2023.10142809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Cloud-Edge Framework for Video Stream Analysis using Macroblock Partitioning
Deep neural networks (DNNs) have become the state-of-the-art solution for video stream analysis, but their high delay poses a challenge in meeting real-time requirements. Existing research focuses on compressing video while retaining enough information for server-side DNNs to perform high-precision inference. However, this approach can only reduce network transmission delay and may not be optimal for massive video analysis requests. This paper proposes a Cloud-Edge collaborative Video Analysis (CEVA) framework that uses depth estimation technology to divide the inference difficulty of different macroblocks in the video and offload them to edge and cloud servers for inference. Experimental results on the VisDrone dataset demonstrate that CEVA outperforms server-side feedback-driven methods in inference accuracy and reduces waiting delay by an average of 264 milliseconds. Compared to traditional offloading methods, CEVA doubles accuracy and increases the average delay by less than 100 milliseconds.