{"title":"移动YOLACT:面向移动设备的轻量级实例分割","authors":"Juwon Lee, Seungjae Lee, Jong-gook Ko","doi":"10.1109/ICTC52510.2021.9621125","DOIUrl":null,"url":null,"abstract":"In this paper, we present a lightweight instance segmentation model, mobileYOLACT which is designed for mobile environments where the computational resources are limited. We propose several modifications to YOLACT to improve computational efficiency. First, we use a quantized lightweight backbone for feature extraction. Second, we reduce the computational burden with marginal degradation in accuracy by employing the depthwise separable convolution on the entire model. Third, we simplified the structure of prototype mask generation branch. Last, we used TorchScript and NCNN to further optimize the model and deploy it on mobile device. We validate the effectiveness of the proposed method from the experiments on COCO dataset. The proposed model can run at the speed of 21 FPS on Samsung Galaxy S20 with 23 APmask at 0.5 IoU threshold.","PeriodicalId":299175,"journal":{"name":"2021 International Conference on Information and Communication Technology Convergence (ICTC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"mobile YOLACT: Toward Lightweight Instance Segmentation for Mobile Devices\",\"authors\":\"Juwon Lee, Seungjae Lee, Jong-gook Ko\",\"doi\":\"10.1109/ICTC52510.2021.9621125\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a lightweight instance segmentation model, mobileYOLACT which is designed for mobile environments where the computational resources are limited. We propose several modifications to YOLACT to improve computational efficiency. First, we use a quantized lightweight backbone for feature extraction. Second, we reduce the computational burden with marginal degradation in accuracy by employing the depthwise separable convolution on the entire model. Third, we simplified the structure of prototype mask generation branch. Last, we used TorchScript and NCNN to further optimize the model and deploy it on mobile device. We validate the effectiveness of the proposed method from the experiments on COCO dataset. The proposed model can run at the speed of 21 FPS on Samsung Galaxy S20 with 23 APmask at 0.5 IoU threshold.\",\"PeriodicalId\":299175,\"journal\":{\"name\":\"2021 International Conference on Information and Communication Technology Convergence (ICTC)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Information and Communication Technology Convergence (ICTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTC52510.2021.9621125\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology Convergence (ICTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTC52510.2021.9621125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
mobile YOLACT: Toward Lightweight Instance Segmentation for Mobile Devices
In this paper, we present a lightweight instance segmentation model, mobileYOLACT which is designed for mobile environments where the computational resources are limited. We propose several modifications to YOLACT to improve computational efficiency. First, we use a quantized lightweight backbone for feature extraction. Second, we reduce the computational burden with marginal degradation in accuracy by employing the depthwise separable convolution on the entire model. Third, we simplified the structure of prototype mask generation branch. Last, we used TorchScript and NCNN to further optimize the model and deploy it on mobile device. We validate the effectiveness of the proposed method from the experiments on COCO dataset. The proposed model can run at the speed of 21 FPS on Samsung Galaxy S20 with 23 APmask at 0.5 IoU threshold.