{"title":"mobile YOLACT: Toward Lightweight Instance Segmentation for Mobile Devices","authors":"Juwon Lee, Seungjae Lee, Jong-gook Ko","doi":"10.1109/ICTC52510.2021.9621125","DOIUrl":null,"url":null,"abstract":"In this paper, we present a lightweight instance segmentation model, mobileYOLACT which is designed for mobile environments where the computational resources are limited. We propose several modifications to YOLACT to improve computational efficiency. First, we use a quantized lightweight backbone for feature extraction. Second, we reduce the computational burden with marginal degradation in accuracy by employing the depthwise separable convolution on the entire model. Third, we simplified the structure of prototype mask generation branch. Last, we used TorchScript and NCNN to further optimize the model and deploy it on mobile device. We validate the effectiveness of the proposed method from the experiments on COCO dataset. The proposed model can run at the speed of 21 FPS on Samsung Galaxy S20 with 23 APmask at 0.5 IoU threshold.","PeriodicalId":299175,"journal":{"name":"2021 International Conference on Information and Communication Technology Convergence (ICTC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology Convergence (ICTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTC52510.2021.9621125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, we present a lightweight instance segmentation model, mobileYOLACT which is designed for mobile environments where the computational resources are limited. We propose several modifications to YOLACT to improve computational efficiency. First, we use a quantized lightweight backbone for feature extraction. Second, we reduce the computational burden with marginal degradation in accuracy by employing the depthwise separable convolution on the entire model. Third, we simplified the structure of prototype mask generation branch. Last, we used TorchScript and NCNN to further optimize the model and deploy it on mobile device. We validate the effectiveness of the proposed method from the experiments on COCO dataset. The proposed model can run at the speed of 21 FPS on Samsung Galaxy S20 with 23 APmask at 0.5 IoU threshold.