Slimmer:为移动增强现实加速3D语义分割

2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS) Pub Date : 2020-12-01 DOI:10.1109/MASS50613.2020.00079

Huanle Zhang, Bo Han, C. Y. Ip, P. Mohapatra

{"title":"Slimmer:为移动增强现实加速3D语义分割","authors":"Huanle Zhang, Bo Han, C. Y. Ip, P. Mohapatra","doi":"10.1109/MASS50613.2020.00079","DOIUrl":null,"url":null,"abstract":"Three-Dimensional (3D) semantic segmentation is an essential building block for interactive Augmented Reality (AR). However, existing Deep Neural Network (DNN) models for segmenting 3D objects are not only computation-intensive but also memory heavy, hindering their deployment on resourceconstrained mobile devices. We present the design, implementation and evaluation of Slimmer, a generic and model-independent framework for accelerating 3D semantic segmentation and facilitating its real-time applications on mobile devices. In contrast to the current practice that directly feeds a point cloud to DNN models, Slimmer is motivated by our observation that these models remain high accuracy even if we remove a fraction of points from the input, which can significantly reduce the inference time and memory usage of these models. Our design of Slimmer faces two key challenges. First, the simplification method of point clouds should be lightweight. Otherwise, the reduced inference time may be canceled out by the incurred overhead of input-data simplification. Second, Slimmer still needs to accurately segment the removed points from the input to create a complete segmentation of the original input, again, using a lightweight method. Our extensive performance evaluation demonstrates that, by addressing these two challenges, Slimmer can dramatically reduce the resource utilization of a representative DNN model for 3D semantic segmentation. For example, if we can tolerate 1% accuracy loss, the reduction could be $\\sim$20% for inference time and$\\sim$9% for memory usage. The reduction increases to around $\\sim$27% for inference time and$\\sim$15% for memory usage when we can tolerate 2% accuracy loss.","PeriodicalId":105795,"journal":{"name":"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Slimmer: Accelerating 3D Semantic Segmentation for Mobile Augmented Reality\",\"authors\":\"Huanle Zhang, Bo Han, C. Y. Ip, P. Mohapatra\",\"doi\":\"10.1109/MASS50613.2020.00079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Three-Dimensional (3D) semantic segmentation is an essential building block for interactive Augmented Reality (AR). However, existing Deep Neural Network (DNN) models for segmenting 3D objects are not only computation-intensive but also memory heavy, hindering their deployment on resourceconstrained mobile devices. We present the design, implementation and evaluation of Slimmer, a generic and model-independent framework for accelerating 3D semantic segmentation and facilitating its real-time applications on mobile devices. In contrast to the current practice that directly feeds a point cloud to DNN models, Slimmer is motivated by our observation that these models remain high accuracy even if we remove a fraction of points from the input, which can significantly reduce the inference time and memory usage of these models. Our design of Slimmer faces two key challenges. First, the simplification method of point clouds should be lightweight. Otherwise, the reduced inference time may be canceled out by the incurred overhead of input-data simplification. Second, Slimmer still needs to accurately segment the removed points from the input to create a complete segmentation of the original input, again, using a lightweight method. Our extensive performance evaluation demonstrates that, by addressing these two challenges, Slimmer can dramatically reduce the resource utilization of a representative DNN model for 3D semantic segmentation. For example, if we can tolerate 1% accuracy loss, the reduction could be $\\\\sim$20% for inference time and$\\\\sim$9% for memory usage. The reduction increases to around $\\\\sim$27% for inference time and$\\\\sim$15% for memory usage when we can tolerate 2% accuracy loss.\",\"PeriodicalId\":105795,\"journal\":{\"name\":\"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MASS50613.2020.00079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASS50613.2020.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

三维语义分割是交互式增强现实(AR)的重要组成部分。然而，现有的用于分割3D物体的深度神经网络(DNN)模型不仅计算量大，而且内存量大，阻碍了它们在资源有限的移动设备上的部署。我们提出了Slimmer的设计、实现和评估，这是一个通用的、独立于模型的框架，用于加速3D语义分割并促进其在移动设备上的实时应用。与目前直接向DNN模型提供点云的做法相反，Slimmer的动机是我们观察到，即使我们从输入中删除一小部分点，这些模型仍然保持很高的准确性，这可以显着减少这些模型的推理时间和内存使用。我们的Slimmer设计面临两个关键挑战。首先，点云的简化方法应该是轻量级的。否则，减少的推理时间可能会被输入数据简化带来的开销所抵消。其次，Slimmer仍然需要从输入中准确地分割被删除的点，以创建原始输入的完整分割，同样使用轻量级方法。我们广泛的性能评估表明，通过解决这两个挑战，Slimmer可以显着降低用于3D语义分割的代表性DNN模型的资源利用率。例如，如果我们可以容忍1%的准确性损失，则推理时间减少20%，内存使用减少9%。当我们可以容忍2%的准确性损失时，推理时间的减少增加到27%左右，内存使用的减少增加到15%左右。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Slimmer: Accelerating 3D Semantic Segmentation for Mobile Augmented Reality

Three-Dimensional (3D) semantic segmentation is an essential building block for interactive Augmented Reality (AR). However, existing Deep Neural Network (DNN) models for segmenting 3D objects are not only computation-intensive but also memory heavy, hindering their deployment on resourceconstrained mobile devices. We present the design, implementation and evaluation of Slimmer, a generic and model-independent framework for accelerating 3D semantic segmentation and facilitating its real-time applications on mobile devices. In contrast to the current practice that directly feeds a point cloud to DNN models, Slimmer is motivated by our observation that these models remain high accuracy even if we remove a fraction of points from the input, which can significantly reduce the inference time and memory usage of these models. Our design of Slimmer faces two key challenges. First, the simplification method of point clouds should be lightweight. Otherwise, the reduced inference time may be canceled out by the incurred overhead of input-data simplification. Second, Slimmer still needs to accurately segment the removed points from the input to create a complete segmentation of the original input, again, using a lightweight method. Our extensive performance evaluation demonstrates that, by addressing these two challenges, Slimmer can dramatically reduce the resource utilization of a representative DNN model for 3D semantic segmentation. For example, if we can tolerate 1% accuracy loss, the reduction could be $\sim$20% for inference time and$\sim$9% for memory usage. The reduction increases to around $\sim$27% for inference time and$\sim$15% for memory usage when we can tolerate 2% accuracy loss.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)

自引率

0.00%

发文量