Kritpawit Soongswang, Phattharaphon Romphet, C. Chantrapornchai
{"title":"Enhancing MobileNetV2 Performance with Layer Replication and Splitting for 3D Face Recognition Task Using Distributed Training","authors":"Kritpawit Soongswang, Phattharaphon Romphet, C. Chantrapornchai","doi":"10.1109/ITC-CSCC58803.2023.10212828","DOIUrl":null,"url":null,"abstract":"In this paper, we propose the operators, which are layer replication and splitting, to modify the CNN network. The algorithm for exploring suitable modifications to the prototype MobileNetV2 architecture for the 3D face recognition task is presented. The algorithm consists of two steps. The first step involves splitting the input and searching for the optimal position to expand the network through layer replication. The second part explores the architecture modification by altering the split input layer position within the model. Experiments demonstrate that the discovered model provides a more cost-effective performance, with only 0.015% increased size compared to a vanilla MobileNet V2, while delivering 6.99% higher accuracy than the previous 3D MobileNetV2 and 9.15% more accuracy than the vanilla MobileNetV2, with a total of 3,801,136 parameters. Using distributed data training on multi-GPUs, the total training time can be reduced by 75% while maintaining good accuracy compared to traditional single-GPU training.","PeriodicalId":220939,"journal":{"name":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-CSCC58803.2023.10212828","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose the operators, which are layer replication and splitting, to modify the CNN network. The algorithm for exploring suitable modifications to the prototype MobileNetV2 architecture for the 3D face recognition task is presented. The algorithm consists of two steps. The first step involves splitting the input and searching for the optimal position to expand the network through layer replication. The second part explores the architecture modification by altering the split input layer position within the model. Experiments demonstrate that the discovered model provides a more cost-effective performance, with only 0.015% increased size compared to a vanilla MobileNet V2, while delivering 6.99% higher accuracy than the previous 3D MobileNetV2 and 9.15% more accuracy than the vanilla MobileNetV2, with a total of 3,801,136 parameters. Using distributed data training on multi-GPUs, the total training time can be reduced by 75% while maintaining good accuracy compared to traditional single-GPU training.