Kritpawit Soongswang, Phattharaphon Romphet, C. Chantrapornchai
{"title":"使用分布式训练增强MobileNetV2的层复制和分割性能,用于3D人脸识别任务","authors":"Kritpawit Soongswang, Phattharaphon Romphet, C. Chantrapornchai","doi":"10.1109/ITC-CSCC58803.2023.10212828","DOIUrl":null,"url":null,"abstract":"In this paper, we propose the operators, which are layer replication and splitting, to modify the CNN network. The algorithm for exploring suitable modifications to the prototype MobileNetV2 architecture for the 3D face recognition task is presented. The algorithm consists of two steps. The first step involves splitting the input and searching for the optimal position to expand the network through layer replication. The second part explores the architecture modification by altering the split input layer position within the model. Experiments demonstrate that the discovered model provides a more cost-effective performance, with only 0.015% increased size compared to a vanilla MobileNet V2, while delivering 6.99% higher accuracy than the previous 3D MobileNetV2 and 9.15% more accuracy than the vanilla MobileNetV2, with a total of 3,801,136 parameters. Using distributed data training on multi-GPUs, the total training time can be reduced by 75% while maintaining good accuracy compared to traditional single-GPU training.","PeriodicalId":220939,"journal":{"name":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing MobileNetV2 Performance with Layer Replication and Splitting for 3D Face Recognition Task Using Distributed Training\",\"authors\":\"Kritpawit Soongswang, Phattharaphon Romphet, C. Chantrapornchai\",\"doi\":\"10.1109/ITC-CSCC58803.2023.10212828\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose the operators, which are layer replication and splitting, to modify the CNN network. The algorithm for exploring suitable modifications to the prototype MobileNetV2 architecture for the 3D face recognition task is presented. The algorithm consists of two steps. The first step involves splitting the input and searching for the optimal position to expand the network through layer replication. The second part explores the architecture modification by altering the split input layer position within the model. Experiments demonstrate that the discovered model provides a more cost-effective performance, with only 0.015% increased size compared to a vanilla MobileNet V2, while delivering 6.99% higher accuracy than the previous 3D MobileNetV2 and 9.15% more accuracy than the vanilla MobileNetV2, with a total of 3,801,136 parameters. Using distributed data training on multi-GPUs, the total training time can be reduced by 75% while maintaining good accuracy compared to traditional single-GPU training.\",\"PeriodicalId\":220939,\"journal\":{\"name\":\"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITC-CSCC58803.2023.10212828\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-CSCC58803.2023.10212828","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhancing MobileNetV2 Performance with Layer Replication and Splitting for 3D Face Recognition Task Using Distributed Training
In this paper, we propose the operators, which are layer replication and splitting, to modify the CNN network. The algorithm for exploring suitable modifications to the prototype MobileNetV2 architecture for the 3D face recognition task is presented. The algorithm consists of two steps. The first step involves splitting the input and searching for the optimal position to expand the network through layer replication. The second part explores the architecture modification by altering the split input layer position within the model. Experiments demonstrate that the discovered model provides a more cost-effective performance, with only 0.015% increased size compared to a vanilla MobileNet V2, while delivering 6.99% higher accuracy than the previous 3D MobileNetV2 and 9.15% more accuracy than the vanilla MobileNetV2, with a total of 3,801,136 parameters. Using distributed data training on multi-GPUs, the total training time can be reduced by 75% while maintaining good accuracy compared to traditional single-GPU training.