{"title":"基于边缘设备的手语识别的数据增强和深度学习建模方法","authors":"Yuzhe Ding, Shaofei Huang, Roubo Peng","doi":"10.1109/ISCTT51595.2020.00093","DOIUrl":null,"url":null,"abstract":"In this paper, methods of realizing sign language recognition (SLR) on mobile edge devices - AutoML and Transfer Learning - are recommended and compared. Their performance in SLR is compared in this article in terms of model size, speed, and accuracy. To perform a more reliable comparative analysis, we first built and applied data augmentation to obtain a benchmark dataset based on Chinese Sign Language (CSL). With this dataset, models were then trained and deployed to mobile edge devices for real-time testing. It is found that data augmentation can effectively improve the diversity of datasets and improve the robustness of the trained model. The results of the model comparison show that AutoML is more dominant in accuracy, and Transfer Learning is more suitable for low-latency applications. In addition, AutoML excels at classifying macroscopically homogeneous images, such as gestures, despite its low training speed. On the other hand, the training process of Transfer Learning is speedy, but low accuracy remains to be its problem. These conclusions are of guiding significance to the lightweight mobile application implementation of SLR.","PeriodicalId":178054,"journal":{"name":"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)","volume":"43 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Augmentation and Deep Learning Modeling Methods on Edge-Device-Based Sign Language Recognition\",\"authors\":\"Yuzhe Ding, Shaofei Huang, Roubo Peng\",\"doi\":\"10.1109/ISCTT51595.2020.00093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, methods of realizing sign language recognition (SLR) on mobile edge devices - AutoML and Transfer Learning - are recommended and compared. Their performance in SLR is compared in this article in terms of model size, speed, and accuracy. To perform a more reliable comparative analysis, we first built and applied data augmentation to obtain a benchmark dataset based on Chinese Sign Language (CSL). With this dataset, models were then trained and deployed to mobile edge devices for real-time testing. It is found that data augmentation can effectively improve the diversity of datasets and improve the robustness of the trained model. The results of the model comparison show that AutoML is more dominant in accuracy, and Transfer Learning is more suitable for low-latency applications. In addition, AutoML excels at classifying macroscopically homogeneous images, such as gestures, despite its low training speed. On the other hand, the training process of Transfer Learning is speedy, but low accuracy remains to be its problem. These conclusions are of guiding significance to the lightweight mobile application implementation of SLR.\",\"PeriodicalId\":178054,\"journal\":{\"name\":\"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)\",\"volume\":\"43 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCTT51595.2020.00093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCTT51595.2020.00093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data Augmentation and Deep Learning Modeling Methods on Edge-Device-Based Sign Language Recognition
In this paper, methods of realizing sign language recognition (SLR) on mobile edge devices - AutoML and Transfer Learning - are recommended and compared. Their performance in SLR is compared in this article in terms of model size, speed, and accuracy. To perform a more reliable comparative analysis, we first built and applied data augmentation to obtain a benchmark dataset based on Chinese Sign Language (CSL). With this dataset, models were then trained and deployed to mobile edge devices for real-time testing. It is found that data augmentation can effectively improve the diversity of datasets and improve the robustness of the trained model. The results of the model comparison show that AutoML is more dominant in accuracy, and Transfer Learning is more suitable for low-latency applications. In addition, AutoML excels at classifying macroscopically homogeneous images, such as gestures, despite its low training speed. On the other hand, the training process of Transfer Learning is speedy, but low accuracy remains to be its problem. These conclusions are of guiding significance to the lightweight mobile application implementation of SLR.