{"title":"GridMask Based Data Augmentation For Bengali Handwritten Grapheme Classification","authors":"Jiayu Yang","doi":"10.1145/3399637.3399650","DOIUrl":null,"url":null,"PeriodicalId":248664,"journal":{"name":"Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3399637.3399650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
GridMask Based Data Augmentation For Bengali Handwritten Grapheme Classification
In this paper, we describe a deep learning-based approach to Bengali handwritten grapheme classification. Specifically, our recognition approach is based on convolutional neural networks (CNNs), as deep CNNs have achieved strong performance on many visual recognition tasks. Moreover, we employ GridMask-based data augmentation to further improve recognition performance. We compare GridMask-based data augmentation with conventional data augmentations (such as flip, rotation, and mixup) on three widely used CNN architectures: ResNet101, DenseNet169, and EfficientNet B0. Extensive experiments demonstrate that GridMask can use information removal to improve the robustness of the neural networks, and the boost in hierarchical macro-averaged recall on the validation set suggests that GridMask data augmentation can be efficiently used for Bengali handwritten grapheme analysis without any prior grapheme segmentation.
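
To make the "information removal" idea concrete, the following is a minimal NumPy sketch of a GridMask-style augmentation: it zeroes out a regular grid of squares in an image. It is a simplified illustration, not the paper's implementation. The function name `grid_mask` and the parameters `d` (grid period in pixels), `ratio` (side of each removed square as a fraction of `d`), `offset_x`, and `offset_y` are assumptions made here for illustration, and this sketch does not rotate the grid.

```python
import numpy as np


def grid_mask(image, d=32, ratio=0.5, offset_x=0, offset_y=0):
    """Zero out a regular grid of squares in an H x W or H x W x C image.

    All parameter names are illustrative assumptions, not taken from the paper:
      d        -- period of the grid in pixels
      ratio    -- side of each removed square as a fraction of d
      offset_* -- shift of the grid along each axis, in pixels
    """
    h, w = image.shape[:2]
    side = int(round(d * ratio))  # side length of each removed square

    # Position of every row/column inside its grid cell of size d.
    ys = (np.arange(h) + offset_y) % d
    xs = (np.arange(w) + offset_x) % d

    # A pixel is dropped when both its row and column fall in the
    # first `side` positions of the cell.
    drop = (ys[:, None] < side) & (xs[None, :] < side)
    keep = ~drop
    if image.ndim == 3:
        keep = keep[:, :, None]  # broadcast the mask over channels
    return image * keep


def random_grid_mask(image, rng, d_min=16, d_max=64, ratio=0.5):
    """Apply grid_mask with a randomly chosen period and offsets."""
    d = int(rng.integers(d_min, d_max + 1))
    offset_x = int(rng.integers(0, d))
    offset_y = int(rng.integers(0, d))
    return grid_mask(image, d=d, ratio=ratio,
                     offset_x=offset_x, offset_y=offset_y)
```

In a training pipeline, a function like `random_grid_mask(img, np.random.default_rng(0))` would typically be applied per sample, so that each image loses a different set of regions and the network cannot rely on any single local stroke of the grapheme.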