{"title":"基于深度神经网络迁移学习的佛法峨山文字分类","authors":"Narit Hnoohom, Sumeth Yuenyong","doi":"10.1109/JCSSE.2018.8457361","DOIUrl":null,"url":null,"abstract":"We present an image classification of Dhamma Esan characters by fine-tuning the Inception V3 deep neural network trained on the ImageNet dataset. Dhamma Esan is a traditional alphabet used in the north-eastern region of Thailand, primarily written on Corypha leaves for the purpose of recording Buddhist scriptures. Preservation of these historical documents calls for the ability to classify the characters of the alphabet in order to facilitate digital indexing and searching, as well as assist anyone trying to read them. Our dataset consists of over 70,000 Dhamma Esan character images, much larger than any previous work. The result of ten-fold cross-validation showed that our model had 100% accuracy for four folds, and 99.99% for the other six folds. The previous best accuracy reported was 97.77%. We also developed a Dhamma Esan character classification web service where users can upload images of characters and get immediate classification results as well as mapping to the modern Thai alphabet.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Classification of Dhamma Esan Characters By Transfer Learning of a Deep Neural Network\",\"authors\":\"Narit Hnoohom, Sumeth Yuenyong\",\"doi\":\"10.1109/JCSSE.2018.8457361\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present an image classification of Dhamma Esan characters by fine-tuning the Inception V3 deep neural network trained on the ImageNet dataset. Dhamma Esan is a traditional alphabet used in the north-eastern region of Thailand, primarily written on Corypha leaves for the purpose of recording Buddhist scriptures. Preservation of these historical documents calls for the ability to classify the characters of the alphabet in order to facilitate digital indexing and searching, as well as assist anyone trying to read them. Our dataset consists of over 70,000 Dhamma Esan character images, much larger than any previous work. The result of ten-fold cross-validation showed that our model had 100% accuracy for four folds, and 99.99% for the other six folds. The previous best accuracy reported was 97.77%. We also developed a Dhamma Esan character classification web service where users can upload images of characters and get immediate classification results as well as mapping to the modern Thai alphabet.\",\"PeriodicalId\":338973,\"journal\":{\"name\":\"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCSSE.2018.8457361\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2018.8457361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of Dhamma Esan Characters By Transfer Learning of a Deep Neural Network
We present an image classification of Dhamma Esan characters by fine-tuning the Inception V3 deep neural network trained on the ImageNet dataset. Dhamma Esan is a traditional alphabet used in the north-eastern region of Thailand, primarily written on Corypha leaves for the purpose of recording Buddhist scriptures. Preservation of these historical documents calls for the ability to classify the characters of the alphabet in order to facilitate digital indexing and searching, as well as assist anyone trying to read them. Our dataset consists of over 70,000 Dhamma Esan character images, much larger than any previous work. The result of ten-fold cross-validation showed that our model had 100% accuracy for four folds, and 99.99% for the other six folds. The previous best accuracy reported was 97.77%. We also developed a Dhamma Esan character classification web service where users can upload images of characters and get immediate classification results as well as mapping to the modern Thai alphabet.