D. Sudarsan, Parvathy Vijayakumar, Sharon Biju, Soniya Sanu, Sreelakshmi K. Shivadas
{"title":"Digitalization of Malayalam Palmleaf Manuscripts Based on Contrast-Based Adaptive Binarization and Convolutional Neural Networks","authors":"D. Sudarsan, Parvathy Vijayakumar, Sharon Biju, Soniya Sanu, Sreelakshmi K. Shivadas","doi":"10.1109/WISPNET.2018.8538588","DOIUrl":null,"url":null,"abstract":"The palm leaf manuscripts are an abundant source of knowledge, tradition and ancient culture. These scriptures are an unavoidable part of our rich culture and have to be preserved in the best possible way. But the information extraction from palm leaf is a tedious task due to various challenges such as noise enormous character set and the difficulty in reading and understanding the ancient Malayalam script. Handwriting recognition in Malayalam is a challenging and emerging area of pattern recognition. Our proposed system aims at extracting information from old palm leaves (thaaliyola) and translating the ancient Malayalam scripts to their current version based on contrast-based adaptive binarization and convolutional neural networks which simplifies the entire process by avoiding feature extraction. The proposed method is different from the conventional methods which require handcrafted features that are used for classification. Initially, the system is trained with a set of characters. This can be expanded to work with the remaining characters as well. The input will be images of Malayalam palmleaf manuscript and the expected output is their translated script. Our system aims to transform these scripts so as to make it accessible and useful to the current generation. The system will be trained using a number of samples to build a convolutional neural network using which the characters will be recognized.","PeriodicalId":6858,"journal":{"name":"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)","volume":"46 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISPNET.2018.8538588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The palm leaf manuscripts are an abundant source of knowledge, tradition and ancient culture. These scriptures are an unavoidable part of our rich culture and have to be preserved in the best possible way. But the information extraction from palm leaf is a tedious task due to various challenges such as noise enormous character set and the difficulty in reading and understanding the ancient Malayalam script. Handwriting recognition in Malayalam is a challenging and emerging area of pattern recognition. Our proposed system aims at extracting information from old palm leaves (thaaliyola) and translating the ancient Malayalam scripts to their current version based on contrast-based adaptive binarization and convolutional neural networks which simplifies the entire process by avoiding feature extraction. The proposed method is different from the conventional methods which require handcrafted features that are used for classification. Initially, the system is trained with a set of characters. This can be expanded to work with the remaining characters as well. The input will be images of Malayalam palmleaf manuscript and the expected output is their translated script. Our system aims to transform these scripts so as to make it accessible and useful to the current generation. The system will be trained using a number of samples to build a convolutional neural network using which the characters will be recognized.