基于对比自适应二值化和卷积神经网络的马来亚拉姆棕叶手稿数字化

D. Sudarsan, Parvathy Vijayakumar, Sharon Biju, Soniya Sanu, Sreelakshmi K. Shivadas
{"title":"基于对比自适应二值化和卷积神经网络的马来亚拉姆棕叶手稿数字化","authors":"D. Sudarsan, Parvathy Vijayakumar, Sharon Biju, Soniya Sanu, Sreelakshmi K. Shivadas","doi":"10.1109/WISPNET.2018.8538588","DOIUrl":null,"url":null,"abstract":"The palm leaf manuscripts are an abundant source of knowledge, tradition and ancient culture. These scriptures are an unavoidable part of our rich culture and have to be preserved in the best possible way. But the information extraction from palm leaf is a tedious task due to various challenges such as noise enormous character set and the difficulty in reading and understanding the ancient Malayalam script. Handwriting recognition in Malayalam is a challenging and emerging area of pattern recognition. Our proposed system aims at extracting information from old palm leaves (thaaliyola) and translating the ancient Malayalam scripts to their current version based on contrast-based adaptive binarization and convolutional neural networks which simplifies the entire process by avoiding feature extraction. The proposed method is different from the conventional methods which require handcrafted features that are used for classification. Initially, the system is trained with a set of characters. This can be expanded to work with the remaining characters as well. The input will be images of Malayalam palmleaf manuscript and the expected output is their translated script. Our system aims to transform these scripts so as to make it accessible and useful to the current generation. The system will be trained using a number of samples to build a convolutional neural network using which the characters will be recognized.","PeriodicalId":6858,"journal":{"name":"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)","volume":"46 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Digitalization of Malayalam Palmleaf Manuscripts Based on Contrast-Based Adaptive Binarization and Convolutional Neural Networks\",\"authors\":\"D. Sudarsan, Parvathy Vijayakumar, Sharon Biju, Soniya Sanu, Sreelakshmi K. Shivadas\",\"doi\":\"10.1109/WISPNET.2018.8538588\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The palm leaf manuscripts are an abundant source of knowledge, tradition and ancient culture. These scriptures are an unavoidable part of our rich culture and have to be preserved in the best possible way. But the information extraction from palm leaf is a tedious task due to various challenges such as noise enormous character set and the difficulty in reading and understanding the ancient Malayalam script. Handwriting recognition in Malayalam is a challenging and emerging area of pattern recognition. Our proposed system aims at extracting information from old palm leaves (thaaliyola) and translating the ancient Malayalam scripts to their current version based on contrast-based adaptive binarization and convolutional neural networks which simplifies the entire process by avoiding feature extraction. The proposed method is different from the conventional methods which require handcrafted features that are used for classification. Initially, the system is trained with a set of characters. This can be expanded to work with the remaining characters as well. The input will be images of Malayalam palmleaf manuscript and the expected output is their translated script. Our system aims to transform these scripts so as to make it accessible and useful to the current generation. The system will be trained using a number of samples to build a convolutional neural network using which the characters will be recognized.\",\"PeriodicalId\":6858,\"journal\":{\"name\":\"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)\",\"volume\":\"46 1\",\"pages\":\"1-4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISPNET.2018.8538588\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISPNET.2018.8538588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

棕榈叶手稿是知识、传统和古代文化的丰富来源。这些经文是我们丰富文化不可避免的一部分,必须以最好的方式保存下来。但是,由于噪音、庞大的字符集以及阅读和理解古代马拉雅拉姆文字的困难等各种挑战,从棕榈叶中提取信息是一项繁琐的任务。马拉雅拉姆语的手写识别是模式识别中一个具有挑战性的新兴领域。我们提出的系统旨在从古棕榈叶(thaaliyola)中提取信息,并基于基于对比度的自适应二值化和卷积神经网络将古马拉雅拉姆文字翻译成当前版本,从而简化了整个过程,避免了特征提取。该方法不同于传统的方法,传统的方法需要手工制作特征来进行分类。最初,系统用一组字符进行训练。这也可以扩展到其他角色。输入将是马拉雅拉姆棕榈叶手稿的图像,预期输出是他们的翻译脚本。我们的系统旨在转换这些脚本,使其易于访问,并对当前一代有用。该系统将使用大量样本进行训练,以建立一个卷积神经网络,使用该网络将识别字符。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Digitalization of Malayalam Palmleaf Manuscripts Based on Contrast-Based Adaptive Binarization and Convolutional Neural Networks
The palm leaf manuscripts are an abundant source of knowledge, tradition and ancient culture. These scriptures are an unavoidable part of our rich culture and have to be preserved in the best possible way. But the information extraction from palm leaf is a tedious task due to various challenges such as noise enormous character set and the difficulty in reading and understanding the ancient Malayalam script. Handwriting recognition in Malayalam is a challenging and emerging area of pattern recognition. Our proposed system aims at extracting information from old palm leaves (thaaliyola) and translating the ancient Malayalam scripts to their current version based on contrast-based adaptive binarization and convolutional neural networks which simplifies the entire process by avoiding feature extraction. The proposed method is different from the conventional methods which require handcrafted features that are used for classification. Initially, the system is trained with a set of characters. This can be expanded to work with the remaining characters as well. The input will be images of Malayalam palmleaf manuscript and the expected output is their translated script. Our system aims to transform these scripts so as to make it accessible and useful to the current generation. The system will be trained using a number of samples to build a convolutional neural network using which the characters will be recognized.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信