Combination of Multiple Image Features along with KNN Classifier for Classification of Marathi Barakhadi

Dhanashree Joshi, S. Pansare
{"title":"Combination of Multiple Image Features along with KNN Classifier for Classification of Marathi Barakhadi","authors":"Dhanashree Joshi, S. Pansare","doi":"10.1109/ICCUBEA.2015.124","DOIUrl":null,"url":null,"abstract":"Character recognition is an emerging area for research in terms of different languages spoken all over the world and the associated writing of them. India itself has 11 different scripts and each script has its own subscripts. This diversity gives a wide scope for research out of which devnagari script has been chosen for studying its problems and solutions for those problems. Devnagari has marathi as one of its complicated language which has barakhadi as its characteristic part. A lot of researchers have worked on determining the marathi characters more efficiently, problem listed during this work are the styles of writing, strokes, aspect ratio etc. Data mining is evolving in various fields such as satellite images, medical images, object specific images etc. This paper discusses a new system that combines the Image processing methods along with the data mining classification algorithm which is a new trend called as image mining. The proposed technique applies data acquisition, pre-processing steps such as grayscale conversion, edge detection, binarization and feature extraction methods such as hu moments and GLCM feature extraction from image processing and extracted features are given to Data mining KNN classification algorithm for getting the classification results. The Database used is handwritten barakhadi of 3024 images of 36 barakhadi consonants and 12 vowels written by 7 different people from different age groups. The Proposed system will efficiently and effectively classify the character into its exact category and will reflect a very high performance as compared to others for this hybrid system which is never done before.","PeriodicalId":325841,"journal":{"name":"2015 International Conference on Computing Communication Control and Automation","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Computing Communication Control and Automation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCUBEA.2015.124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Character recognition is an emerging area for research in terms of different languages spoken all over the world and the associated writing of them. India itself has 11 different scripts and each script has its own subscripts. This diversity gives a wide scope for research out of which devnagari script has been chosen for studying its problems and solutions for those problems. Devnagari has marathi as one of its complicated language which has barakhadi as its characteristic part. A lot of researchers have worked on determining the marathi characters more efficiently, problem listed during this work are the styles of writing, strokes, aspect ratio etc. Data mining is evolving in various fields such as satellite images, medical images, object specific images etc. This paper discusses a new system that combines the Image processing methods along with the data mining classification algorithm which is a new trend called as image mining. The proposed technique applies data acquisition, pre-processing steps such as grayscale conversion, edge detection, binarization and feature extraction methods such as hu moments and GLCM feature extraction from image processing and extracted features are given to Data mining KNN classification algorithm for getting the classification results. The Database used is handwritten barakhadi of 3024 images of 36 barakhadi consonants and 12 vowels written by 7 different people from different age groups. The Proposed system will efficiently and effectively classify the character into its exact category and will reflect a very high performance as compared to others for this hybrid system which is never done before.
多图像特征结合KNN分类器对马拉地语Barakhadi进行分类
字符识别是研究世界各地不同语言及其相关文字的新兴领域。印度本身有11种不同的文字,每种文字都有自己的下标。这种多样性提供了广泛的研究范围,从中选择了devnagari文字来研究其问题和解决这些问题的方法。Devnagari的马拉地语是其复杂的语言之一,巴拉卡迪语是其特征部分。许多研究人员都致力于更有效地确定马拉地文字,在这项工作中列出的问题是书写风格,笔画,纵横比等。数据挖掘在卫星图像、医学图像、特定对象图像等各个领域不断发展。本文讨论了一种将图像处理方法与数据挖掘分类算法相结合的新系统,这是图像挖掘的新趋势。该技术将数据采集、灰度转换、边缘检测、二值化等预处理步骤以及图像处理中的hu矩和GLCM特征提取等特征提取方法,并将提取的特征交给数据挖掘KNN分类算法,得到分类结果。使用的数据库是由7个不同年龄段的人手写的巴拉卡迪语,包含3024张巴拉卡迪语的36个辅音和12个元音。所提出的系统将有效地将角色分类到其确切的类别中,并且与其他混合系统相比,将反映出非常高的性能,这是以前从未做过的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信