{"title":"Classification of Ancient Epigraphs into Different Periods Using Random Forests","authors":"Soumya A, G. Hemantha Kumar","doi":"10.1109/ICSIP.2014.33","DOIUrl":null,"url":null,"abstract":"Epigraphists, who identify the ancient inscriptions, reconstruct, translate, draw conclusions about the writings, and classify their uses according to dates, are decreasing in number and also because of the fact that repetitive tasks can be exhausting for humans and prone to errors there is a need arising for the automation of these kinds of tasks. It is observed that the characters of a script have evolved over years and transformed to the current form. The purpose of this work is to estimate the period of an epigraph which is the initial step towards automating the task of reading and deciphering inscriptions. The proposed system considers a reconstructed grayscale image of an epigraph pertaining to ancient Kannada script as its input, which is binarized using Otsu's method and then segmented to characters using Connected Component analysis. Normalized Central Moments and Zernike Moments are extracted from the segmented characters and used as the feature vectors for classification. Random Forest (RF) is used as the classifier, which is an ensemble of classification trees, and each tree votes for a class and the output class is the majority of the votes which determines the era of the input epigraph. The system developed is used to classify ancient Kannada epigraphs belonging to the period of any of these dynasties: Ashoka, Satavahana, Kadamba, Chalukya, Rastrakuta and Hoysala. The system showed good results when tested on 110 Kannada epigraph images from different eras. An analysis of the prediction rate of the epigraphs was carried out and obtained a rate of 85% using RF classifier.","PeriodicalId":111591,"journal":{"name":"2014 Fifth International Conference on Signal and Image Processing","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Fifth International Conference on Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIP.2014.33","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Epigraphists, who identify the ancient inscriptions, reconstruct, translate, draw conclusions about the writings, and classify their uses according to dates, are decreasing in number and also because of the fact that repetitive tasks can be exhausting for humans and prone to errors there is a need arising for the automation of these kinds of tasks. It is observed that the characters of a script have evolved over years and transformed to the current form. The purpose of this work is to estimate the period of an epigraph which is the initial step towards automating the task of reading and deciphering inscriptions. The proposed system considers a reconstructed grayscale image of an epigraph pertaining to ancient Kannada script as its input, which is binarized using Otsu's method and then segmented to characters using Connected Component analysis. Normalized Central Moments and Zernike Moments are extracted from the segmented characters and used as the feature vectors for classification. Random Forest (RF) is used as the classifier, which is an ensemble of classification trees, and each tree votes for a class and the output class is the majority of the votes which determines the era of the input epigraph. The system developed is used to classify ancient Kannada epigraphs belonging to the period of any of these dynasties: Ashoka, Satavahana, Kadamba, Chalukya, Rastrakuta and Hoysala. The system showed good results when tested on 110 Kannada epigraph images from different eras. An analysis of the prediction rate of the epigraphs was carried out and obtained a rate of 85% using RF classifier.