{"title":"利用深度学习检测古鲁姆希语报纸复杂版面的混合方法","authors":"Atul Kumar, Gurpreet Singh Lehal","doi":"10.52756/ijerr.2023.v35spl.004","DOIUrl":null,"url":null,"abstract":"Layout analysis is the crucial stage in the recognition system of newspapers. A good layout analysis results in better recognition results. In this paper, we detected the complex layout of newspapers in the Gurumukhi script. We have used a hybrid approach. In this approach, firstly, we proposed an algorithm to remove pictures from newspaper images that involves various image preprocessing tasks based on binarization, finding contours, and erosion on the image to remove the graphics from the image. This method also removes pictures from complex non-Manhattan layouts. Finally, we have trained the deep-leaning model based on a convolutional network to detect the columns of text from newspapers. We have created a dataset of 500 images labelled with five classes on which the model was trained. We have tested this method on the number of newspapers of the Gurumukhi script. The results show very good accuracy with this hybrid approach of layout detection.","PeriodicalId":190842,"journal":{"name":"International Journal of Experimental Research and Review","volume":"59 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Approach for Complex Layout Detection of Newspapers in Gurumukhi Script Using Deep Learning\",\"authors\":\"Atul Kumar, Gurpreet Singh Lehal\",\"doi\":\"10.52756/ijerr.2023.v35spl.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Layout analysis is the crucial stage in the recognition system of newspapers. A good layout analysis results in better recognition results. In this paper, we detected the complex layout of newspapers in the Gurumukhi script. We have used a hybrid approach. In this approach, firstly, we proposed an algorithm to remove pictures from newspaper images that involves various image preprocessing tasks based on binarization, finding contours, and erosion on the image to remove the graphics from the image. This method also removes pictures from complex non-Manhattan layouts. Finally, we have trained the deep-leaning model based on a convolutional network to detect the columns of text from newspapers. We have created a dataset of 500 images labelled with five classes on which the model was trained. We have tested this method on the number of newspapers of the Gurumukhi script. The results show very good accuracy with this hybrid approach of layout detection.\",\"PeriodicalId\":190842,\"journal\":{\"name\":\"International Journal of Experimental Research and Review\",\"volume\":\"59 3\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Experimental Research and Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52756/ijerr.2023.v35spl.004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Experimental Research and Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52756/ijerr.2023.v35spl.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Hybrid Approach for Complex Layout Detection of Newspapers in Gurumukhi Script Using Deep Learning
Layout analysis is the crucial stage in the recognition system of newspapers. A good layout analysis results in better recognition results. In this paper, we detected the complex layout of newspapers in the Gurumukhi script. We have used a hybrid approach. In this approach, firstly, we proposed an algorithm to remove pictures from newspaper images that involves various image preprocessing tasks based on binarization, finding contours, and erosion on the image to remove the graphics from the image. This method also removes pictures from complex non-Manhattan layouts. Finally, we have trained the deep-leaning model based on a convolutional network to detect the columns of text from newspapers. We have created a dataset of 500 images labelled with five classes on which the model was trained. We have tested this method on the number of newspapers of the Gurumukhi script. The results show very good accuracy with this hybrid approach of layout detection.