Adnan Öncevarlık, Akın Akön, Kemal Doruk Yıldız, E. Adali
{"title":"Two Level Document Image Classification","authors":"Adnan Öncevarlık, Akın Akön, Kemal Doruk Yıldız, E. Adali","doi":"10.1109/UBMK52708.2021.9558895","DOIUrl":null,"url":null,"abstract":"Classifying documents is an important process for organizations that are responsible for keeping a large number of documents in a digital archive. In this paper, a two-level method was used to classify approximately 253 class of documents. In the first stage, the documents were visually classified. In the second stage, unclassifiable documents were tried to be classified using natural language processing (NLP). Training set was created by classifying approximately 28.000 documents by hand. Studies were carried out on single-page documents. Performances were measured by making experiments on the full, 1/2 and 1/3 of document using AlexNet for image classification, bag-of-words and LSTM algorithm for NLP. The performances of the study were measured according to different options. The success rate was 75% and 85% at the end of first and second stages respectively","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":"186 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Computer Science and Engineering (UBMK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UBMK52708.2021.9558895","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Classifying documents is an important process for organizations that are responsible for keeping a large number of documents in a digital archive. In this paper, a two-level method was used to classify approximately 253 class of documents. In the first stage, the documents were visually classified. In the second stage, unclassifiable documents were tried to be classified using natural language processing (NLP). Training set was created by classifying approximately 28.000 documents by hand. Studies were carried out on single-page documents. Performances were measured by making experiments on the full, 1/2 and 1/3 of document using AlexNet for image classification, bag-of-words and LSTM algorithm for NLP. The performances of the study were measured according to different options. The success rate was 75% and 85% at the end of first and second stages respectively