{"title":"Digitalization of Administrative Documents A Digital Transformation Step in Practice","authors":"Sinh Van Nguyen, Dung Anh Nguyen, Lam-Son Pham","doi":"10.1109/NICS54270.2021.9701547","DOIUrl":null,"url":null,"abstract":"Digital transformation is one of the most popular keyword in recent years. It is not only a trend in science research based on the development of information technology, but also a proposed duty that applied in the companies or organizations nowadays. Digitalization of administrative documents is therefore considered as the first step in digital transformation of public organization. Through the digitizing process, the information that were in written format or hard copies will be converted into digital format (e.g. document files) to serve for storing, mining, processing and managing the documents. This paper presents a method to build a web application for digitizing the administrative documents applied in most public organizations. The method is based on the OCR (Optical Character Recognition) combined with the image processing techniques. Our digital process is implemented as following steps. (i) Scanning the hard copies of the administrative documents. (ii) Removing noise data and filtering necessary information in the content based on image processing technique. (iii) Classifying automatically the acquired contents into the respective components of a template form following the structured format of Vietnam Government. (iv) Generating automatically a document file. The application can process a document with a single or multiple pages. To compare with similar applications, our application is processed very fast, without limitation of pages for each document and obtained accuracy as our expectation.","PeriodicalId":296963,"journal":{"name":"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS54270.2021.9701547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Digital transformation is one of the most popular keyword in recent years. It is not only a trend in science research based on the development of information technology, but also a proposed duty that applied in the companies or organizations nowadays. Digitalization of administrative documents is therefore considered as the first step in digital transformation of public organization. Through the digitizing process, the information that were in written format or hard copies will be converted into digital format (e.g. document files) to serve for storing, mining, processing and managing the documents. This paper presents a method to build a web application for digitizing the administrative documents applied in most public organizations. The method is based on the OCR (Optical Character Recognition) combined with the image processing techniques. Our digital process is implemented as following steps. (i) Scanning the hard copies of the administrative documents. (ii) Removing noise data and filtering necessary information in the content based on image processing technique. (iii) Classifying automatically the acquired contents into the respective components of a template form following the structured format of Vietnam Government. (iv) Generating automatically a document file. The application can process a document with a single or multiple pages. To compare with similar applications, our application is processed very fast, without limitation of pages for each document and obtained accuracy as our expectation.