{"title":"Efficiency evaluation of filter sizes on graph convolutional neural networks for information extraction from receipts","authors":"An C. Tran, Bao Thai Le, Hai Thanh Nguyen","doi":"10.1007/s41870-024-02089-1","DOIUrl":null,"url":null,"abstract":"<p>Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"62 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02089-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Graph Neural Networks (GNNs) have attracted considerable attention due to their ability to analyze structured data represented as graphs. In invoice information extraction, GNNs have proven to be a powerful tool for automatically extracting relevant information from invoices, streamlining data entry processes, and improving efficiency. By modeling the invoice layout as a graph and exploiting the inherent structural dependencies, GNNs enable end-to-end extraction by encoding the graph structure and using deep learning techniques. This work proposes a Graph Convolution Network to extract information from invoices. Furthermore, an evaluation of the effect of filter sizes on the model’s accuracy was performed. We built an extraction model based on the filter size selected by the evaluation. We achieved the accuracy of the test set of 96.4% and the training set of 98.5% on the dataset of about 1.500 invoice images we collected.