{"title":"发票中的对象检测","authors":"Andrei-Stefan Bulzan, C. Cernazanu-Glavan","doi":"10.1109/ICSTCC55426.2022.9931900","DOIUrl":null,"url":null,"abstract":"Key field information extraction from documents is an increasingly covetable task. Previous related work has touched upon the subject through the lens of rule-based systems or through natural language processing methods. In this paper we see the task of information extraction from invoices as an object detection task. To this end, we used three different models YOLOv5, Scaled YOLOv4 and Faster R-CNN to detect key field information in invoices. Additionally, we propose a data preprocessing method that helps to better generalize the learning. All of the experiments were performed on a custom made dataset with a very high variety of invoice layouts. This decision comes in part from the lack of any suitable public dataset and from the need of finding the best procedure for annotating data pertaining to this task. The obtained results were encouraging, leading us to the conclusion that object detection is a viable method for information extraction.","PeriodicalId":220845,"journal":{"name":"2022 26th International Conference on System Theory, Control and Computing (ICSTCC)","volume":"8 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Object Detection in Invoices\",\"authors\":\"Andrei-Stefan Bulzan, C. Cernazanu-Glavan\",\"doi\":\"10.1109/ICSTCC55426.2022.9931900\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Key field information extraction from documents is an increasingly covetable task. Previous related work has touched upon the subject through the lens of rule-based systems or through natural language processing methods. In this paper we see the task of information extraction from invoices as an object detection task. To this end, we used three different models YOLOv5, Scaled YOLOv4 and Faster R-CNN to detect key field information in invoices. Additionally, we propose a data preprocessing method that helps to better generalize the learning. All of the experiments were performed on a custom made dataset with a very high variety of invoice layouts. This decision comes in part from the lack of any suitable public dataset and from the need of finding the best procedure for annotating data pertaining to this task. The obtained results were encouraging, leading us to the conclusion that object detection is a viable method for information extraction.\",\"PeriodicalId\":220845,\"journal\":{\"name\":\"2022 26th International Conference on System Theory, Control and Computing (ICSTCC)\",\"volume\":\"8 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 26th International Conference on System Theory, Control and Computing (ICSTCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSTCC55426.2022.9931900\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference on System Theory, Control and Computing (ICSTCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSTCC55426.2022.9931900","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Key field information extraction from documents is an increasingly covetable task. Previous related work has touched upon the subject through the lens of rule-based systems or through natural language processing methods. In this paper we see the task of information extraction from invoices as an object detection task. To this end, we used three different models YOLOv5, Scaled YOLOv4 and Faster R-CNN to detect key field information in invoices. Additionally, we propose a data preprocessing method that helps to better generalize the learning. All of the experiments were performed on a custom made dataset with a very high variety of invoice layouts. This decision comes in part from the lack of any suitable public dataset and from the need of finding the best procedure for annotating data pertaining to this task. The obtained results were encouraging, leading us to the conclusion that object detection is a viable method for information extraction.