Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah
{"title":"基于深度学习和图像处理的数据增强表信息提取","authors":"Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah","doi":"10.1109/TSSA56819.2022.10063909","DOIUrl":null,"url":null,"abstract":"Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.","PeriodicalId":164665,"journal":{"name":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","volume":"77 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing\",\"authors\":\"Izuardo Zulkarnain, Rin Rin Nurmalasari, F. N. Azizah\",\"doi\":\"10.1109/TSSA56819.2022.10063909\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.\",\"PeriodicalId\":164665,\"journal\":{\"name\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"volume\":\"77 7\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSSA56819.2022.10063909\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSSA56819.2022.10063909","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing
Generally, the extraction of information in the table is done quickly if the table is within a document with a tabular structure. However, in the case of tables presented in the image document, steps are needed first to detect the table. Seeing a table in image documents becomes more complicated if the table to be seen does not have clear boundaries. This research focuses on extracting information from borderless tables in image documents. The study applies the Mask RCNN-FPN deep learning model to detect borderless tables using augmentation data. The use of data augmentation is expected to increase the accuracy of deep learning models even though there is only a small amount of training data available. The data augmentation technique proposed in this study is a fine-tuning method with CutMask augmentation data. For model formation and testing, this study uses the UNLV data set. This data set consists of scanned images of documents from various sources, including financial reports, journals, and different tabular research papers. The total amount of data used is 427 samples. After data augmentation, the amount of data used is 854 samples. The table detection model is based on the Mask RCNN created using Python. The testing parameters used for table detection quality are precise detection, partial detection, false detection, Precision, Recall, and F-Measure. The table's structure recognition rate is measured from the detection intersection value, rows, columns, and cells compared to ground truth. The test results show that using data augmentation with the CutMask technique can improve the performance of deep learning models to detect borderless tables. The use of image processing is shown to enhance table segmentation. However, the table structure recognition result does not offer a good result compared to the effects of other research.