{"title":"DDSnet: A Deep Document Segmentation with Hybrid Blocks Architecture Network","authors":"Jing-Ming Guo, Li-Ying Chang, Hao-Hsuan Lee","doi":"10.1109/IS3C50286.2020.00031","DOIUrl":null,"url":null,"abstract":"In recent years, the development of document segmentation technology is gaining more and more attention in the area of semantic segmentation, which plays an important role in the task of understanding the structure of documents. Although this demand applying deep learning approaches has undergone continuing advancement, the document segmentation systems still suffer from low accuracy rate. This paper presents a new high-performance document segmentation algorithm, namely Deep Document Segmentation Network (DDSnet), which incorporates advanced end-to-end deep learning methods for segmenting four different types of document features, including background, texts, tables, and figures. To overcome the small receptive fields, the atrous residual block is proposed, which is efficiently boosted by the adoption of multi-branches structure. For the better fine-grained output, the proposed atrous convolution residual block is conducted to achieve high accuracy. Moreover, this paper also releases the brand-new large-scale database, namely PPSD2019, for document segmentation that provides a pixel-level database for another benchmark. As documented in the experimental results, the proposed document segmentation method achieves a superior segmentation rate than that of the former competitive schemes. As a result, the proposed method and database can be considered as a very competitive candidate for the document segmentation applications.","PeriodicalId":143430,"journal":{"name":"2020 International Symposium on Computer, Consumer and Control (IS3C)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium on Computer, Consumer and Control (IS3C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS3C50286.2020.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In recent years, the development of document segmentation technology is gaining more and more attention in the area of semantic segmentation, which plays an important role in the task of understanding the structure of documents. Although this demand applying deep learning approaches has undergone continuing advancement, the document segmentation systems still suffer from low accuracy rate. This paper presents a new high-performance document segmentation algorithm, namely Deep Document Segmentation Network (DDSnet), which incorporates advanced end-to-end deep learning methods for segmenting four different types of document features, including background, texts, tables, and figures. To overcome the small receptive fields, the atrous residual block is proposed, which is efficiently boosted by the adoption of multi-branches structure. For the better fine-grained output, the proposed atrous convolution residual block is conducted to achieve high accuracy. Moreover, this paper also releases the brand-new large-scale database, namely PPSD2019, for document segmentation that provides a pixel-level database for another benchmark. As documented in the experimental results, the proposed document segmentation method achieves a superior segmentation rate than that of the former competitive schemes. As a result, the proposed method and database can be considered as a very competitive candidate for the document segmentation applications.