{"title":"基于全卷积网络的ASAR 2018竞赛页面布局分析","authors":"Ahmad Droby, Berat Kurar Barakat, Jihad El-Sana","doi":"10.1109/ASAR.2018.8480326","DOIUrl":null,"url":null,"abstract":"This technical report presents a Fully Convolutional Network based method for layout analysis of benchmarking dataset provided by the competition. The document image is segmented into text and non-text zones by dense pixel prediction. Convolutional part of the network can learn useful features from the document images and is robust to uncontrained layouts. We have evaluated the zone segmentation with average black pixel rate, over-segmentation error, under-segmentation error, correct-segmentation, missed-segmentation error, false alarm error, overall block error rate whereas the zone classification with precision, recall, F1-measure and average class accuracy on both pixel and block levels.","PeriodicalId":165564,"journal":{"name":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"ASAR 2018 Competition Page Layout Analysis Using Fully Convolutional Networks\",\"authors\":\"Ahmad Droby, Berat Kurar Barakat, Jihad El-Sana\",\"doi\":\"10.1109/ASAR.2018.8480326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This technical report presents a Fully Convolutional Network based method for layout analysis of benchmarking dataset provided by the competition. The document image is segmented into text and non-text zones by dense pixel prediction. Convolutional part of the network can learn useful features from the document images and is robust to uncontrained layouts. We have evaluated the zone segmentation with average black pixel rate, over-segmentation error, under-segmentation error, correct-segmentation, missed-segmentation error, false alarm error, overall block error rate whereas the zone classification with precision, recall, F1-measure and average class accuracy on both pixel and block levels.\",\"PeriodicalId\":165564,\"journal\":{\"name\":\"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASAR.2018.8480326\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASAR.2018.8480326","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
ASAR 2018 Competition Page Layout Analysis Using Fully Convolutional Networks
This technical report presents a Fully Convolutional Network based method for layout analysis of benchmarking dataset provided by the competition. The document image is segmented into text and non-text zones by dense pixel prediction. Convolutional part of the network can learn useful features from the document images and is robust to uncontrained layouts. We have evaluated the zone segmentation with average black pixel rate, over-segmentation error, under-segmentation error, correct-segmentation, missed-segmentation error, false alarm error, overall block error rate whereas the zone classification with precision, recall, F1-measure and average class accuracy on both pixel and block levels.