基于特征重用卷积神经网络的医学图像文本区域检测

Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19 Pub Date : 2019-10-16 DOI:10.1145/3366715.3366738

Yang Liu, Jun Liu, S. Sun, Zhuang Du

{"title":"基于特征重用卷积神经网络的医学图像文本区域检测","authors":"Yang Liu, Jun Liu, S. Sun, Zhuang Du","doi":"10.1145/3366715.3366738","DOIUrl":null,"url":null,"abstract":"In order to solve the problem of Chinese medical image text being missed and misdetected under the CTPN model, a new convolutional neural network DVNet based on the fusion of VGG convolutional neural network and DenseNet dense network was proposed. DVNet takes the first two layers of VGG network for deep feature extraction, and then connects DenseNet dense modules. Using the idea of feature reuse, the features of the front convolutional layer and the features of the back convolutional layer are output together. During post-processing, NMS is used to filter out redundant text boxes. In the Chinese medical text data set provided, three different networks, VGG, DenseNet and DVNet, were used to detect the text. The experimental results showed that the precision rate of DVNet were improved by 2%-3% compared with VGG and DenseNet.","PeriodicalId":425980,"journal":{"name":"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Medical Image Text Area Detection Based on Feature Reuse Convolutional Neural Network\",\"authors\":\"Yang Liu, Jun Liu, S. Sun, Zhuang Du\",\"doi\":\"10.1145/3366715.3366738\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to solve the problem of Chinese medical image text being missed and misdetected under the CTPN model, a new convolutional neural network DVNet based on the fusion of VGG convolutional neural network and DenseNet dense network was proposed. DVNet takes the first two layers of VGG network for deep feature extraction, and then connects DenseNet dense modules. Using the idea of feature reuse, the features of the front convolutional layer and the features of the back convolutional layer are output together. During post-processing, NMS is used to filter out redundant text boxes. In the Chinese medical text data set provided, three different networks, VGG, DenseNet and DVNet, were used to detect the text. The experimental results showed that the precision rate of DVNet were improved by 2%-3% compared with VGG and DenseNet.\",\"PeriodicalId\":425980,\"journal\":{\"name\":\"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3366715.3366738\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3366715.3366738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了解决CTPN模型下中医图像文本的漏检和误检问题，提出了一种基于VGG卷积神经网络和DenseNet密集网络融合的新型卷积神经网络DVNet。DVNet采用前两层VGG网络进行深度特征提取，然后连接DenseNet密集模块。采用特征重用的思想，将前卷积层的特征和后卷积层的特征一起输出。在后处理过程中，使用NMS对多余的文本框进行过滤。在提供的中医文本数据集中，使用VGG、DenseNet和DVNet三种不同的网络对文本进行检测。实验结果表明，与VGG和DenseNet相比，DVNet的准确率提高了2% ~ 3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Medical Image Text Area Detection Based on Feature Reuse Convolutional Neural Network

In order to solve the problem of Chinese medical image text being missed and misdetected under the CTPN model, a new convolutional neural network DVNet based on the fusion of VGG convolutional neural network and DenseNet dense network was proposed. DVNet takes the first two layers of VGG network for deep feature extraction, and then connects DenseNet dense modules. Using the idea of feature reuse, the features of the front convolutional layer and the features of the back convolutional layer are output together. During post-processing, NMS is used to filter out redundant text boxes. In the Chinese medical text data set provided, three different networks, VGG, DenseNet and DVNet, were used to detect the text. The experimental results showed that the precision rate of DVNet were improved by 2%-3% compared with VGG and DenseNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19

自引率

0.00%

发文量