P. R. S. C. Junior, Alexandre Abreu de Freitas, Rafael Daisuke Akiyama, B. Miranda, Tiago Araújo, Carlos G. R. Santos, B. Meiguins, J. Morais
2017 21st International Conference Information Visualisation (IV), published 2017-07-01. DOI: 10.1109/iV.2017.37 (https://doi.org/10.1109/iV.2017.37)
Architecture Proposal for Data Extraction of Chart Images Using Convolutional Neural Network
Many information visualization techniques appear in the literature, reflecting the quantity and variety of data stored in computational systems. In this context, classifying chart images is important because it lets different types of graphs be detected automatically, which in turn enables processing specific to each type of visualization, such as data extraction. Several image classification techniques can be used; the most common extract features from the images and then classify using those features. One technique that has been gaining prominence in image classification, however, is the Convolutional Neural Network (CNN). It is based on deep learning and, in a way, encapsulates the feature extraction process. This article proposes a client-server architecture that classifies a chart image and then extracts data from it. The main advantage is that the CNN processing runs on the server, so the application is not constrained by the limitations of the client device. To this end, an image dataset with ten classes of graphs was built from the web. The experiments indicate that the technique is feasible and that the architecture can be modified to improve the accuracy of the model.
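The classify-then-extract flow described in the abstract might be sketched as follows. This is a minimal illustration, not the authors' implementation: the class names (`bar`, `line`, `pie`), the function names, and the JSON reply format are all assumptions, since the abstract does not list the ten classes or specify a protocol. The CNN is replaced by a trivial stub and the network hop by a direct function call, so that the sketch runs with the Python standard library alone.

```python
import json
from typing import Callable, Dict

# Hypothetical chart classes: the abstract mentions ten classes but does not name them.
CHART_CLASSES = ("bar", "line", "pie")


def classify_chart(image_bytes: bytes) -> str:
    """Stand-in for the server-side CNN (assumption, not a real classifier).

    A real server would decode the image and run a trained network here.
    This stub derives a label from the payload length so the sketch is runnable.
    """
    return CHART_CLASSES[len(image_bytes) % len(CHART_CLASSES)]


def make_extractor(chart_type: str) -> Callable[[bytes], dict]:
    """Build a placeholder extractor for one chart type.

    The actual extraction logic (axes, bars, slices, ...) is omitted.
    """
    def extract(image_bytes: bytes) -> dict:
        return {"extractor": chart_type, "values": []}
    return extract


# One extractor per class, so each chart type gets its own processing step.
EXTRACTORS: Dict[str, Callable[[bytes], dict]] = {
    name: make_extractor(name) for name in CHART_CLASSES
}


def handle_request(image_bytes: bytes) -> str:
    """Server entry point: classify the image, then run the matching extractor."""
    chart_type = classify_chart(image_bytes)
    data = EXTRACTORS[chart_type](image_bytes)
    return json.dumps({"chart_type": chart_type, "data": data})


def client_submit(image_bytes: bytes) -> dict:
    """Client side: send the image and parse the JSON reply.

    The heavy work stays in handle_request, mirroring the idea that the
    client device does not run the CNN itself.
    """
    return json.loads(handle_request(image_bytes))
```

Calling `client_submit(image_bytes)` returns a dictionary with the predicted `chart_type` and the extractor's output. Keeping the extractors in a lookup table keyed by class means a new chart type can be supported by adding one entry, without touching the request handler.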