深度卷积神经网络的融合

2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR) Pub Date : 2017-10-01 DOI:10.1109/AIPR.2017.8457945

Robert Suchy, Soundararajan Ezekiel, Maria Scalzo-Cornacchia

{"title":"深度卷积神经网络的融合","authors":"Robert Suchy, Soundararajan Ezekiel, Maria Scalzo-Cornacchia","doi":"10.1109/AIPR.2017.8457945","DOIUrl":null,"url":null,"abstract":"In recent years, the concept of big data has become a more prominent research topic as the volume of data and the rate at which it is produced are increasing exponentially. By 2020 the amount of data being stored is estimated to be 44 Zettabytes and currently over 31 Terabytes of data is being generated every second. Algorithms and applications must be able to effectively scale to the volume of data being generated. One such application that has excelled due to the surge in Big Data is the Convolutional Neural Network. The breakthroughs in the development of Graphical Processing Units have led to the advancements in the state-of-the-art on tasks such as image classification and speech recognition. These multi-layered convolutional neural networks are very large, complex and require significant computational resources to train and evaluate models. In this paper, we explore several novel architectures for the fusion of multiple convolutional neural networks, including stacked representation fusions and mixed model fusion. We differ from existing fusion methods in that our approaches take in the raw outputs of several CNN models and use classifiers as fusers. Other methods typically hand-craft the fusion or have used the original input space as the fusion method. Advancements in this area will better enable the leveraging of the vast amount of pre-trained models and improve accuracy of these models. The approaches generated are application agnostic and will apply across a breadth of tasks.","PeriodicalId":128779,"journal":{"name":"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Fusion of Deep Convolutional Neural Networks\",\"authors\":\"Robert Suchy, Soundararajan Ezekiel, Maria Scalzo-Cornacchia\",\"doi\":\"10.1109/AIPR.2017.8457945\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the concept of big data has become a more prominent research topic as the volume of data and the rate at which it is produced are increasing exponentially. By 2020 the amount of data being stored is estimated to be 44 Zettabytes and currently over 31 Terabytes of data is being generated every second. Algorithms and applications must be able to effectively scale to the volume of data being generated. One such application that has excelled due to the surge in Big Data is the Convolutional Neural Network. The breakthroughs in the development of Graphical Processing Units have led to the advancements in the state-of-the-art on tasks such as image classification and speech recognition. These multi-layered convolutional neural networks are very large, complex and require significant computational resources to train and evaluate models. In this paper, we explore several novel architectures for the fusion of multiple convolutional neural networks, including stacked representation fusions and mixed model fusion. We differ from existing fusion methods in that our approaches take in the raw outputs of several CNN models and use classifiers as fusers. Other methods typically hand-craft the fusion or have used the original input space as the fusion method. Advancements in this area will better enable the leveraging of the vast amount of pre-trained models and improve accuracy of these models. The approaches generated are application agnostic and will apply across a breadth of tasks.\",\"PeriodicalId\":128779,\"journal\":{\"name\":\"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIPR.2017.8457945\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2017.8457945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

近年来，随着数据量和产生速度呈指数级增长，大数据的概念已经成为一个更加突出的研究课题。到2020年，存储的数据量预计将达到44zb，目前每秒产生的数据量超过31tb。算法和应用程序必须能够有效地扩展到生成的数据量。由于大数据的激增，卷积神经网络(Convolutional Neural Network)就是这样一个应用程序。图形处理单元发展的突破导致了图像分类和语音识别等任务的最先进技术的进步。这些多层卷积神经网络非常庞大、复杂，需要大量的计算资源来训练和评估模型。本文探讨了多种卷积神经网络融合的新架构，包括堆叠表示融合和混合模型融合。我们与现有融合方法的不同之处在于，我们的方法采用几个CNN模型的原始输出，并使用分类器作为融合器。其他方法通常手工制作融合或使用原始输入空间作为融合方法。这一领域的进步将更好地利用大量预训练的模型，并提高这些模型的准确性。所生成的方法与应用程序无关，并将适用于广泛的任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Fusion of Deep Convolutional Neural Networks

In recent years, the concept of big data has become a more prominent research topic as the volume of data and the rate at which it is produced are increasing exponentially. By 2020 the amount of data being stored is estimated to be 44 Zettabytes and currently over 31 Terabytes of data is being generated every second. Algorithms and applications must be able to effectively scale to the volume of data being generated. One such application that has excelled due to the surge in Big Data is the Convolutional Neural Network. The breakthroughs in the development of Graphical Processing Units have led to the advancements in the state-of-the-art on tasks such as image classification and speech recognition. These multi-layered convolutional neural networks are very large, complex and require significant computational resources to train and evaluate models. In this paper, we explore several novel architectures for the fusion of multiple convolutional neural networks, including stacked representation fusions and mixed model fusion. We differ from existing fusion methods in that our approaches take in the raw outputs of several CNN models and use classifiers as fusers. Other methods typically hand-craft the fusion or have used the original input space as the fusion method. Advancements in this area will better enable the leveraging of the vast amount of pre-trained models and improve accuracy of these models. The approaches generated are application agnostic and will apply across a breadth of tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)

自引率

0.00%

发文量