Convolutional Networks Used to Classify Video and Audio Data

Research Papers Faculty of Materials Science and Technology Slovak University of Technology Pub Date : 2019-09-01 DOI:10.2478/rput-2019-0034

Marcel Nikmon, Roman Budjac, Daniel Kuchár, Peter Schreiber, D. Janácová

{"title":"Convolutional Networks Used to Classify Video and Audio Data","authors":"Marcel Nikmon, Roman Budjac, Daniel Kuchár, Peter Schreiber, D. Janácová","doi":"10.2478/rput-2019-0034","DOIUrl":null,"url":null,"abstract":"Abstract Deep learning is a kind of machine learning, and machine learning is a kind of artificial intelligence. Machine learning depicts groups of various technologies, and deep learning is one of them. The use of deep learning is an integral part of the current data classification practice in today’s world. This paper introduces the possibilities of classification using convolutional networks. Experiments focused on audio and video data show different approaches to data classification. Most experiments use the well-known pre-trained AlexNet network with various pre-processing types of input data. However, there are also comparisons of other neural network architectures, and we also show the results of training on small and larger datasets. The paper comprises description of eight different kinds of experiments. Several training sessions were conducted in each experiment with different aspects that were monitored. The focus was put on the effect of batch size on the accuracy of deep learning, including many other parameters that affect deep learning [1].","PeriodicalId":21013,"journal":{"name":"Research Papers Faculty of Materials Science and Technology Slovak University of Technology","volume":"157 1","pages":"113 - 120"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Papers Faculty of Materials Science and Technology Slovak University of Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/rput-2019-0034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Abstract Deep learning is a kind of machine learning, and machine learning is a kind of artificial intelligence. Machine learning depicts groups of various technologies, and deep learning is one of them. The use of deep learning is an integral part of the current data classification practice in today’s world. This paper introduces the possibilities of classification using convolutional networks. Experiments focused on audio and video data show different approaches to data classification. Most experiments use the well-known pre-trained AlexNet network with various pre-processing types of input data. However, there are also comparisons of other neural network architectures, and we also show the results of training on small and larger datasets. The paper comprises description of eight different kinds of experiments. Several training sessions were conducted in each experiment with different aspects that were monitored. The focus was put on the effect of batch size on the accuracy of deep learning, including many other parameters that affect deep learning [1].

查看原文本刊更多论文

用于视频和音频数据分类的卷积网络

深度学习是机器学习的一种，机器学习是人工智能的一种。机器学习描述了各种技术的组合，深度学习就是其中之一。深度学习的使用是当今世界当前数据分类实践的一个组成部分。本文介绍了使用卷积网络进行分类的可能性。针对音频和视频数据的实验显示了不同的数据分类方法。大多数实验使用众所周知的预训练AlexNet网络和各种预处理类型的输入数据。然而，也有其他神经网络架构的比较，我们也展示了小数据集和大数据集的训练结果。本文包括八种不同实验的描述。在每个实验中都进行了几次培训，对不同方面进行了监测。重点放在批大小对深度学习准确性的影响上，包括影响深度学习的许多其他参数[1]。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Research Papers Faculty of Materials Science and Technology Slovak University of Technology

自引率

0.00%

发文量