SYSTEM OF AUTOMATIC SEGMENTATION OF PAUSES IN PHONOGRAMS ON THE BASIS OF NEURON NETWORKS OF THE DEEP LEARNING

Q3 Engineering

Journal of Automation and Information Sciences Pub Date : 2021-01-01 DOI:10.34229/0572-2691-2021-1-8

Viktor I. Soloviev, O. Rybalsky, V. Zhuravel

{"title":"SYSTEM OF AUTOMATIC SEGMENTATION OF PAUSES IN PHONOGRAMS ON THE BASIS OF NEURON NETWORKS OF THE DEEP LEARNING","authors":"Viktor I. Soloviev, O. Rybalsky, V. Zhuravel","doi":"10.34229/0572-2691-2021-1-8","DOIUrl":null,"url":null,"abstract":"The use of neuron networks of the deep learning for the construction of tool for realization of examinations of materials and apparatus of the digital audio recording allows to solve the «frigging» problem of such examination — problem of exposure of tracks of editing in digital phonograms. These networks provide high probability of exposure of such tracks in the pauses of speech information writtenin on a phonogram. Before man-hunting of tracks of editing in the investigated phonogram it is necessary to distinguish pauses (to perform its segmentation), and tool built on the basis of neuron networks of the deep learning, requires its work to be done in automatic mode. The basic requirement of automatic segmentation is high efficiency of selection of pauses in the conditions of permanent change of level of noises in phonograms. It is determined by probability of errors of І and ІІ kinds. It is offered on the basis of neuron networks of the deep learning to create CAS of segmentation of phonograms, possessing high efficiency of selection of pauses in speech information. Thus the system must be independent of level of noises in every concrete pause, and also language, context and announcer, whose speech is fixed in a phonogram. It is suggested to examine pauses as one of the types of voice information, which characteristics differ from characteristics of speech information fixed in a phonogram. For educating of such network it was required to create the primary base of these sounds and pauses. On its basis three arrays of the data, intended for learning, testing and determination of the crooked errors of І and ІІ kinds, are created. After learning and testing the system passed verification on the real phonograms. As a result taking into account some features of speech on the neuron networks of deep learning there has been built the system providing effective segmentation of pauses in phonograms in the automatics mode. The obtained results suit examination that is conformed by given curves over of errors of І and ІІ kinds.","PeriodicalId":54874,"journal":{"name":"Journal of Automation and Information Sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Automation and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34229/0572-2691-2021-1-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}

引用次数: 0

Abstract

The use of neuron networks of the deep learning for the construction of tool for realization of examinations of materials and apparatus of the digital audio recording allows to solve the «frigging» problem of such examination — problem of exposure of tracks of editing in digital phonograms. These networks provide high probability of exposure of such tracks in the pauses of speech information writtenin on a phonogram. Before man-hunting of tracks of editing in the investigated phonogram it is necessary to distinguish pauses (to perform its segmentation), and tool built on the basis of neuron networks of the deep learning, requires its work to be done in automatic mode. The basic requirement of automatic segmentation is high efficiency of selection of pauses in the conditions of permanent change of level of noises in phonograms. It is determined by probability of errors of І and ІІ kinds. It is offered on the basis of neuron networks of the deep learning to create CAS of segmentation of phonograms, possessing high efficiency of selection of pauses in speech information. Thus the system must be independent of level of noises in every concrete pause, and also language, context and announcer, whose speech is fixed in a phonogram. It is suggested to examine pauses as one of the types of voice information, which characteristics differ from characteristics of speech information fixed in a phonogram. For educating of such network it was required to create the primary base of these sounds and pauses. On its basis three arrays of the data, intended for learning, testing and determination of the crooked errors of І and ІІ kinds, are created. After learning and testing the system passed verification on the real phonograms. As a result taking into account some features of speech on the neuron networks of deep learning there has been built the system providing effective segmentation of pauses in phonograms in the automatics mode. The obtained results suit examination that is conformed by given curves over of errors of І and ІІ kinds.

查看原文本刊更多论文

基于深度学习神经元网络的留声图停顿自动分割系统

利用深度学习的神经元网络构建工具来实现对数字录音材料和设备的检查，可以解决这种检查的“恼人”问题-数字录音中编辑轨迹的暴露问题。这些网络提供了在写在音图上的语音信息的停顿中暴露这些轨迹的高概率。在对被调查的音图进行编辑音轨的人工搜索之前，需要对停顿进行区分(进行分割)，而基于深度学习的神经元网络构建的工具要求其工作在自动模式下完成。自动分割的基本要求是在留声机噪声水平不断变化的情况下，能够高效地选择停顿。它是由І和ІІ类的误差概率决定的。在深度学习神经元网络的基础上，提出了一种基于语音图分割的CAS，具有语音信息中停顿选择的高效率。因此，该系统必须独立于每个具体停顿中的噪声水平，以及语言、上下文和播音员(其讲话固定在音图中)。建议将停顿作为语音信息的一种，其特征不同于语音图中固定的语音信息特征。为了教育这种网络，需要创造这些声音和停顿的基本基础。在此基础上，创建了三个数据数组，用于学习、测试和确定І和ІІ类型的弯曲误差。经过学习和测试，该系统通过了在真实录音机上的验证。因此，考虑到深度学习神经元网络中语音的一些特征，建立了在自动模式下对留声图中的停顿进行有效分割的系统。所得结果符合І和ІІ两种误差曲线的检验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Automation and Information Sciences AUTOMATION & CONTROL SYSTEMS-

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： This journal contains translations of papers from the Russian-language bimonthly "Mezhdunarodnyi nauchno-tekhnicheskiy zhurnal "Problemy upravleniya i informatiki". Subjects covered include information sciences such as pattern recognition, forecasting, identification and evaluation of complex systems, information security, fault diagnosis and reliability. In addition, the journal also deals with such automation subjects as adaptive, stochastic and optimal control, control and identification under uncertainty, robotics, and applications of user-friendly computers in management of economic, industrial, biological, and medical systems. The Journal of Automation and Information Sciences will appeal to professionals in control systems, communications, computers, engineering in biology and medicine, instrumentation and measurement, and those interested in the social implications of technology.