Application of Feed-Forward Neural Network and MMI-Supervised Vector Quantizer to the Task of Content Based Audio Segmentation by Co-operative Unmanned Flying Robots
{"title":"Application of Feed-Forward Neural Network and MMI-Supervised Vector Quantizer to the Task of Content Based Audio Segmentation by Co-operative Unmanned Flying Robots","authors":"L. Janku, K. Hyniova","doi":"10.1109/ISMS.2010.32","DOIUrl":null,"url":null,"abstract":"This paper deals with the preliminary experiments on general audio segmentation using a MMI-supervised tree-based vector quantizer and feed-forward neural network. This method has been tested with the aim of detection of environmental sounds and speech in a sound stream. The segmentation of an audio stream is needed for successful localization of speech or environmental sounds in a stream and their possible future classification or even separation. This method has been developed as a preliminary solution of the task of real-world audio signal segmentation by a set of co-operative unmanned flying robots. Application of the proposed method has been tested in simulating software NESCUAR 1.0. (Natural Environment Simulator for Cooperative Unmanned Aerial Robots, version 1.0), a simulating software tool developed by the authors of this paper. The presented method can be also applied separately; its application is not dependent on the simulating software NESCUAR 1.0.","PeriodicalId":434315,"journal":{"name":"2010 International Conference on Intelligent Systems, Modelling and Simulation","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Intelligent Systems, Modelling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISMS.2010.32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper deals with the preliminary experiments on general audio segmentation using a MMI-supervised tree-based vector quantizer and feed-forward neural network. This method has been tested with the aim of detection of environmental sounds and speech in a sound stream. The segmentation of an audio stream is needed for successful localization of speech or environmental sounds in a stream and their possible future classification or even separation. This method has been developed as a preliminary solution of the task of real-world audio signal segmentation by a set of co-operative unmanned flying robots. Application of the proposed method has been tested in simulating software NESCUAR 1.0. (Natural Environment Simulator for Cooperative Unmanned Aerial Robots, version 1.0), a simulating software tool developed by the authors of this paper. The presented method can be also applied separately; its application is not dependent on the simulating software NESCUAR 1.0.