{"title":"前馈神经网络和mmi监督矢量量化在协同无人飞行机器人基于内容的音频分割任务中的应用","authors":"L. Janku, K. Hyniova","doi":"10.1109/ISMS.2010.32","DOIUrl":null,"url":null,"abstract":"This paper deals with the preliminary experiments on general audio segmentation using a MMI-supervised tree-based vector quantizer and feed-forward neural network. This method has been tested with the aim of detection of environmental sounds and speech in a sound stream. The segmentation of an audio stream is needed for successful localization of speech or environmental sounds in a stream and their possible future classification or even separation. This method has been developed as a preliminary solution of the task of real-world audio signal segmentation by a set of co-operative unmanned flying robots. Application of the proposed method has been tested in simulating software NESCUAR 1.0. (Natural Environment Simulator for Cooperative Unmanned Aerial Robots, version 1.0), a simulating software tool developed by the authors of this paper. The presented method can be also applied separately; its application is not dependent on the simulating software NESCUAR 1.0.","PeriodicalId":434315,"journal":{"name":"2010 International Conference on Intelligent Systems, Modelling and Simulation","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Application of Feed-Forward Neural Network and MMI-Supervised Vector Quantizer to the Task of Content Based Audio Segmentation by Co-operative Unmanned Flying Robots\",\"authors\":\"L. Janku, K. Hyniova\",\"doi\":\"10.1109/ISMS.2010.32\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with the preliminary experiments on general audio segmentation using a MMI-supervised tree-based vector quantizer and feed-forward neural network. This method has been tested with the aim of detection of environmental sounds and speech in a sound stream. The segmentation of an audio stream is needed for successful localization of speech or environmental sounds in a stream and their possible future classification or even separation. This method has been developed as a preliminary solution of the task of real-world audio signal segmentation by a set of co-operative unmanned flying robots. Application of the proposed method has been tested in simulating software NESCUAR 1.0. (Natural Environment Simulator for Cooperative Unmanned Aerial Robots, version 1.0), a simulating software tool developed by the authors of this paper. The presented method can be also applied separately; its application is not dependent on the simulating software NESCUAR 1.0.\",\"PeriodicalId\":434315,\"journal\":{\"name\":\"2010 International Conference on Intelligent Systems, Modelling and Simulation\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-01-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Intelligent Systems, Modelling and Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISMS.2010.32\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Intelligent Systems, Modelling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISMS.2010.32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本文对基于mmi监督树的矢量量化器和前馈神经网络在一般音频分割中的应用进行了初步实验。该方法已经过测试,目的是检测声音流中的环境声音和语音。音频流的分割对于成功地定位流中的语音或环境声音以及它们未来可能的分类甚至分离是必要的。该方法是一组协作式无人飞行机器人对真实音频信号分割任务的初步解决方案。该方法在仿真软件NESCUAR 1.0中得到了应用验证。(自然环境模拟器for Cooperative Unmanned Aerial Robots, version 1.0),本文作者开发的仿真软件工具。所提出的方法也可以单独应用;其应用不依赖于仿真软件NESCUAR 1.0。
Application of Feed-Forward Neural Network and MMI-Supervised Vector Quantizer to the Task of Content Based Audio Segmentation by Co-operative Unmanned Flying Robots
This paper deals with the preliminary experiments on general audio segmentation using a MMI-supervised tree-based vector quantizer and feed-forward neural network. This method has been tested with the aim of detection of environmental sounds and speech in a sound stream. The segmentation of an audio stream is needed for successful localization of speech or environmental sounds in a stream and their possible future classification or even separation. This method has been developed as a preliminary solution of the task of real-world audio signal segmentation by a set of co-operative unmanned flying robots. Application of the proposed method has been tested in simulating software NESCUAR 1.0. (Natural Environment Simulator for Cooperative Unmanned Aerial Robots, version 1.0), a simulating software tool developed by the authors of this paper. The presented method can be also applied separately; its application is not dependent on the simulating software NESCUAR 1.0.