基于神经网络的海声信号分类框架

A. Maccato, R.J.P. de Figueii-edo
{"title":"基于神经网络的海声信号分类框架","authors":"A. Maccato, R.J.P. de Figueii-edo","doi":"10.1109/OCEANS.1989.587491","DOIUrl":null,"url":null,"abstract":"This paper presents a new framework for intelligent acoustic signal processing by artificial neural networks. Problems addressed are the detection, classification, and estimation of signal parameters. The methodology consists of decomposing the above tasks into two stages. First, a highly structured, hierarchical/symbolic representation of the data is created using scale space algorithms. This calculation overcomes moderate noise and warping distortion present in the acoustic recording, and at the same time reduces the data to be processed. Second, neural network architectures are applied to the resulting symbolic structures to obtain the desired signal parameters. The use of neural network techniques allows training to be used in cases where the signals of interest are not easily characterized. Illustrations using simulated and real data will be presented. INTRODUCTION Artificial neural networks provide a new computational paradigm for solving a large class of signal recognition problems. In this paradigm, collections of elementary units called neurons work in parallel to perform a desired computational task. Each neuron performs a component of the overall calculation, and communicates its result to the other units in the network via neurosynaptic interconnections. The distinguishing characteristic of artificial neural networks, with respect to classical methods of computing, is that they can learn to perform a required calculation through training. Hence, rather than designing a procedure for computing the solution to a problem, a selection is made of a training set of exemplary input/output pairs. This is a very powerful property in problem instances for which the application data space is not characterized sufficiently well to allow explicit programming. This is often the case with oceanic acoustic signals, such as short duration transient signals resulting from spurious mechanical events in a vessel. Signal distortion caused by the water medium (amplitude and time warping effects), noise contamination, imprecise or unknown time of occurrence, and high nonstationarity are all difficulties to be confronted when trying to detect a signal, classify it, or estimate meaningful parameters of its source. Additionally, any viable algorithm for recognizing acoustic signals in real-time must possess a fast implementation and, in the case of neural nets, a relatively short training time. To this end, we propose to decompose the recognition task into two stages. The acoustic signal is first preprocessed by a numerical transformation that partially removes noise and distortion effects present in the raw data. In the second stage the resulting information is fed to a neural network for final recognition. The overall method benefits from the filtering characteristics (and possibly data reduction properties) of the first stage, and also from the learning ability of the second stage. OUTLINE O F T H E ALGORITHM To illustrate this concept, we have selected a scale space transformation [l] to preprocess the incoming acoustic signals, and a feedforward, graded response neural net, with the error backpropagation training algorithm [2], to complete the recognition task. The system operates by first obtaining the waveform of the recorded acoustic signal. It then applies to it a scale space transformation, which, as described in the following section, is a time domain algorithm which maps waveforms into surfaces. Elements of the surface topology are consequently captured by a hierarchic/symbolic tree data structure. This step reduces the amount of data to be processed by extracting and ordering the relevant surface features. Finally, the tree is interpreted as a pattern and given as input to the neural network. The neural net, in turn, responds with a code identifying the class to which the tree was assigned, hence identifying the original acoustic signal. Figure 1 is a diagram of the overall process. In the following sections we describe in greater detail the three stages of processing and provide some example applications. SCALE SPACE SIGNAL TRANSFORMATION In general, scale space is a linear transformation of R\" into R\" x R+. For n = 1, it maps waveforms z ( t ) defined on the real line into surfaces (b(t,u) defined on the semi-infinite plane. The mapping takes the form of a convolution integral. The scale parameter, U , determines the 'size' of the smallest allowed intensity features in the filtered signal. That is, if we interpret U as a fixed parameter, say uo, then the filtered waveform zgO(t ) = (b(t,a,) is a smoothed version of the original signal containing only those intensity features larger in size than U,, (in the sense that smaller scale features are greatly attenuated). Figure 2 is an example surface generated by the scale space computation. As an aside, we note that the gaussian kernel in the integral can be replaced by any of its derivatives, allowing 'higher order' surfaces to be generated. The natural hierarchy of inclusion of intensity features, according to size and temporal location, is made explicit by the topology of the level curves of the surface 4, as illustrated in Figure 3. For any fixed ao, these curves are defined by the restriction +(t,a) = a,. The resulting contours partition the semi-infinite surface into well defined regions, amenable to representation by symbolic tree structures. This is ensured by the nature of the transformation and the use of gaussian kernels [3]. Several attributes can be associated with these regions to help characterize the waveform features they represent. Among these are the maximum height (or depth) of the surface within each region, the maximum extent of the region in the time or scale dimensions, the region’s area, and so forth. The scale space transformation is such that gradual signal distortions due to both noise and time warping result in gradual distortions of the surface. Consequently, the contour topology can absorb moderate signal degradation, and provide the next stage of processing with a data structure representative of the underlying signal.","PeriodicalId":331017,"journal":{"name":"Proceedings OCEANS","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1989-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A Neural Network Based Framework For Classification Of Oceanic Acoustic Signals\",\"authors\":\"A. Maccato, R.J.P. de Figueii-edo\",\"doi\":\"10.1109/OCEANS.1989.587491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new framework for intelligent acoustic signal processing by artificial neural networks. Problems addressed are the detection, classification, and estimation of signal parameters. The methodology consists of decomposing the above tasks into two stages. First, a highly structured, hierarchical/symbolic representation of the data is created using scale space algorithms. This calculation overcomes moderate noise and warping distortion present in the acoustic recording, and at the same time reduces the data to be processed. Second, neural network architectures are applied to the resulting symbolic structures to obtain the desired signal parameters. The use of neural network techniques allows training to be used in cases where the signals of interest are not easily characterized. Illustrations using simulated and real data will be presented. INTRODUCTION Artificial neural networks provide a new computational paradigm for solving a large class of signal recognition problems. In this paradigm, collections of elementary units called neurons work in parallel to perform a desired computational task. Each neuron performs a component of the overall calculation, and communicates its result to the other units in the network via neurosynaptic interconnections. The distinguishing characteristic of artificial neural networks, with respect to classical methods of computing, is that they can learn to perform a required calculation through training. Hence, rather than designing a procedure for computing the solution to a problem, a selection is made of a training set of exemplary input/output pairs. This is a very powerful property in problem instances for which the application data space is not characterized sufficiently well to allow explicit programming. This is often the case with oceanic acoustic signals, such as short duration transient signals resulting from spurious mechanical events in a vessel. Signal distortion caused by the water medium (amplitude and time warping effects), noise contamination, imprecise or unknown time of occurrence, and high nonstationarity are all difficulties to be confronted when trying to detect a signal, classify it, or estimate meaningful parameters of its source. Additionally, any viable algorithm for recognizing acoustic signals in real-time must possess a fast implementation and, in the case of neural nets, a relatively short training time. To this end, we propose to decompose the recognition task into two stages. The acoustic signal is first preprocessed by a numerical transformation that partially removes noise and distortion effects present in the raw data. In the second stage the resulting information is fed to a neural network for final recognition. The overall method benefits from the filtering characteristics (and possibly data reduction properties) of the first stage, and also from the learning ability of the second stage. OUTLINE O F T H E ALGORITHM To illustrate this concept, we have selected a scale space transformation [l] to preprocess the incoming acoustic signals, and a feedforward, graded response neural net, with the error backpropagation training algorithm [2], to complete the recognition task. The system operates by first obtaining the waveform of the recorded acoustic signal. It then applies to it a scale space transformation, which, as described in the following section, is a time domain algorithm which maps waveforms into surfaces. Elements of the surface topology are consequently captured by a hierarchic/symbolic tree data structure. This step reduces the amount of data to be processed by extracting and ordering the relevant surface features. Finally, the tree is interpreted as a pattern and given as input to the neural network. The neural net, in turn, responds with a code identifying the class to which the tree was assigned, hence identifying the original acoustic signal. Figure 1 is a diagram of the overall process. In the following sections we describe in greater detail the three stages of processing and provide some example applications. SCALE SPACE SIGNAL TRANSFORMATION In general, scale space is a linear transformation of R\\\" into R\\\" x R+. For n = 1, it maps waveforms z ( t ) defined on the real line into surfaces (b(t,u) defined on the semi-infinite plane. The mapping takes the form of a convolution integral. The scale parameter, U , determines the 'size' of the smallest allowed intensity features in the filtered signal. That is, if we interpret U as a fixed parameter, say uo, then the filtered waveform zgO(t ) = (b(t,a,) is a smoothed version of the original signal containing only those intensity features larger in size than U,, (in the sense that smaller scale features are greatly attenuated). Figure 2 is an example surface generated by the scale space computation. As an aside, we note that the gaussian kernel in the integral can be replaced by any of its derivatives, allowing 'higher order' surfaces to be generated. The natural hierarchy of inclusion of intensity features, according to size and temporal location, is made explicit by the topology of the level curves of the surface 4, as illustrated in Figure 3. For any fixed ao, these curves are defined by the restriction +(t,a) = a,. The resulting contours partition the semi-infinite surface into well defined regions, amenable to representation by symbolic tree structures. This is ensured by the nature of the transformation and the use of gaussian kernels [3]. Several attributes can be associated with these regions to help characterize the waveform features they represent. Among these are the maximum height (or depth) of the surface within each region, the maximum extent of the region in the time or scale dimensions, the region’s area, and so forth. The scale space transformation is such that gradual signal distortions due to both noise and time warping result in gradual distortions of the surface. Consequently, the contour topology can absorb moderate signal degradation, and provide the next stage of processing with a data structure representative of the underlying signal.\",\"PeriodicalId\":331017,\"journal\":{\"name\":\"Proceedings OCEANS\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1989-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings OCEANS\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/OCEANS.1989.587491\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings OCEANS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/OCEANS.1989.587491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

本文提出了一种基于人工神经网络的智能声信号处理新框架。解决的问题是信号参数的检测、分类和估计。该方法包括将上述任务分解为两个阶段。首先,使用尺度空间算法创建数据的高度结构化、分层/符号表示。这种计算克服了声学记录中存在的中度噪声和翘曲失真,同时减少了要处理的数据。其次,将神经网络架构应用于生成的符号结构以获得所需的信号参数。神经网络技术的使用允许在不容易表征感兴趣的信号的情况下使用训练。使用模拟和真实数据的插图将被呈现。人工神经网络为解决大量信号识别问题提供了一种新的计算范式。在这个范例中,称为神经元的基本单元集合并行工作以执行期望的计算任务。每个神经元执行整体计算的一个组成部分,并通过神经突触互连将其结果传递给网络中的其他单元。相对于经典的计算方法,人工神经网络的显著特征是,它们可以通过训练来学习执行所需的计算。因此,不是设计一个程序来计算问题的解决方案,而是从典型的输入/输出对的训练集中进行选择。在应用程序数据空间没有得到充分表征而无法进行显式编程的问题实例中,这是一个非常强大的属性。这通常是海洋声信号的情况,例如由船舶中虚假的机械事件引起的短时间瞬态信号。水介质引起的信号失真(振幅和时间扭曲效应)、噪声污染、发生时间不精确或未知以及高度非平稳性都是检测信号、对信号进行分类或估计信号源有意义参数时所面临的困难。此外,任何可行的实时识别声信号的算法都必须具有快速的实现,并且在神经网络的情况下,训练时间相对较短。为此,我们建议将识别任务分解为两个阶段。声学信号首先通过数字变换进行预处理,该变换部分去除原始数据中存在的噪声和失真效应。在第二阶段,结果信息被馈送到神经网络进行最终识别。整个方法受益于第一阶段的过滤特性(可能还有数据约简特性),也受益于第二阶段的学习能力。为了说明这一概念,我们选择了尺度空间变换[1]对传入的声信号进行预处理,并使用误差反向传播训练算法[2]的前馈、梯度响应神经网络来完成识别任务。该系统通过首先获得所记录的声信号的波形来操作。然后对其应用尺度空间变换,如下面一节所述,这是一种将波形映射到曲面的时域算法。因此,表面拓扑的元素由层次/符号树数据结构捕获。这一步通过提取和排序相关的表面特征来减少需要处理的数据量。最后,树被解释为一个模式,并作为神经网络的输入。神经网络,反过来,用一个代码来识别树被分配到的类别,从而识别原始的声音信号。图1是整个过程的示意图。在下面几节中,我们将更详细地描述处理的三个阶段,并提供一些示例应用程序。尺度空间信号变换通常,尺度空间是R '到R ' x R+的线性变换。当n = 1时,它将实线上定义的波形z (t)映射到半无限平面上定义的曲面(b(t,u)上。映射的形式是卷积积分。尺度参数U决定了滤波信号中最小允许强度特征的“大小”。也就是说,如果我们将U解释为一个固定的参数,例如uo,那么滤波后的波形zgO(t) = (b(t,a,)是原始信号的平滑版本,只包含那些尺寸大于U的强度特征,,(从某种意义上说,较小尺度的特征被大大衰减)。图2是通过尺度空间计算生成的曲面示例。作为题外话,我们注意到积分中的高斯核可以被它的任何导数所取代,从而允许生成“高阶”曲面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Neural Network Based Framework For Classification Of Oceanic Acoustic Signals
This paper presents a new framework for intelligent acoustic signal processing by artificial neural networks. Problems addressed are the detection, classification, and estimation of signal parameters. The methodology consists of decomposing the above tasks into two stages. First, a highly structured, hierarchical/symbolic representation of the data is created using scale space algorithms. This calculation overcomes moderate noise and warping distortion present in the acoustic recording, and at the same time reduces the data to be processed. Second, neural network architectures are applied to the resulting symbolic structures to obtain the desired signal parameters. The use of neural network techniques allows training to be used in cases where the signals of interest are not easily characterized. Illustrations using simulated and real data will be presented. INTRODUCTION Artificial neural networks provide a new computational paradigm for solving a large class of signal recognition problems. In this paradigm, collections of elementary units called neurons work in parallel to perform a desired computational task. Each neuron performs a component of the overall calculation, and communicates its result to the other units in the network via neurosynaptic interconnections. The distinguishing characteristic of artificial neural networks, with respect to classical methods of computing, is that they can learn to perform a required calculation through training. Hence, rather than designing a procedure for computing the solution to a problem, a selection is made of a training set of exemplary input/output pairs. This is a very powerful property in problem instances for which the application data space is not characterized sufficiently well to allow explicit programming. This is often the case with oceanic acoustic signals, such as short duration transient signals resulting from spurious mechanical events in a vessel. Signal distortion caused by the water medium (amplitude and time warping effects), noise contamination, imprecise or unknown time of occurrence, and high nonstationarity are all difficulties to be confronted when trying to detect a signal, classify it, or estimate meaningful parameters of its source. Additionally, any viable algorithm for recognizing acoustic signals in real-time must possess a fast implementation and, in the case of neural nets, a relatively short training time. To this end, we propose to decompose the recognition task into two stages. The acoustic signal is first preprocessed by a numerical transformation that partially removes noise and distortion effects present in the raw data. In the second stage the resulting information is fed to a neural network for final recognition. The overall method benefits from the filtering characteristics (and possibly data reduction properties) of the first stage, and also from the learning ability of the second stage. OUTLINE O F T H E ALGORITHM To illustrate this concept, we have selected a scale space transformation [l] to preprocess the incoming acoustic signals, and a feedforward, graded response neural net, with the error backpropagation training algorithm [2], to complete the recognition task. The system operates by first obtaining the waveform of the recorded acoustic signal. It then applies to it a scale space transformation, which, as described in the following section, is a time domain algorithm which maps waveforms into surfaces. Elements of the surface topology are consequently captured by a hierarchic/symbolic tree data structure. This step reduces the amount of data to be processed by extracting and ordering the relevant surface features. Finally, the tree is interpreted as a pattern and given as input to the neural network. The neural net, in turn, responds with a code identifying the class to which the tree was assigned, hence identifying the original acoustic signal. Figure 1 is a diagram of the overall process. In the following sections we describe in greater detail the three stages of processing and provide some example applications. SCALE SPACE SIGNAL TRANSFORMATION In general, scale space is a linear transformation of R" into R" x R+. For n = 1, it maps waveforms z ( t ) defined on the real line into surfaces (b(t,u) defined on the semi-infinite plane. The mapping takes the form of a convolution integral. The scale parameter, U , determines the 'size' of the smallest allowed intensity features in the filtered signal. That is, if we interpret U as a fixed parameter, say uo, then the filtered waveform zgO(t ) = (b(t,a,) is a smoothed version of the original signal containing only those intensity features larger in size than U,, (in the sense that smaller scale features are greatly attenuated). Figure 2 is an example surface generated by the scale space computation. As an aside, we note that the gaussian kernel in the integral can be replaced by any of its derivatives, allowing 'higher order' surfaces to be generated. The natural hierarchy of inclusion of intensity features, according to size and temporal location, is made explicit by the topology of the level curves of the surface 4, as illustrated in Figure 3. For any fixed ao, these curves are defined by the restriction +(t,a) = a,. The resulting contours partition the semi-infinite surface into well defined regions, amenable to representation by symbolic tree structures. This is ensured by the nature of the transformation and the use of gaussian kernels [3]. Several attributes can be associated with these regions to help characterize the waveform features they represent. Among these are the maximum height (or depth) of the surface within each region, the maximum extent of the region in the time or scale dimensions, the region’s area, and so forth. The scale space transformation is such that gradual signal distortions due to both noise and time warping result in gradual distortions of the surface. Consequently, the contour topology can absorb moderate signal degradation, and provide the next stage of processing with a data structure representative of the underlying signal.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信