Low-power Spiking Neural Network audio source localisation using a Hilbert Transform audio event encoding scheme.

Communications engineering Pub Date : 2025-02-11 DOI:10.1038/s44172-025-00359-9

Saeid Haghighatshoar, Dylan Richard Muir

{"title":"Low-power Spiking Neural Network audio source localisation using a Hilbert Transform audio event encoding scheme.","authors":"Saeid Haghighatshoar, Dylan Richard Muir","doi":"10.1038/s44172-025-00359-9","DOIUrl":null,"url":null,"abstract":"<p><p>Sound source localisation is used in many consumer devices, to isolate audio from individual speakers and reject noise. Localization is frequently accomplished by \"beamforming\", which combines phase-shifted audio streams to increase power from chosen source directions, under a known microphone array geometry. Dense band-pass filters are often needed to obtain narrowband signal components from wideband audio. These approaches achieve high accuracy, but narrowband beamforming is computationally demanding, and not ideal for low-power IoT devices. We introduce a method for sound source localisation on arbitrary microphone arrays, designed for efficient implementation in ultra-low-power spiking neural networks (SNNs). We use a Hilbert transform to avoid dense band-pass filters, and introduce an event-based encoding method that captures the phase of the complex analytic signal. Our approach achieves high accuracy for SNN methods, comparable with traditional non-SNN super-resolution beamforming. We deploy our method to low-power SNN inference hardware, with much lower power consumption than super-resolution methods. We demonstrate that signal processing approaches co-designed with spiking neural network implementations can achieve much improved power efficiency. Our Hilbert-transform-based method for beamforming can also improve the efficiency of traditional digital signal processing.</p>","PeriodicalId":72644,"journal":{"name":"Communications engineering","volume":"4 1","pages":"18"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11814312/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s44172-025-00359-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Sound source localisation is used in many consumer devices, to isolate audio from individual speakers and reject noise. Localization is frequently accomplished by "beamforming", which combines phase-shifted audio streams to increase power from chosen source directions, under a known microphone array geometry. Dense band-pass filters are often needed to obtain narrowband signal components from wideband audio. These approaches achieve high accuracy, but narrowband beamforming is computationally demanding, and not ideal for low-power IoT devices. We introduce a method for sound source localisation on arbitrary microphone arrays, designed for efficient implementation in ultra-low-power spiking neural networks (SNNs). We use a Hilbert transform to avoid dense band-pass filters, and introduce an event-based encoding method that captures the phase of the complex analytic signal. Our approach achieves high accuracy for SNN methods, comparable with traditional non-SNN super-resolution beamforming. We deploy our method to low-power SNN inference hardware, with much lower power consumption than super-resolution methods. We demonstrate that signal processing approaches co-designed with spiking neural network implementations can achieve much improved power efficiency. Our Hilbert-transform-based method for beamforming can also improve the efficiency of traditional digital signal processing.

查看原文本刊更多论文

使用希尔伯特变换音频事件编码方案的低功耗尖峰神经网络音频源定位。

声源定位在许多消费设备中使用，以隔离单个扬声器的音频并排除噪声。定位通常通过“波束成形”来完成，它结合相移音频流来增加来自选定源方向的功率，在已知的麦克风阵列几何形状下。为了从宽带音频中获得窄带信号分量，通常需要密集的带通滤波器。这些方法实现了高精度，但窄带波束成形在计算上要求很高，不适合低功耗物联网设备。我们介绍了一种在任意麦克风阵列上进行声源定位的方法，该方法旨在有效地实现超低功耗尖峰神经网络（snn）。我们使用希尔伯特变换来避免密集的带通滤波器，并引入了一种基于事件的编码方法来捕获复分析信号的相位。该方法实现了与传统的非SNN超分辨率波束形成方法相当的高精度。我们将我们的方法部署到低功耗SNN推理硬件上，其功耗比超分辨率方法低得多。我们证明了与尖峰神经网络实现共同设计的信号处理方法可以大大提高功率效率。基于希尔伯特变换的波束形成方法也可以提高传统数字信号处理的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Communications engineering

自引率

0.00%

发文量