{"title":"水声目标降低环境噪声扩散模型","authors":"Yunqi Zhang, Jiansen Hao, Qunfeng Zeng","doi":"10.1016/j.neunet.2025.107470","DOIUrl":null,"url":null,"abstract":"<div><div>The recognition of underwater acoustic targets is a challenging problem, and irregular ambient noise is a key factor limiting the effectiveness of the recognition. Research on diffusion models in the audio field has mainly centered around the human voice, and it may be valuable to apply them to the field of underwater acoustic. In this paper, we propose a general method for reducing ambient noise based on the diffusion model. A Decapitation normalization method is proposed, which balances the data distribution of different frequency scales and unifies the noise addition in the time and frequency domains. Then a Reducing Ambient Noise Diffusion (RAND) Model is proposed based on the diffusion model, which can effectively remove the ambient noise in a small range of steps. Considering that some steps of sampling may have a negative effect, a Three-condition mask method is proposed to make the model more robust during sampling. The effectiveness of the proposed method is verified by experiments in the time and frequency domains.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"188 ","pages":"Article 107470"},"PeriodicalIF":6.0000,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reducing ambient noise diffusion model for underwater acoustic target\",\"authors\":\"Yunqi Zhang, Jiansen Hao, Qunfeng Zeng\",\"doi\":\"10.1016/j.neunet.2025.107470\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The recognition of underwater acoustic targets is a challenging problem, and irregular ambient noise is a key factor limiting the effectiveness of the recognition. Research on diffusion models in the audio field has mainly centered around the human voice, and it may be valuable to apply them to the field of underwater acoustic. In this paper, we propose a general method for reducing ambient noise based on the diffusion model. A Decapitation normalization method is proposed, which balances the data distribution of different frequency scales and unifies the noise addition in the time and frequency domains. Then a Reducing Ambient Noise Diffusion (RAND) Model is proposed based on the diffusion model, which can effectively remove the ambient noise in a small range of steps. Considering that some steps of sampling may have a negative effect, a Three-condition mask method is proposed to make the model more robust during sampling. The effectiveness of the proposed method is verified by experiments in the time and frequency domains.</div></div>\",\"PeriodicalId\":49763,\"journal\":{\"name\":\"Neural Networks\",\"volume\":\"188 \",\"pages\":\"Article 107470\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neural Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0893608025003491\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025003491","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Reducing ambient noise diffusion model for underwater acoustic target
The recognition of underwater acoustic targets is a challenging problem, and irregular ambient noise is a key factor limiting the effectiveness of the recognition. Research on diffusion models in the audio field has mainly centered around the human voice, and it may be valuable to apply them to the field of underwater acoustic. In this paper, we propose a general method for reducing ambient noise based on the diffusion model. A Decapitation normalization method is proposed, which balances the data distribution of different frequency scales and unifies the noise addition in the time and frequency domains. Then a Reducing Ambient Noise Diffusion (RAND) Model is proposed based on the diffusion model, which can effectively remove the ambient noise in a small range of steps. Considering that some steps of sampling may have a negative effect, a Three-condition mask method is proposed to make the model more robust during sampling. The effectiveness of the proposed method is verified by experiments in the time and frequency domains.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.