Erwann Betton-Ployon, Abbes Kacem, Jérôme Mars, Nadine Martin
{"title":"Robust automatic train pass-by detection combining deep learning and sound level analysis.","authors":"Erwann Betton-Ployon, Abbes Kacem, Jérôme Mars, Nadine Martin","doi":"10.1121/10.0036754","DOIUrl":null,"url":null,"abstract":"<p><p>The increasing needs for controlling high noise levels motivate development of automatic sound event detection and classification methods. Little work deals with automatic train pass-by detection despite a high degree of annoyance. To this matter, an innovative approach is proposed in this paper. A generic classifier identifies vehicle noise on the raw audio signal. Then, combined short sound level analysis and mel-spectrogram-based classification refine this outcome to discard anything but train pass-bys. On various long-term signals, a 90% temporal overlap with reference demarcation is observed. This high detection rate allows a proper railway noise contribution estimation in different soundscapes.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 5","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0036754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing needs for controlling high noise levels motivate development of automatic sound event detection and classification methods. Little work deals with automatic train pass-by detection despite a high degree of annoyance. To this matter, an innovative approach is proposed in this paper. A generic classifier identifies vehicle noise on the raw audio signal. Then, combined short sound level analysis and mel-spectrogram-based classification refine this outcome to discard anything but train pass-bys. On various long-term signals, a 90% temporal overlap with reference demarcation is observed. This high detection rate allows a proper railway noise contribution estimation in different soundscapes.