{"title":"Research on Urban Audio Classification Based on Residual Neural Network","authors":"Duling Xv, Li Yang","doi":"10.1109/ICCEA53728.2021.00047","DOIUrl":null,"url":null,"abstract":"In recent years, audio classification has been extensively studied, and the classification of urban sounds has great application requirements in criminal investigation and environmental protection. In this paper, a multi-feature hybrid description method is used to classify target city sounds with a multi-layer residual network structure. Firstly, a plurality of feature extraction results were compared with a conventional single feature. Secondly, different network models are studied, and their performance under different characteristics is tested and compared. Finally, comparing Resnet and multi-layer perceptrons, it is found that the Resnet50v2 method under mixed features has a better classification effect on the Ubansound8k data set, reaching 90.7%.","PeriodicalId":325790,"journal":{"name":"2021 International Conference on Computer Engineering and Application (ICCEA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Application (ICCEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEA53728.2021.00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, audio classification has been extensively studied, and the classification of urban sounds has great application requirements in criminal investigation and environmental protection. In this paper, a multi-feature hybrid description method is used to classify target city sounds with a multi-layer residual network structure. Firstly, a plurality of feature extraction results were compared with a conventional single feature. Secondly, different network models are studied, and their performance under different characteristics is tested and compared. Finally, comparing Resnet and multi-layer perceptrons, it is found that the Resnet50v2 method under mixed features has a better classification effect on the Ubansound8k data set, reaching 90.7%.