{"title":"基于残差神经网络的城市音频分类研究","authors":"Duling Xv, Li Yang","doi":"10.1109/ICCEA53728.2021.00047","DOIUrl":null,"url":null,"abstract":"In recent years, audio classification has been extensively studied, and the classification of urban sounds has great application requirements in criminal investigation and environmental protection. In this paper, a multi-feature hybrid description method is used to classify target city sounds with a multi-layer residual network structure. Firstly, a plurality of feature extraction results were compared with a conventional single feature. Secondly, different network models are studied, and their performance under different characteristics is tested and compared. Finally, comparing Resnet and multi-layer perceptrons, it is found that the Resnet50v2 method under mixed features has a better classification effect on the Ubansound8k data set, reaching 90.7%.","PeriodicalId":325790,"journal":{"name":"2021 International Conference on Computer Engineering and Application (ICCEA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Urban Audio Classification Based on Residual Neural Network\",\"authors\":\"Duling Xv, Li Yang\",\"doi\":\"10.1109/ICCEA53728.2021.00047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, audio classification has been extensively studied, and the classification of urban sounds has great application requirements in criminal investigation and environmental protection. In this paper, a multi-feature hybrid description method is used to classify target city sounds with a multi-layer residual network structure. Firstly, a plurality of feature extraction results were compared with a conventional single feature. Secondly, different network models are studied, and their performance under different characteristics is tested and compared. Finally, comparing Resnet and multi-layer perceptrons, it is found that the Resnet50v2 method under mixed features has a better classification effect on the Ubansound8k data set, reaching 90.7%.\",\"PeriodicalId\":325790,\"journal\":{\"name\":\"2021 International Conference on Computer Engineering and Application (ICCEA)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computer Engineering and Application (ICCEA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCEA53728.2021.00047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Application (ICCEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEA53728.2021.00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Urban Audio Classification Based on Residual Neural Network
In recent years, audio classification has been extensively studied, and the classification of urban sounds has great application requirements in criminal investigation and environmental protection. In this paper, a multi-feature hybrid description method is used to classify target city sounds with a multi-layer residual network structure. Firstly, a plurality of feature extraction results were compared with a conventional single feature. Secondly, different network models are studied, and their performance under different characteristics is tested and compared. Finally, comparing Resnet and multi-layer perceptrons, it is found that the Resnet50v2 method under mixed features has a better classification effect on the Ubansound8k data set, reaching 90.7%.