{"title":"Exploring the Influence of Noise in Speech Emotion Recognition Devices for Internet of Thing","authors":"Mingke Xu, Fan Zhang, Jiannan Yang, S. Khan","doi":"10.1109/ICEI49372.2020.00031","DOIUrl":null,"url":null,"abstract":"With the development of the Energy Internet (EI), the application of smart grids has expanded from the industrial field to homes and individuals, which effectively promotes the development of home Internet of Things (IoT) devices. The research of the home IoT aims to improve the user experience, and the focus is on the intelligence of the device. The intelligence is inseparable from human-computer interaction (HCI). Speech emotion recognition(SER) uses machines to recognize emotions in human speech, which is an important part of HCI. However, noise usually greatly influences the recognition accuracy in HCI. In this paper, we conduct experiments on 16 types of common noise in the environment. We find that some types of noise influence the recognition effect significantly but some do not. We explain this difference from two perspectives—spectrogram and statistical characteristics so that we will gain more insight as to what type of noise will and/or will not influence the recognition accuracy of a particular SER task.","PeriodicalId":418017,"journal":{"name":"2020 IEEE International Conference on Energy Internet (ICEI)","volume":"233 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Energy Internet (ICEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEI49372.2020.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of the Energy Internet (EI), the application of smart grids has expanded from the industrial field to homes and individuals, which effectively promotes the development of home Internet of Things (IoT) devices. The research of the home IoT aims to improve the user experience, and the focus is on the intelligence of the device. The intelligence is inseparable from human-computer interaction (HCI). Speech emotion recognition(SER) uses machines to recognize emotions in human speech, which is an important part of HCI. However, noise usually greatly influences the recognition accuracy in HCI. In this paper, we conduct experiments on 16 types of common noise in the environment. We find that some types of noise influence the recognition effect significantly but some do not. We explain this difference from two perspectives—spectrogram and statistical characteristics so that we will gain more insight as to what type of noise will and/or will not influence the recognition accuracy of a particular SER task.