{"title":"民航语音信息的智能处理:信息识别与说话人情绪状态分析","authors":"N. Andriyanov, D. Andriyanov","doi":"10.1109/SIBCON50419.2021.9438881","DOIUrl":null,"url":null,"abstract":"The text deals with the problems of recognition of speech messages pronounced in the prescribed form in accordance with aviation regulations. In addition, during the analysis of the phrase of an aircraft crew member, a forecast is made about the emotional state of such a person. The basis is the frequency representation of signals associated with the known time Fourier transform. Algorithms based on correlation analysis, neural networks with backpropagation of errors and Gaussian mixture models are compared. The correlation algorithm provides the best results when recognizing a specific phrase pattern (reaching 94% with a signal-to-noise ratio of 2), however, it is inferior to the neural network algorithm and the algorithm based on the Gaussian mixture model when predicting the emotional state of the speaker.","PeriodicalId":150550,"journal":{"name":"2021 International Siberian Conference on Control and Communications (SIBCON)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Intelligent Processing of Voice Messages in Civil Aviation: Message Recognition and the Emotional State of the Speaker Analysis\",\"authors\":\"N. Andriyanov, D. Andriyanov\",\"doi\":\"10.1109/SIBCON50419.2021.9438881\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The text deals with the problems of recognition of speech messages pronounced in the prescribed form in accordance with aviation regulations. In addition, during the analysis of the phrase of an aircraft crew member, a forecast is made about the emotional state of such a person. The basis is the frequency representation of signals associated with the known time Fourier transform. Algorithms based on correlation analysis, neural networks with backpropagation of errors and Gaussian mixture models are compared. The correlation algorithm provides the best results when recognizing a specific phrase pattern (reaching 94% with a signal-to-noise ratio of 2), however, it is inferior to the neural network algorithm and the algorithm based on the Gaussian mixture model when predicting the emotional state of the speaker.\",\"PeriodicalId\":150550,\"journal\":{\"name\":\"2021 International Siberian Conference on Control and Communications (SIBCON)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Siberian Conference on Control and Communications (SIBCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIBCON50419.2021.9438881\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Siberian Conference on Control and Communications (SIBCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIBCON50419.2021.9438881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Intelligent Processing of Voice Messages in Civil Aviation: Message Recognition and the Emotional State of the Speaker Analysis
The text deals with the problems of recognition of speech messages pronounced in the prescribed form in accordance with aviation regulations. In addition, during the analysis of the phrase of an aircraft crew member, a forecast is made about the emotional state of such a person. The basis is the frequency representation of signals associated with the known time Fourier transform. Algorithms based on correlation analysis, neural networks with backpropagation of errors and Gaussian mixture models are compared. The correlation algorithm provides the best results when recognizing a specific phrase pattern (reaching 94% with a signal-to-noise ratio of 2), however, it is inferior to the neural network algorithm and the algorithm based on the Gaussian mixture model when predicting the emotional state of the speaker.