{"title":"说话人识别:一种减少呼号混淆事件的方法","authors":"Sara Sekkate, Mohammed Khalil, A. Adib","doi":"10.1109/ATSIP.2017.8075593","DOIUrl":null,"url":null,"abstract":"This paper examines the development of a speaker identification system (SIS) for future aeronautical communication systems. SIS promises to improve flight safety by reducing the incidence of call-sign confusion events. However, the practical development of such a system faces many challenges, especially related to the signal corruption by the channel noise. Due to the dynamic motion of aircraft, the aeronautical channel experiences high Doppler shifts and fading due to multipath propagation. This means that the SIS is required to be robust against such perturbations. In the proposed system, aeronautical channel noise was generated and mixed with speech signals to get the testing data. Four spectral features including Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP), Mel Frequency Cepstral Coefficient (MFCC) and Gammatone Frequency Cepstral Coefficient (GFCC) were extracted and then Support Vector Machines (SVM) were used for classification. The performance of the system was evaluated using noiseless and noisy signals from the ATCOSIM speech corpus. The experimental results show that the better recognition rate is obtained for GFCC under noisy conditions as compared to PLP, LPCC and MFCC.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Speaker identification: A way to reduce call-sign confusion events\",\"authors\":\"Sara Sekkate, Mohammed Khalil, A. Adib\",\"doi\":\"10.1109/ATSIP.2017.8075593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper examines the development of a speaker identification system (SIS) for future aeronautical communication systems. SIS promises to improve flight safety by reducing the incidence of call-sign confusion events. However, the practical development of such a system faces many challenges, especially related to the signal corruption by the channel noise. Due to the dynamic motion of aircraft, the aeronautical channel experiences high Doppler shifts and fading due to multipath propagation. This means that the SIS is required to be robust against such perturbations. In the proposed system, aeronautical channel noise was generated and mixed with speech signals to get the testing data. Four spectral features including Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP), Mel Frequency Cepstral Coefficient (MFCC) and Gammatone Frequency Cepstral Coefficient (GFCC) were extracted and then Support Vector Machines (SVM) were used for classification. The performance of the system was evaluated using noiseless and noisy signals from the ATCOSIM speech corpus. The experimental results show that the better recognition rate is obtained for GFCC under noisy conditions as compared to PLP, LPCC and MFCC.\",\"PeriodicalId\":259951,\"journal\":{\"name\":\"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ATSIP.2017.8075593\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP.2017.8075593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speaker identification: A way to reduce call-sign confusion events
This paper examines the development of a speaker identification system (SIS) for future aeronautical communication systems. SIS promises to improve flight safety by reducing the incidence of call-sign confusion events. However, the practical development of such a system faces many challenges, especially related to the signal corruption by the channel noise. Due to the dynamic motion of aircraft, the aeronautical channel experiences high Doppler shifts and fading due to multipath propagation. This means that the SIS is required to be robust against such perturbations. In the proposed system, aeronautical channel noise was generated and mixed with speech signals to get the testing data. Four spectral features including Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP), Mel Frequency Cepstral Coefficient (MFCC) and Gammatone Frequency Cepstral Coefficient (GFCC) were extracted and then Support Vector Machines (SVM) were used for classification. The performance of the system was evaluated using noiseless and noisy signals from the ATCOSIM speech corpus. The experimental results show that the better recognition rate is obtained for GFCC under noisy conditions as compared to PLP, LPCC and MFCC.