{"title":"基于激励源信息的语音信号分析","authors":"Shreya R. Garipalli, B. Sathe-Pathak, A. Panat","doi":"10.1109/ICMETE.2016.12","DOIUrl":null,"url":null,"abstract":"Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).","PeriodicalId":167368,"journal":{"name":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis of Speech Signals Using Excitation Source Information\",\"authors\":\"Shreya R. Garipalli, B. Sathe-Pathak, A. Panat\",\"doi\":\"10.1109/ICMETE.2016.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).\",\"PeriodicalId\":167368,\"journal\":{\"name\":\"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMETE.2016.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMETE.2016.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of Speech Signals Using Excitation Source Information
Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).