{"title":"基于感知特征的男女语音分类","authors":"Saptarshi Sengupta, Ghazaala Yasmin, Arijit Ghosal","doi":"10.1109/ICCCNT.2017.8204065","DOIUrl":null,"url":null,"abstract":"Gender identification systems nowadays, are gaining momentum in terms of popularity because of their wide areas of application. They can be used in a variety of fields ranging from security and authentication services to content based information retrieval and also criminal investigations. Gender detection has started to gain importance because of the fact that recent studies conducted showed that the performance of gender dependent speech recognition models performs much better than gender independent models. In the proposed work, we aim to build such a system involving perceptual audio features such as pitch and tempo based features, short time energy etc., which are used to train classifiers to differentiate between the two classes of gender. We have selected such a combination of features as because previous works focused only on either pitch approach, MFCC approach etc., whereas our work is perhaps one of the first involving a combination of several such perceptual features. The system was tested on a wide range of speech files and was shown to be yielding promising results.","PeriodicalId":6581,"journal":{"name":"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)","volume":"5 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Classification of male and female speech using perceptual features\",\"authors\":\"Saptarshi Sengupta, Ghazaala Yasmin, Arijit Ghosal\",\"doi\":\"10.1109/ICCCNT.2017.8204065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gender identification systems nowadays, are gaining momentum in terms of popularity because of their wide areas of application. They can be used in a variety of fields ranging from security and authentication services to content based information retrieval and also criminal investigations. Gender detection has started to gain importance because of the fact that recent studies conducted showed that the performance of gender dependent speech recognition models performs much better than gender independent models. In the proposed work, we aim to build such a system involving perceptual audio features such as pitch and tempo based features, short time energy etc., which are used to train classifiers to differentiate between the two classes of gender. We have selected such a combination of features as because previous works focused only on either pitch approach, MFCC approach etc., whereas our work is perhaps one of the first involving a combination of several such perceptual features. The system was tested on a wide range of speech files and was shown to be yielding promising results.\",\"PeriodicalId\":6581,\"journal\":{\"name\":\"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)\",\"volume\":\"5 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCNT.2017.8204065\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCNT.2017.8204065","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of male and female speech using perceptual features
Gender identification systems nowadays, are gaining momentum in terms of popularity because of their wide areas of application. They can be used in a variety of fields ranging from security and authentication services to content based information retrieval and also criminal investigations. Gender detection has started to gain importance because of the fact that recent studies conducted showed that the performance of gender dependent speech recognition models performs much better than gender independent models. In the proposed work, we aim to build such a system involving perceptual audio features such as pitch and tempo based features, short time energy etc., which are used to train classifiers to differentiate between the two classes of gender. We have selected such a combination of features as because previous works focused only on either pitch approach, MFCC approach etc., whereas our work is perhaps one of the first involving a combination of several such perceptual features. The system was tested on a wide range of speech files and was shown to be yielding promising results.