Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations

Proceedings of the 2017 International Conference on Digital Health. Pub Date: 2017-07-02. DOI: 10.1145/3079452.3079492

Jun Deng, N. Cummins, Maximilian Schmitt, Kun Qian, F. Ringeval, Björn Schuller


Citations: 38

Abstract

Machine learning paradigms based on child vocalisations show great promise as an objective marker of developmental disorders such as Autism. In conventional detection systems, hand-crafted acoustic features are usually fed into a discriminative classifier (e.g., Support Vector Machines); however, it is well known that the accuracy and robustness of such a system are limited by the size of the associated training data. This paper explores, for the first time, the use of feature representations learnt using a deep Generative Adversarial Network (GAN) for classifying children's speech affected by developmental disorders. A comparative evaluation of our proposed system with different acoustic feature sets is performed on the Child Pathological and Emotional Speech database. Key experimental results presented demonstrate that GAN-based methods exhibit competitive performance with the conventional paradigms in terms of the unweighted average recall metric.
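The abstract reports results as unweighted average recall (UAR), which is the mean of the per-class recalls, so every class counts equally regardless of how many samples it has. The following is a minimal sketch of that metric only, not the authors' evaluation code; the function name, the class labels, and the toy predictions are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls (each class weighted equally)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    # Recall is computed per class, then averaged without class-size weights.
    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced toy labels (not data from the paper).
y_true = ["typical"] * 8 + ["atypical"] * 2
y_pred = ["typical"] * 8 + ["typical", "atypical"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.2f}, accuracy = {accuracy:.2f}")
```

On this toy data the majority class is recalled perfectly and the minority class only half the time, so UAR is lower than plain accuracy. That gap is why UAR is often preferred when classes are imbalanced.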

