{"title":"多模态凹陷检测与估计","authors":"Le Yang","doi":"10.1109/ACIIW.2019.8925288","DOIUrl":null,"url":null,"abstract":"Depression and anxiety disorders are critical problems in modern society. The WHO studies suggest that roughly 12.8 percent of the world's population are suffering from a depressive disorder. In this work, we propose several novel approaches towards multi-modal depression detection and estimation. Our previous studies mainly explored the multi-modal features and multi-modal fusion strategies, experimental results showed that the proposed hybrid depression classification and estimation multi-modal fusion framework obtains promising performance. The current work contains two parts: 1) In order to mitigate the impact of lack of data on training depression deep models, we utilize Generative Adversarial Network (GAN) to augment depression audio features, so as to improve depression severity estimation performance. 2) We propose a novel FACS3D-Net to integrate $3D$ and $2D$ convolution network for facial Action Unit (AU) detection. As far as we know, this is the first work to apply $3D$ CNN to the problem of AU detection. Our future work will 1) focus on combining depression estimation with dimensional affective analysis through the proposed FACS3D-Net, and 2) collect Chinese depression database. When completed, these studies will compose the author's dissertation.","PeriodicalId":193568,"journal":{"name":"2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Multi-Modal Depression Detection and Estimation\",\"authors\":\"Le Yang\",\"doi\":\"10.1109/ACIIW.2019.8925288\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Depression and anxiety disorders are critical problems in modern society. The WHO studies suggest that roughly 12.8 percent of the world's population are suffering from a depressive disorder. In this work, we propose several novel approaches towards multi-modal depression detection and estimation. Our previous studies mainly explored the multi-modal features and multi-modal fusion strategies, experimental results showed that the proposed hybrid depression classification and estimation multi-modal fusion framework obtains promising performance. The current work contains two parts: 1) In order to mitigate the impact of lack of data on training depression deep models, we utilize Generative Adversarial Network (GAN) to augment depression audio features, so as to improve depression severity estimation performance. 2) We propose a novel FACS3D-Net to integrate $3D$ and $2D$ convolution network for facial Action Unit (AU) detection. As far as we know, this is the first work to apply $3D$ CNN to the problem of AU detection. Our future work will 1) focus on combining depression estimation with dimensional affective analysis through the proposed FACS3D-Net, and 2) collect Chinese depression database. When completed, these studies will compose the author's dissertation.\",\"PeriodicalId\":193568,\"journal\":{\"name\":\"2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACIIW.2019.8925288\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIIW.2019.8925288","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Depression and anxiety disorders are critical problems in modern society. The WHO studies suggest that roughly 12.8 percent of the world's population are suffering from a depressive disorder. In this work, we propose several novel approaches towards multi-modal depression detection and estimation. Our previous studies mainly explored the multi-modal features and multi-modal fusion strategies, experimental results showed that the proposed hybrid depression classification and estimation multi-modal fusion framework obtains promising performance. The current work contains two parts: 1) In order to mitigate the impact of lack of data on training depression deep models, we utilize Generative Adversarial Network (GAN) to augment depression audio features, so as to improve depression severity estimation performance. 2) We propose a novel FACS3D-Net to integrate $3D$ and $2D$ convolution network for facial Action Unit (AU) detection. As far as we know, this is the first work to apply $3D$ CNN to the problem of AU detection. Our future work will 1) focus on combining depression estimation with dimensional affective analysis through the proposed FACS3D-Net, and 2) collect Chinese depression database. When completed, these studies will compose the author's dissertation.