Oversampling Facial Motion Features Using the Variational Autoencoder to Estimate Oro-facial Dysfunction Severity

Trassandra Jewelle Ipapo, Charlize Del Rosario, R. Alampay, P. Abu

2023 International Conference on Computer Graphics and Image Processing (CGIP), 2023. DOI: 10.1109/CGIP58526.2023.00013
Class imbalance, which negatively affects classification model performance, is a common problem in machine learning. Various oversampling methods have been developed to compensate for imbalanced data, with SMOTE being one of the most commonly employed. However, deep generative models such as the variational autoencoder are showing promise as alternatives to traditional oversampling methods. This study investigated the potential of variational autoencoders to learn the distribution of the minority class and produce new observations of facial motion features extracted from an imbalanced medical dataset, and to assess the effects of applying oversampling before versus after the train-test split. The effectiveness of the variational autoencoder was compared to that of SMOTE in improving ordinal classification performance across the metrics of accuracy, accuracy±1, inter-rater reliability, specificity, and sensitivity, with no oversampling serving as the baseline. The results show that the variational autoencoder has potential as an oversampling method for facial motion features in the context of oro-facial dysfunction severity estimation. Oversampling prior to the train-test split was also shown to improve classification performance.
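The abstract describes oversampling minority-class facial-motion feature vectors with a variational autoencoder and comparing the result against SMOTE, with the oversampling step applied either before or after the train-test split. The paper's own implementation is not reproduced here; the sketch below is a minimal illustration of that general recipe using PyTorch and imbalanced-learn. The loader `load_facial_motion_features`, the minority label `y_min`, the sample count `n_needed`, and all network sizes are hypothetical placeholders, not details taken from the paper.

```python
# Minimal sketch (not the authors' implementation) of VAE-based minority-class
# oversampling for tabular facial-motion features, with SMOTE as a comparison.
# Assumes features are already extracted into a NumPy array X with ordinal labels y.
import numpy as np
import torch
import torch.nn as nn
from imblearn.over_sampling import SMOTE
from sklearn.model_selection import train_test_split


class VAE(nn.Module):
    def __init__(self, n_features, latent_dim=8, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU())
        self.fc_mu = nn.Linear(hidden, latent_dim)
        self.fc_logvar = nn.Linear(hidden, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, hidden), nn.ReLU(), nn.Linear(hidden, n_features)
        )

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        # Reparameterization trick: z = mu + sigma * eps
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        return self.decoder(z), mu, logvar


def oversample_with_vae(X_min, n_new, epochs=500, lr=1e-3):
    """Fit a VAE on minority-class feature vectors, then decode latent samples."""
    x = torch.tensor(X_min, dtype=torch.float32)
    vae = VAE(x.shape[1])
    opt = torch.optim.Adam(vae.parameters(), lr=lr)
    for _ in range(epochs):
        recon, mu, logvar = vae(x)
        recon_loss = nn.functional.mse_loss(recon, x, reduction="sum")
        kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
        loss = recon_loss + kld
        opt.zero_grad()
        loss.backward()
        opt.step()
    with torch.no_grad():
        # Draw new latent codes from the prior and decode them into synthetic samples.
        z = torch.randn(n_new, vae.fc_mu.out_features)
        return vae.decoder(z).numpy()


# Oversampling BEFORE the train-test split (one of the two settings studied):
# balance the full dataset first, then split.
# X, y = load_facial_motion_features()              # hypothetical loader
# X_new = oversample_with_vae(X[y == y_min], n_needed)
# X_bal = np.vstack([X, X_new])
# y_bal = np.concatenate([y, np.full(len(X_new), y_min)])
# X_tr, X_te, y_tr, y_te = train_test_split(X_bal, y_bal, stratify=y_bal)

# SMOTE baseline for comparison (oversample, then train the same classifier):
# X_res, y_res = SMOTE().fit_resample(X, y)
```

Oversampling after the split would instead apply `oversample_with_vae` (or SMOTE) to the training portion only, leaving the test set untouched; the study's comparison of the two settings is what the final sentence of the abstract refers to.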