Hengqing Cui, Ziqi Zhang, Wenjun Zhang, Jun Zhang, Haiyan Cui
{"title":"A Deep Learning-based Ensemble Model for Automated Nasolabial Fold Severity Grading.","authors":"Hengqing Cui, Ziqi Zhang, Wenjun Zhang, Jun Zhang, Haiyan Cui","doi":"10.1093/asj/sjaf161","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Nasolabial fold (NLF) severity is a key indicator of facial aging and a frequent target in aesthetic treatments. The Wrinkle Severity Rating Scale (WSRS) is widely used for clinical grading but remains inherently subjective and vulnerable to inter-observer variability.</p><p><strong>Objectives: </strong>This study aimed to develop and validate DeepFold, a deep learning-based ensemble model for automated, objective, and clinically interpretable grading of NLF severity based on the WSRS.</p><p><strong>Methods: </strong>A dataset of 6,718 facial images was constructed, including 1,718 images from clinical outpatients and 5,000 from the CelebA dataset. All images were split into left and right halves and annotated independently by three senior plastic surgeons using the WSRS. ResNet-50 served as the base model architecture, and an ensemble strategy was applied using majority voting over three independently trained networks. Model training used focal loss to address class imbalance and was conducted in PyTorch with early stopping based on validation loss. Performance was assessed using accuracy, F1-score, and confusion matrix analysis.</p><p><strong>Results: </strong>The DeepFold ensemble model achieved a validation accuracy and F1-score of 0.917, outperforming individual baseline models such as ResNet-50 (accuracy: 0.904) and SeResNet-50 (accuracy: 0.882). Ensemble strategies reduced prediction variance and enhanced model robustness under class imbalance.</p><p><strong>Conclusions: </strong>DeepFold provides a reliable and standardized approach to NLF severity assessment, offering potential clinical value in aesthetic evaluation, treatment planning, and outcome monitoring.</p>","PeriodicalId":7728,"journal":{"name":"Aesthetic Surgery Journal","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aesthetic Surgery Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/asj/sjaf161","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Nasolabial fold (NLF) severity is a key indicator of facial aging and a frequent target in aesthetic treatments. The Wrinkle Severity Rating Scale (WSRS) is widely used for clinical grading but remains inherently subjective and vulnerable to inter-observer variability.
Objectives: This study aimed to develop and validate DeepFold, a deep learning-based ensemble model for automated, objective, and clinically interpretable grading of NLF severity based on the WSRS.
Methods: A dataset of 6,718 facial images was constructed, including 1,718 images from clinical outpatients and 5,000 from the CelebA dataset. All images were split into left and right halves and annotated independently by three senior plastic surgeons using the WSRS. ResNet-50 served as the base model architecture, and an ensemble strategy was applied using majority voting over three independently trained networks. Model training used focal loss to address class imbalance and was conducted in PyTorch with early stopping based on validation loss. Performance was assessed using accuracy, F1-score, and confusion matrix analysis.
Results: The DeepFold ensemble model achieved a validation accuracy and F1-score of 0.917, outperforming individual baseline models such as ResNet-50 (accuracy: 0.904) and SeResNet-50 (accuracy: 0.882). Ensemble strategies reduced prediction variance and enhanced model robustness under class imbalance.
Conclusions: DeepFold provides a reliable and standardized approach to NLF severity assessment, offering potential clinical value in aesthetic evaluation, treatment planning, and outcome monitoring.
期刊介绍:
Aesthetic Surgery Journal is a peer-reviewed international journal focusing on scientific developments and clinical techniques in aesthetic surgery. The official publication of The Aesthetic Society, ASJ is also the official English-language journal of many major international societies of plastic, aesthetic and reconstructive surgery representing South America, Central America, Europe, Asia, and the Middle East. It is also the official journal of the British Association of Aesthetic Plastic Surgeons, the Canadian Society for Aesthetic Plastic Surgery and The Rhinoplasty Society.