Lei Shi , Xinran Huang , Wenchi Ke , Hrvoje Brkić , Yuchi Zhou , Ting Lu , Xian’e Tang , Lirong Qiu , Shuai Luo , Xingtao Zhang , Ziqi Cheng , Yushan Lin , Peixi Liao , Hu Chen , Yi Zhang , Yijiu Chen , Zhenhua Deng , Fei Fan
{"title":"通过自蒸馏混合注意网络增强儿童头部MRI年龄估计","authors":"Lei Shi , Xinran Huang , Wenchi Ke , Hrvoje Brkić , Yuchi Zhou , Ting Lu , Xian’e Tang , Lirong Qiu , Shuai Luo , Xingtao Zhang , Ziqi Cheng , Yushan Lin , Peixi Liao , Hu Chen , Yi Zhang , Yijiu Chen , Zhenhua Deng , Fei Fan","doi":"10.1016/j.bspc.2025.108748","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objective:</h3><div>Age estimation is crucial in pediatrics, developmental medicine, often conducted by radiographic techniques exposing children to ionizing radiation. Magnetic Resonance Imaging (MRI) offers a safer, radiation-free alternative. Automatic age estimation is rapidly advancing, offering an efficient approach that reduces human bias and saves manpower. This study aims to exploit the potential of head MRI in automatic age estimation in pediatric population via deep learning.</div></div><div><h3>Methods and materials:</h3><div>We propose a self-distillation and hybrid-attention network (SDHA) to estimate age from 3-T head MRI from children. We train SDHA network utilizing self-distillation and integrating Squeeze-and-Excitation (SE), Spatial Transformer (ST) attention mechanisms. Four stacked attention modules (SE, ST) were embedded to backbone network ResNet50 (teacher), generating deeper predictions; early exit branches (students) were added to generate shallower predictions. Three types of losses are employed to achieve knowledge distillation to enhance both performance and computational efficiency. SDHA is evaluated against manual and traditional CNN methods by mean absolute error (MAE) and root mean squared error (RMSE).</div></div><div><h3>Results:</h3><div>SDHA (MAE = 0.34 years) yielded a lower MAE than manual method (MAE = 0.44 years). MAE decreased by 63.4% with SDHA compared to non-distilled SENet (MAE = 0.93 years). Prediction error density curve shows higher precision by SDHA. Grad-CAM visualization revealed that SDHA adaptively focuses on age-relevant dental, facial and brain structures. SDHA reduced prediction time from 120 s (manual assessment) to 0.11 s per subject.</div></div><div><h3>Conclusion:</h3><div>The proposed SDHA demonstrates superior performance over manual and existing CNN methods for dental age estimation from head MRI. Its adaptive attention to age-relevant anatomical structures and significant efficiency gains make it valuable for applications in pediatric age estimation.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108748"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced pediatric age estimation from head MRI via self-distillation hybrid-attention network\",\"authors\":\"Lei Shi , Xinran Huang , Wenchi Ke , Hrvoje Brkić , Yuchi Zhou , Ting Lu , Xian’e Tang , Lirong Qiu , Shuai Luo , Xingtao Zhang , Ziqi Cheng , Yushan Lin , Peixi Liao , Hu Chen , Yi Zhang , Yijiu Chen , Zhenhua Deng , Fei Fan\",\"doi\":\"10.1016/j.bspc.2025.108748\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and objective:</h3><div>Age estimation is crucial in pediatrics, developmental medicine, often conducted by radiographic techniques exposing children to ionizing radiation. Magnetic Resonance Imaging (MRI) offers a safer, radiation-free alternative. Automatic age estimation is rapidly advancing, offering an efficient approach that reduces human bias and saves manpower. This study aims to exploit the potential of head MRI in automatic age estimation in pediatric population via deep learning.</div></div><div><h3>Methods and materials:</h3><div>We propose a self-distillation and hybrid-attention network (SDHA) to estimate age from 3-T head MRI from children. We train SDHA network utilizing self-distillation and integrating Squeeze-and-Excitation (SE), Spatial Transformer (ST) attention mechanisms. Four stacked attention modules (SE, ST) were embedded to backbone network ResNet50 (teacher), generating deeper predictions; early exit branches (students) were added to generate shallower predictions. Three types of losses are employed to achieve knowledge distillation to enhance both performance and computational efficiency. SDHA is evaluated against manual and traditional CNN methods by mean absolute error (MAE) and root mean squared error (RMSE).</div></div><div><h3>Results:</h3><div>SDHA (MAE = 0.34 years) yielded a lower MAE than manual method (MAE = 0.44 years). MAE decreased by 63.4% with SDHA compared to non-distilled SENet (MAE = 0.93 years). Prediction error density curve shows higher precision by SDHA. Grad-CAM visualization revealed that SDHA adaptively focuses on age-relevant dental, facial and brain structures. SDHA reduced prediction time from 120 s (manual assessment) to 0.11 s per subject.</div></div><div><h3>Conclusion:</h3><div>The proposed SDHA demonstrates superior performance over manual and existing CNN methods for dental age estimation from head MRI. Its adaptive attention to age-relevant anatomical structures and significant efficiency gains make it valuable for applications in pediatric age estimation.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108748\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012595\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012595","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
Enhanced pediatric age estimation from head MRI via self-distillation hybrid-attention network
Background and objective:
Age estimation is crucial in pediatrics, developmental medicine, often conducted by radiographic techniques exposing children to ionizing radiation. Magnetic Resonance Imaging (MRI) offers a safer, radiation-free alternative. Automatic age estimation is rapidly advancing, offering an efficient approach that reduces human bias and saves manpower. This study aims to exploit the potential of head MRI in automatic age estimation in pediatric population via deep learning.
Methods and materials:
We propose a self-distillation and hybrid-attention network (SDHA) to estimate age from 3-T head MRI from children. We train SDHA network utilizing self-distillation and integrating Squeeze-and-Excitation (SE), Spatial Transformer (ST) attention mechanisms. Four stacked attention modules (SE, ST) were embedded to backbone network ResNet50 (teacher), generating deeper predictions; early exit branches (students) were added to generate shallower predictions. Three types of losses are employed to achieve knowledge distillation to enhance both performance and computational efficiency. SDHA is evaluated against manual and traditional CNN methods by mean absolute error (MAE) and root mean squared error (RMSE).
Results:
SDHA (MAE = 0.34 years) yielded a lower MAE than manual method (MAE = 0.44 years). MAE decreased by 63.4% with SDHA compared to non-distilled SENet (MAE = 0.93 years). Prediction error density curve shows higher precision by SDHA. Grad-CAM visualization revealed that SDHA adaptively focuses on age-relevant dental, facial and brain structures. SDHA reduced prediction time from 120 s (manual assessment) to 0.11 s per subject.
Conclusion:
The proposed SDHA demonstrates superior performance over manual and existing CNN methods for dental age estimation from head MRI. Its adaptive attention to age-relevant anatomical structures and significant efficiency gains make it valuable for applications in pediatric age estimation.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.