{"title":"基于磁共振血管造影数据特征子集选择的机器学习年龄预测。","authors":"Hoon-Seok Yoon, Yoon-Chul Kim","doi":"10.4258/hir.2025.31.3.284","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>The objective of this study was to evaluate the effectiveness of machine learning (ML) models using selected subsets of features to predict age based on intracranial arterial segments' tortuosity and diameter characteristics derived from magnetic resonance angiography (MRA) data. Additionally, this study aimed to identify key vascular features important for predicting vascular age.</p><p><strong>Methods: </strong>Three-dimensional time-of-flight MRA image data from 171 subjects were analyzed. After annotating the endpoints for each arterial segment, 169 features-comprising tortuosity metrics and arterial segment diameter statistics-were extracted. Five ML models (random forest, linear regression, AdaBoost, XGBoost, and lightGBM) were trained and validated. Two feature selection methods, correlation-based feature selection (CFS) and Relief-F, were applied to identify optimal feature subsets.</p><p><strong>Results: </strong>The random forest model utilizing the CFS-based 50% feature subset achieved the best performance, with a root mean square error of 14.0 years, a coefficient of determination (R2) of 0.275, and a Pearson correlation coefficient of 0.560. Tortuosity metrics (e.g., triangular index of the left posterior cerebral artery P1 segment) appeared more frequently than diameter statistics among the top five most important features.</p><p><strong>Conclusions: </strong>CFS-based feature selection enhanced the performance of ML-based age prediction compared with using the complete feature set. Linear regression consistently demonstrated the poorest performance across all evaluation metrics. ML-based age prediction using segmental tortuosity metrics and diameter statistics is feasible, potentially revealing significant features related to vascular aging.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 3","pages":"284-294"},"PeriodicalIF":2.1000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12370420/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Based Age Prediction with Feature Subset Selection from Magnetic Resonance Angiography Data.\",\"authors\":\"Hoon-Seok Yoon, Yoon-Chul Kim\",\"doi\":\"10.4258/hir.2025.31.3.284\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>The objective of this study was to evaluate the effectiveness of machine learning (ML) models using selected subsets of features to predict age based on intracranial arterial segments' tortuosity and diameter characteristics derived from magnetic resonance angiography (MRA) data. Additionally, this study aimed to identify key vascular features important for predicting vascular age.</p><p><strong>Methods: </strong>Three-dimensional time-of-flight MRA image data from 171 subjects were analyzed. After annotating the endpoints for each arterial segment, 169 features-comprising tortuosity metrics and arterial segment diameter statistics-were extracted. Five ML models (random forest, linear regression, AdaBoost, XGBoost, and lightGBM) were trained and validated. Two feature selection methods, correlation-based feature selection (CFS) and Relief-F, were applied to identify optimal feature subsets.</p><p><strong>Results: </strong>The random forest model utilizing the CFS-based 50% feature subset achieved the best performance, with a root mean square error of 14.0 years, a coefficient of determination (R2) of 0.275, and a Pearson correlation coefficient of 0.560. Tortuosity metrics (e.g., triangular index of the left posterior cerebral artery P1 segment) appeared more frequently than diameter statistics among the top five most important features.</p><p><strong>Conclusions: </strong>CFS-based feature selection enhanced the performance of ML-based age prediction compared with using the complete feature set. Linear regression consistently demonstrated the poorest performance across all evaluation metrics. ML-based age prediction using segmental tortuosity metrics and diameter statistics is feasible, potentially revealing significant features related to vascular aging.</p>\",\"PeriodicalId\":12947,\"journal\":{\"name\":\"Healthcare Informatics Research\",\"volume\":\"31 3\",\"pages\":\"284-294\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12370420/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Healthcare Informatics Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4258/hir.2025.31.3.284\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/31 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2025.31.3.284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/31 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
Machine Learning-Based Age Prediction with Feature Subset Selection from Magnetic Resonance Angiography Data.
Objectives: The objective of this study was to evaluate the effectiveness of machine learning (ML) models using selected subsets of features to predict age based on intracranial arterial segments' tortuosity and diameter characteristics derived from magnetic resonance angiography (MRA) data. Additionally, this study aimed to identify key vascular features important for predicting vascular age.
Methods: Three-dimensional time-of-flight MRA image data from 171 subjects were analyzed. After annotating the endpoints for each arterial segment, 169 features-comprising tortuosity metrics and arterial segment diameter statistics-were extracted. Five ML models (random forest, linear regression, AdaBoost, XGBoost, and lightGBM) were trained and validated. Two feature selection methods, correlation-based feature selection (CFS) and Relief-F, were applied to identify optimal feature subsets.
Results: The random forest model utilizing the CFS-based 50% feature subset achieved the best performance, with a root mean square error of 14.0 years, a coefficient of determination (R2) of 0.275, and a Pearson correlation coefficient of 0.560. Tortuosity metrics (e.g., triangular index of the left posterior cerebral artery P1 segment) appeared more frequently than diameter statistics among the top five most important features.
Conclusions: CFS-based feature selection enhanced the performance of ML-based age prediction compared with using the complete feature set. Linear regression consistently demonstrated the poorest performance across all evaluation metrics. ML-based age prediction using segmental tortuosity metrics and diameter statistics is feasible, potentially revealing significant features related to vascular aging.