牙科服务在巴西南部的成年人中使用预测:以性别和种族公平为导向的机器学习方法。

IF 4.8 2区 医学 Q1 DENTISTRY, ORAL SURGERY & MEDICINE
Luiz Alexandre Chisini , Cínthia Fonseca Araújo , Felipe Mendes Delpino , Lílian Munhoz Figueiredo , Alexandre Dias Porto Chiavegatto Filho , Helena Silveira Schuch , Bruno Pereira Nunes , Flávio Fernando Demarco
{"title":"牙科服务在巴西南部的成年人中使用预测:以性别和种族公平为导向的机器学习方法。","authors":"Luiz Alexandre Chisini ,&nbsp;Cínthia Fonseca Araújo ,&nbsp;Felipe Mendes Delpino ,&nbsp;Lílian Munhoz Figueiredo ,&nbsp;Alexandre Dias Porto Chiavegatto Filho ,&nbsp;Helena Silveira Schuch ,&nbsp;Bruno Pereira Nunes ,&nbsp;Flávio Fernando Demarco","doi":"10.1016/j.jdent.2025.105929","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To develop machine learning models to predict the use of dental services among adults aged 18 and older.</div></div><div><h3>Methods</h3><div>This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.</div></div><div><h3>Results</h3><div>The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“<em>pardos”</em>) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.</div></div><div><h3>Conclusion</h3><div>Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.</div></div><div><h3>Clinical significance</h3><div>Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.</div></div>","PeriodicalId":15585,"journal":{"name":"Journal of dentistry","volume":"161 ","pages":"Article 105929"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dental services use prediction among adults in Southern Brazil: A gender and racial fairness-oriented machine learning approach\",\"authors\":\"Luiz Alexandre Chisini ,&nbsp;Cínthia Fonseca Araújo ,&nbsp;Felipe Mendes Delpino ,&nbsp;Lílian Munhoz Figueiredo ,&nbsp;Alexandre Dias Porto Chiavegatto Filho ,&nbsp;Helena Silveira Schuch ,&nbsp;Bruno Pereira Nunes ,&nbsp;Flávio Fernando Demarco\",\"doi\":\"10.1016/j.jdent.2025.105929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>To develop machine learning models to predict the use of dental services among adults aged 18 and older.</div></div><div><h3>Methods</h3><div>This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.</div></div><div><h3>Results</h3><div>The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“<em>pardos”</em>) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.</div></div><div><h3>Conclusion</h3><div>Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.</div></div><div><h3>Clinical significance</h3><div>Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.</div></div>\",\"PeriodicalId\":15585,\"journal\":{\"name\":\"Journal of dentistry\",\"volume\":\"161 \",\"pages\":\"Article 105929\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of dentistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0300571225003732\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of dentistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0300571225003732","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0

摘要

目的:开发机器学习模型来预测18岁及以上成年人的牙科服务使用情况。方法:这是一项前瞻性队列研究,使用来自“EAI Pelotas?”调查的数据。样本包括参加基线和随访的个人,共计3461人。预测指标作为基线收集,包括47个社会人口学、行为、口腔和一般健康特征。结果是在一年的随访中评估去年的牙科服务使用情况。数据分为训练集(80%)和测试集(20%)。测试了五个机器学习模型。通过10倍交叉验证优化超参数调优,利用30次迭代。根据受试者工作特征(ROC)曲线下面积(AUC)、准确率、召回率、精密度和f1评分来评估模型的性能。结果:随访中使用牙科服务的患病率为47.2% (95% CI, 45.5 ~ 48.9)。检验集中所有模型的AUC-ROC在0.76 ~ 0.77之间。在AUC度量模型中,CatBoost Classifier模型在测试数据集中表现出最高的性能(AUC=0.77, CI95%,[0.73-0.80]),准确率=0.69,召回率=0.69,精度=0.68,F1-score=0.69。对最佳模型的公平性估计表明,跨性别类别的表现是一致的。然而,在种族群体中观察到差异-自我报告混合(“pardos”)肤色的个体的AUC=0.57。可解释性分析表明,最重要的特征是最后一次牙科就诊基线和教育水平。结论:尽管我们的研究结果表明了对整体牙科服务使用的充分预测,但表现在种族群体中有所不同。临床意义:我们的研究结果突出了机器学习模型在预测牙科服务使用方面的潜力,总体准确性很高。然而,混血儿的表现明显较低,引发了人们对公平和平等的担忧。因此,尽管取得了令人鼓舞的结果,但该模型需要进一步完善,才能应用于现实世界的公共卫生环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Dental services use prediction among adults in Southern Brazil: A gender and racial fairness-oriented machine learning approach

Objective

To develop machine learning models to predict the use of dental services among adults aged 18 and older.

Methods

This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.

Results

The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“pardos”) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.

Conclusion

Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.

Clinical significance

Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of dentistry
Journal of dentistry 医学-牙科与口腔外科
CiteScore
7.30
自引率
11.40%
发文量
349
审稿时长
35 days
期刊介绍: The Journal of Dentistry has an open access mirror journal The Journal of Dentistry: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review. The Journal of Dentistry is the leading international dental journal within the field of Restorative Dentistry. Placing an emphasis on publishing novel and high-quality research papers, the Journal aims to influence the practice of dentistry at clinician, research, industry and policy-maker level on an international basis. Topics covered include the management of dental disease, periodontology, endodontology, operative dentistry, fixed and removable prosthodontics, dental biomaterials science, long-term clinical trials including epidemiology and oral health, technology transfer of new scientific instrumentation or procedures, as well as clinically relevant oral biology and translational research. The Journal of Dentistry will publish original scientific research papers including short communications. It is also interested in publishing review articles and leaders in themed areas which will be linked to new scientific research. Conference proceedings are also welcome and expressions of interest should be communicated to the Editor.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信