Luiz Alexandre Chisini , Cínthia Fonseca Araújo , Felipe Mendes Delpino , Lílian Munhoz Figueiredo , Alexandre Dias Porto Chiavegatto Filho , Helena Silveira Schuch , Bruno Pereira Nunes , Flávio Fernando Demarco
{"title":"牙科服务在巴西南部的成年人中使用预测:以性别和种族公平为导向的机器学习方法。","authors":"Luiz Alexandre Chisini , Cínthia Fonseca Araújo , Felipe Mendes Delpino , Lílian Munhoz Figueiredo , Alexandre Dias Porto Chiavegatto Filho , Helena Silveira Schuch , Bruno Pereira Nunes , Flávio Fernando Demarco","doi":"10.1016/j.jdent.2025.105929","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To develop machine learning models to predict the use of dental services among adults aged 18 and older.</div></div><div><h3>Methods</h3><div>This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.</div></div><div><h3>Results</h3><div>The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“<em>pardos”</em>) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.</div></div><div><h3>Conclusion</h3><div>Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.</div></div><div><h3>Clinical significance</h3><div>Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.</div></div>","PeriodicalId":15585,"journal":{"name":"Journal of dentistry","volume":"161 ","pages":"Article 105929"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dental services use prediction among adults in Southern Brazil: A gender and racial fairness-oriented machine learning approach\",\"authors\":\"Luiz Alexandre Chisini , Cínthia Fonseca Araújo , Felipe Mendes Delpino , Lílian Munhoz Figueiredo , Alexandre Dias Porto Chiavegatto Filho , Helena Silveira Schuch , Bruno Pereira Nunes , Flávio Fernando Demarco\",\"doi\":\"10.1016/j.jdent.2025.105929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>To develop machine learning models to predict the use of dental services among adults aged 18 and older.</div></div><div><h3>Methods</h3><div>This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.</div></div><div><h3>Results</h3><div>The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“<em>pardos”</em>) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.</div></div><div><h3>Conclusion</h3><div>Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.</div></div><div><h3>Clinical significance</h3><div>Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.</div></div>\",\"PeriodicalId\":15585,\"journal\":{\"name\":\"Journal of dentistry\",\"volume\":\"161 \",\"pages\":\"Article 105929\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of dentistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0300571225003732\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of dentistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0300571225003732","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Dental services use prediction among adults in Southern Brazil: A gender and racial fairness-oriented machine learning approach
Objective
To develop machine learning models to predict the use of dental services among adults aged 18 and older.
Methods
This is a prospective cohort study that uses data from the survey “EAI Pelotas?”. The sample consisted of individuals who participated in both the baseline and follow-up, totaling 3461 people. Predictors were collected as baseline and comprised 47 sociodemographic, behavioral, oral and general health characteristics. The outcome was dental service use in the last year assessed during the one-year follow-up. Data was divided into training (80 %) and test (20 %) sets. Five machine learning models were tested. Hyperparameter tuning was optimized through 10-fold cross-validation, utilizing 30 iterations. Model performance was assessed based on the area under the Receiver Operating Characteristic (ROC) curve (AUC), accuracy, recall, precision, and F1-score.
Results
The prevalence of dental service use in the follow-up was 47.2 % (95 % CI, 45.5 – 48.9). All models in the test set demonstrated an AUC-ROC between 0.76 and 0.77. The CatBoost Classifier model exhibited the highest performance in the test dataset among the models concerning the AUC metric (AUC = 0.77, CI95 %,[0.73–0.80]), displaying an accuracy = 0.69, recall = 0.69, precision = 0.68, and F1-score = 0.69. Fairness estimations for the best model indicated consistent performance across gender categories. However, disparities were observed among racial groups, AUC = 0.57 for individuals who self-reported mixed (“pardos”) skin color. The explainability analysis shows that the most important features were the last dental visit at baseline and education level.
Conclusion
Despite our findings suggesting a sufficient prediction of overall dental services’ use, performance varied across racial groups.
Clinical significance
Our findings highlight the potential of machine learning models to predict dental service use with good overall accuracy. However, the significantly lower performance for mixed-race individuals raises concerns about fairness and equity. Therefore, despite promising results, the model requires further refinement before it can be applied in real-world public health settings.
期刊介绍:
The Journal of Dentistry has an open access mirror journal The Journal of Dentistry: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
The Journal of Dentistry is the leading international dental journal within the field of Restorative Dentistry. Placing an emphasis on publishing novel and high-quality research papers, the Journal aims to influence the practice of dentistry at clinician, research, industry and policy-maker level on an international basis.
Topics covered include the management of dental disease, periodontology, endodontology, operative dentistry, fixed and removable prosthodontics, dental biomaterials science, long-term clinical trials including epidemiology and oral health, technology transfer of new scientific instrumentation or procedures, as well as clinically relevant oral biology and translational research.
The Journal of Dentistry will publish original scientific research papers including short communications. It is also interested in publishing review articles and leaders in themed areas which will be linked to new scientific research. Conference proceedings are also welcome and expressions of interest should be communicated to the Editor.