Pradeep Kumar Yadalam , Prabhu Manickam Natarajan , Carlos M. Ardila
{"title":"Interpretable Ensemble Learning Predicts Antibiotic Resistance in Treponema denticola Using Expert Classifiers","authors":"Pradeep Kumar Yadalam , Prabhu Manickam Natarajan , Carlos M. Ardila","doi":"10.1016/j.identj.2025.100884","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction and Objectives</h3><div>Antibiotic resistance is a global health concern, contributing to prolonged hospital stays, increased medical costs, and higher mortality rates. Addressing antimicrobial resistance (AMR) in periodontal infections requires targeted therapies and a multifaceted approach. This study aims to predict and classify AMR genomic sequences in <em>Treponema denticola</em>, a key pathogen in periodontal disease, using machine learning (ML).</div></div><div><h3>Methods</h3><div>UniProt FASTA sequences were used to investigate AMR in <em>T. denticola</em>. Data were retrieved and preprocessed using the BioPython library in a Jupyter Notebook. A structured approach included data exploration, feature extraction, and visualization. Four classification models – Random Forest, Support Vector Machine (SVM), Gradient Boosting, and Neural Network (Multilayer Perceptron Classifier) – were optimized using specific hyperparameters. Model performance was evaluated using fivefold stratified cross-validation. A Voting Classifier, combining multiple models, was implemented to enhance predictive accuracy.</div></div><div><h3>Results</h3><div>The Voting Classifier outperformed Random Forest, SVM, Gradient Boosting, and Neural Network models, achieving the highest test accuracy (96.46%) and F1-score (0.9646). High accuracy was also demonstrated by SVM and Neural Networks (95.58%), but the robustness of the Voting Classifier was highlighted by its ability to balance accuracy with low log loss (0.1504).</div></div><div><h3>Conclusion</h3><div>This study highlights the effectiveness of the Voting Classifier in classifying AMR genomic sequences in <em>T. denticola</em>. The findings underscore the potential of interpretable ML approaches for advancing AMR research in periodontal pathogens and informing targeted therapeutic strategies.</div></div><div><h3>Clinical Relevance</h3><div>The ability to accurately predict AMR in <em>T. denticola</em> using ML models like the Voting Classifier can significantly enhance clinical decision-making. By identifying resistance patterns, clinicians can tailor antibiotic therapies more effectively, reducing treatment failures and mitigating the spread of resistance. This approach also supports the development of novel antimicrobial agents and strengthens public health surveillance efforts, particularly in resource-limited settings where periodontal infections are prevalent.</div></div>","PeriodicalId":13785,"journal":{"name":"International dental journal","volume":"75 5","pages":"Article 100884"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International dental journal","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S002065392500173X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction and Objectives
Antibiotic resistance is a global health concern, contributing to prolonged hospital stays, increased medical costs, and higher mortality rates. Addressing antimicrobial resistance (AMR) in periodontal infections requires targeted therapies and a multifaceted approach. This study aims to predict and classify AMR genomic sequences in Treponema denticola, a key pathogen in periodontal disease, using machine learning (ML).
Methods
UniProt FASTA sequences were used to investigate AMR in T. denticola. Data were retrieved and preprocessed using the BioPython library in a Jupyter Notebook. A structured approach included data exploration, feature extraction, and visualization. Four classification models – Random Forest, Support Vector Machine (SVM), Gradient Boosting, and Neural Network (Multilayer Perceptron Classifier) – were optimized using specific hyperparameters. Model performance was evaluated using fivefold stratified cross-validation. A Voting Classifier, combining multiple models, was implemented to enhance predictive accuracy.
Results
The Voting Classifier outperformed Random Forest, SVM, Gradient Boosting, and Neural Network models, achieving the highest test accuracy (96.46%) and F1-score (0.9646). High accuracy was also demonstrated by SVM and Neural Networks (95.58%), but the robustness of the Voting Classifier was highlighted by its ability to balance accuracy with low log loss (0.1504).
Conclusion
This study highlights the effectiveness of the Voting Classifier in classifying AMR genomic sequences in T. denticola. The findings underscore the potential of interpretable ML approaches for advancing AMR research in periodontal pathogens and informing targeted therapeutic strategies.
Clinical Relevance
The ability to accurately predict AMR in T. denticola using ML models like the Voting Classifier can significantly enhance clinical decision-making. By identifying resistance patterns, clinicians can tailor antibiotic therapies more effectively, reducing treatment failures and mitigating the spread of resistance. This approach also supports the development of novel antimicrobial agents and strengthens public health surveillance efforts, particularly in resource-limited settings where periodontal infections are prevalent.
期刊介绍:
The International Dental Journal features peer-reviewed, scientific articles relevant to international oral health issues, as well as practical, informative articles aimed at clinicians.