Y-H Lee,J H Lee,Q-S Auh,S Lee,D Nixdorf,A Chaurasia
{"title":"TMD Diagnosis Using a Masked Self-Supervised Tabular Transformer.","authors":"Y-H Lee,J H Lee,Q-S Auh,S Lee,D Nixdorf,A Chaurasia","doi":"10.1177/00220345251376974","DOIUrl":null,"url":null,"abstract":"Temporomandibular disorders (TMDs) encompass a heterogeneous group of musculoskeletal conditions involving the temporomandibular joint (TMJ), masticatory muscles, and associated structures. Diagnosis remains challenging due to overlapping symptoms, multifactorial etiology, and variability across clinical settings. To address these limitations, we developed the Gated Attention Tabular Transformer (GATT), a novel deep-learning model that uses masked self-supervised learning and gated attention mechanisms, to classify TMD subgroups based on the diagnostic criteria for TMD (DC/TMD). A total of 4,644 structured clinical records from a university-based registry were analyzed, comprising 3,524 female and 1,120 male patients (mean age 36.9 ± 14.7 y), across 12 core TMD subgroups. GATT achieved robust diagnostic performance with area under the receiver-operating characteristic curve values ranging from 0.815 to 1.000, sensitivity from 0.652 to 1.000, and specificity from 0.773 to 1.000. The model significantly outperformed conventional machine-learning methods including logistic regression, random forest, support vector machine, and XGBoost as well as advanced tabular deep-learning models such as TabNet, TabTransformer, AutoGluon Tabular Predictor, and FT-Transformer. Shapley additive explanations (SHAP) analysis revealed \"pain-free opening\" (SHAP = 6.78, P < 0.001) and \"current TMJ noise\" (SHAP = 2.87, P = 0.003) as key features of mechanical TMJ disorders. Co-occurrence network analysis uncovered side-specific clustering and potential time-lagged progression between bilateral TMJs. These findings demonstrate the feasibility of using deep learning to classify heterogeneous TMD subgroups using only structured clinical data, without the need for imaging. The GATT model offers an accurate, explainable, and scalable tool to support clinician-assisted diagnosis and reduce variability in TMD management in real-world practice. These results support the integration of AI-driven tools such as GATT into clinical workflows for standardized, efficient, and patient-specific TMD diagnosis.","PeriodicalId":15596,"journal":{"name":"Journal of Dental Research","volume":"3 1","pages":"220345251376974"},"PeriodicalIF":5.9000,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Dental Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/00220345251376974","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Temporomandibular disorders (TMDs) encompass a heterogeneous group of musculoskeletal conditions involving the temporomandibular joint (TMJ), masticatory muscles, and associated structures. Diagnosis remains challenging due to overlapping symptoms, multifactorial etiology, and variability across clinical settings. To address these limitations, we developed the Gated Attention Tabular Transformer (GATT), a novel deep-learning model that uses masked self-supervised learning and gated attention mechanisms, to classify TMD subgroups based on the diagnostic criteria for TMD (DC/TMD). A total of 4,644 structured clinical records from a university-based registry were analyzed, comprising 3,524 female and 1,120 male patients (mean age 36.9 ± 14.7 y), across 12 core TMD subgroups. GATT achieved robust diagnostic performance with area under the receiver-operating characteristic curve values ranging from 0.815 to 1.000, sensitivity from 0.652 to 1.000, and specificity from 0.773 to 1.000. The model significantly outperformed conventional machine-learning methods including logistic regression, random forest, support vector machine, and XGBoost as well as advanced tabular deep-learning models such as TabNet, TabTransformer, AutoGluon Tabular Predictor, and FT-Transformer. Shapley additive explanations (SHAP) analysis revealed "pain-free opening" (SHAP = 6.78, P < 0.001) and "current TMJ noise" (SHAP = 2.87, P = 0.003) as key features of mechanical TMJ disorders. Co-occurrence network analysis uncovered side-specific clustering and potential time-lagged progression between bilateral TMJs. These findings demonstrate the feasibility of using deep learning to classify heterogeneous TMD subgroups using only structured clinical data, without the need for imaging. The GATT model offers an accurate, explainable, and scalable tool to support clinician-assisted diagnosis and reduce variability in TMD management in real-world practice. These results support the integration of AI-driven tools such as GATT into clinical workflows for standardized, efficient, and patient-specific TMD diagnosis.
期刊介绍:
The Journal of Dental Research (JDR) is a peer-reviewed scientific journal committed to sharing new knowledge and information on all sciences related to dentistry and the oral cavity, covering health and disease. With monthly publications, JDR ensures timely communication of the latest research to the oral and dental community.