Nikolaos Angelakopoulos, Rizky Merdietio Boedi, Ademir Franco, Nikita Polukhin, Akiko Kumagai, Ivan Galic, Jeta Kelmendi, Israel Soriano Vázquez, Sang-Seob Lee, Galina Zolotenkova, Roberto Scendoni, Stefano De Luca
{"title":"评估牙齿年龄估计的主观性:在四种建立良好的第三磨牙评估方法中,内部和内部观察者的可靠性。","authors":"Nikolaos Angelakopoulos, Rizky Merdietio Boedi, Ademir Franco, Nikita Polukhin, Akiko Kumagai, Ivan Galic, Jeta Kelmendi, Israel Soriano Vázquez, Sang-Seob Lee, Galina Zolotenkova, Roberto Scendoni, Stefano De Luca","doi":"10.1007/s00414-025-03616-w","DOIUrl":null,"url":null,"abstract":"<p><p>In forensic contexts, age assessments constitute matters of substantial legalconsequence, particularly in proceedings involving children and young adolescents. Dental age estimation (DAE) techniques are widely used for this purpose, especially in cases involving undocumented minors. This study assesses intra- and inter-observer reliability across four well-established DAE methods: Gleiser and Hunt Modified by Köhler (GHK), Demirjian (DEM), Kullman (KUL), and Cameriere's Third Molar Maturity Index (I3M). A total of 50 panoramic radiographs from individuals aged 14-23.99 years were analyzed by nine qualified forensic experts. The observers assessed the development stages of third molars using the three staging methods (GHK, DEM, KUL) and measured the I3M using Cameriere's metric approach. Primarily, the quantitative assessment for analyzing the agreement was Cohen's Kappa, Gwet's Agreement Coefficient (AC1) and (AC2), and Intraclass Correlation Coefficient (ICC). Statistical analysis revealed high intra-observer reliability for all methods, with coefficient values indicating strong agreement among individual observers. In terms of inter-observer reliability, the I3M achieved the highest agreement (ICC 0.986), followed by DEM (AC2 0.918), GHK (AC2 0.914), and KUL (AC2 0.868). Notably, maxillary third molars consistently showed lower inter-observer agreement than mandibular third molars, particularly when assessed using the DEM and GHK methods. The highest inter-observer agreement in cases where a tooth could not be staged or measured was observed for the KUL method (AC1 0.993), followed by I3M (AC1 0.988), with DEM and GHK, demonstrating equivalent levels of agreement (AC1 0.954). All of the tested methods yielded highly reliable results, especially DEM and GHK. The choice of a staging method should be guided by the specific objectives of each study. Moreover, while the I3M method demonstrated high reliability values, obtaining identical repeated measurements was nearly impossible due to its metric approach..</p>","PeriodicalId":14071,"journal":{"name":"International Journal of Legal Medicine","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing subjectivity in dental age estimation: intra- and inter-observer reliability across four well established third molar evaluation methods.\",\"authors\":\"Nikolaos Angelakopoulos, Rizky Merdietio Boedi, Ademir Franco, Nikita Polukhin, Akiko Kumagai, Ivan Galic, Jeta Kelmendi, Israel Soriano Vázquez, Sang-Seob Lee, Galina Zolotenkova, Roberto Scendoni, Stefano De Luca\",\"doi\":\"10.1007/s00414-025-03616-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In forensic contexts, age assessments constitute matters of substantial legalconsequence, particularly in proceedings involving children and young adolescents. Dental age estimation (DAE) techniques are widely used for this purpose, especially in cases involving undocumented minors. This study assesses intra- and inter-observer reliability across four well-established DAE methods: Gleiser and Hunt Modified by Köhler (GHK), Demirjian (DEM), Kullman (KUL), and Cameriere's Third Molar Maturity Index (I3M). A total of 50 panoramic radiographs from individuals aged 14-23.99 years were analyzed by nine qualified forensic experts. The observers assessed the development stages of third molars using the three staging methods (GHK, DEM, KUL) and measured the I3M using Cameriere's metric approach. Primarily, the quantitative assessment for analyzing the agreement was Cohen's Kappa, Gwet's Agreement Coefficient (AC1) and (AC2), and Intraclass Correlation Coefficient (ICC). Statistical analysis revealed high intra-observer reliability for all methods, with coefficient values indicating strong agreement among individual observers. In terms of inter-observer reliability, the I3M achieved the highest agreement (ICC 0.986), followed by DEM (AC2 0.918), GHK (AC2 0.914), and KUL (AC2 0.868). Notably, maxillary third molars consistently showed lower inter-observer agreement than mandibular third molars, particularly when assessed using the DEM and GHK methods. The highest inter-observer agreement in cases where a tooth could not be staged or measured was observed for the KUL method (AC1 0.993), followed by I3M (AC1 0.988), with DEM and GHK, demonstrating equivalent levels of agreement (AC1 0.954). All of the tested methods yielded highly reliable results, especially DEM and GHK. The choice of a staging method should be guided by the specific objectives of each study. Moreover, while the I3M method demonstrated high reliability values, obtaining identical repeated measurements was nearly impossible due to its metric approach..</p>\",\"PeriodicalId\":14071,\"journal\":{\"name\":\"International Journal of Legal Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Legal Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00414-025-03616-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, LEGAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Legal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00414-025-03616-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
Assessing subjectivity in dental age estimation: intra- and inter-observer reliability across four well established third molar evaluation methods.
In forensic contexts, age assessments constitute matters of substantial legalconsequence, particularly in proceedings involving children and young adolescents. Dental age estimation (DAE) techniques are widely used for this purpose, especially in cases involving undocumented minors. This study assesses intra- and inter-observer reliability across four well-established DAE methods: Gleiser and Hunt Modified by Köhler (GHK), Demirjian (DEM), Kullman (KUL), and Cameriere's Third Molar Maturity Index (I3M). A total of 50 panoramic radiographs from individuals aged 14-23.99 years were analyzed by nine qualified forensic experts. The observers assessed the development stages of third molars using the three staging methods (GHK, DEM, KUL) and measured the I3M using Cameriere's metric approach. Primarily, the quantitative assessment for analyzing the agreement was Cohen's Kappa, Gwet's Agreement Coefficient (AC1) and (AC2), and Intraclass Correlation Coefficient (ICC). Statistical analysis revealed high intra-observer reliability for all methods, with coefficient values indicating strong agreement among individual observers. In terms of inter-observer reliability, the I3M achieved the highest agreement (ICC 0.986), followed by DEM (AC2 0.918), GHK (AC2 0.914), and KUL (AC2 0.868). Notably, maxillary third molars consistently showed lower inter-observer agreement than mandibular third molars, particularly when assessed using the DEM and GHK methods. The highest inter-observer agreement in cases where a tooth could not be staged or measured was observed for the KUL method (AC1 0.993), followed by I3M (AC1 0.988), with DEM and GHK, demonstrating equivalent levels of agreement (AC1 0.954). All of the tested methods yielded highly reliable results, especially DEM and GHK. The choice of a staging method should be guided by the specific objectives of each study. Moreover, while the I3M method demonstrated high reliability values, obtaining identical repeated measurements was nearly impossible due to its metric approach..
期刊介绍:
The International Journal of Legal Medicine aims to improve the scientific resources used in the elucidation of crime and related forensic applications at a high level of evidential proof. The journal offers review articles tracing development in specific areas, with up-to-date analysis; original articles discussing significant recent research results; case reports describing interesting and exceptional examples; population data; letters to the editors; and technical notes, which appear in a section originally created for rapid publication of data in the dynamic field of DNA analysis.