Patterns of observer error in scoring macromorphoscopic traits for population affinity

IF 1.8 · Zone 4 (Medicine) · Q2 MEDICINE, LEGAL

Journal of forensic sciences Pub Date : 2025-05-07 DOI:10.1111/1556-4029.70063

Leandi Liebenberg PhD, Kyra E. Stull PhD, Ericka N. L'Abbé PhD

Journal of forensic sciences, volume 70(4), pages 1489-1500. Publication type: Journal Article. Full text: https://onlinelibrary.wiley.com/doi/10.1111/1556-4029.70063

Citations: 0

Abstract

Revising methodologies is essential to understand the limitations and biases inherent in certain methods, which is crucial for obtaining reliable results. Due to the subjective nature of non-metric methods, variation in trait scoring and its impact on accurately classifying biological parameters remain a concern that requires further investigation. This study aimed to examine the effects of observer experience, familiarity with the method, and different statistical approaches on the repeatability of macromorphoscopic (MMS) traits in the cranium for population affinity. Seventeen traits were scored on a sample of 10 crania by five observers with varying experience levels. Intra-observer agreement ranged from moderate to perfect, with three traits (inferior nasal margin, nasal bone shape, and nasal overgrowth) demonstrating the lowest agreement. Overall, inter-observer repeatability ranged from poor to substantial agreement. After a group discussion on the scoring procedure and subsequent rescoring of the crania, a slight improvement in agreement was observed, with kappa values shifting towards moderate and substantial levels. Each observer exhibited variation in the repeatability of different traits. While general experience did not consistently translate into proficiency with the method, familiarity with the specific traits and scoring procedures contributed to more consistent results. Therefore, method-specific training is crucial before applying the MMS traits in practice. Additionally, the choice of statistical approach, such as applying different weights to Cohen's kappa based on data type, can influence the perceived reliability of a method. Practitioners should select weights and tests that are most appropriate for the data type of each trait being analyzed.
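The point about weighting can be illustrated with a minimal sketch of Cohen's kappa for two observers. It is not the authors' analysis: the function name `cohen_kappa`, the five-state ordinal scale, and the scores for 10 crania are all hypothetical, chosen so that every disagreement is a one-step near miss. The weights used here are the standard linear and quadratic disagreement weights.

```python
def cohen_kappa(scores_a, scores_b, categories, weights=None):
    """Cohen's kappa for two observers.

    weights=None        -> unweighted kappa (suits nominal traits)
    weights="linear"    -> penalty grows with distance between ordered states
    weights="quadratic" -> penalty grows with squared distance
    `categories` must list the trait states in their ordinal order.
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("both observers must score the same non-empty sample")
    if weights not in (None, "linear", "quadratic"):
        raise ValueError("weights must be None, 'linear' or 'quadratic'")

    k = len(categories)
    n = len(scores_a)
    index = {state: i for i, state in enumerate(categories)}

    # Cross-tabulate the two observers' scores.
    counts = [[0] * k for _ in range(k)]
    for a, b in zip(scores_a, scores_b):
        counts[index[a]][index[b]] += 1

    # Marginal proportions for each observer.
    row = [sum(counts[i]) / n for i in range(k)]
    col = [sum(counts[i][j] for i in range(k)) / n for j in range(k)]

    def penalty(i, j):
        # Disagreement weight: 0 on the diagonal, up to 1 for the worst miss.
        if weights is None:
            return 0.0 if i == j else 1.0
        distance = abs(i - j) / max(k - 1, 1)
        return distance if weights == "linear" else distance ** 2

    cells = [(i, j) for i in range(k) for j in range(k)]
    observed = sum(penalty(i, j) * counts[i][j] / n for i, j in cells)
    expected = sum(penalty(i, j) * row[i] * col[j] for i, j in cells)
    if expected == 0:
        return float("nan")  # kappa is undefined when no disagreement is expected
    return 1.0 - observed / expected


# Hypothetical scores for one ordinal trait (states 0-4) on 10 crania.
states = [0, 1, 2, 3, 4]
observer_1 = [0, 1, 2, 3, 4, 0, 1, 2, 3, 4]
observer_2 = [0, 1, 2, 3, 4, 1, 2, 3, 4, 3]

for scheme in (None, "linear", "quadratic"):
    kappa = cohen_kappa(observer_1, observer_2, states, weights=scheme)
    print(scheme, round(kappa, 3))
```

On these made-up scores the same ten pairs of observations give roughly 0.38 unweighted, 0.68 with linear weights, and 0.86 with quadratic weights, which is why the weighting scheme should follow the data type of the trait (unweighted for nominal states, weighted for ordered ones) rather than be picked for the most favourable value.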




Source journal

Journal of forensic sciences (Medicine: Medicine, Legal)

CiteScore: 4.00

Self-citation rate: 12.50%

Articles published: 215

Review time: 2 months

About the journal: The Journal of Forensic Sciences (JFS) is the official publication of the American Academy of Forensic Sciences (AAFS). It is devoted to the publication of original investigations, observations, scholarly inquiries and reviews in various branches of the forensic sciences. These include anthropology, criminalistics, digital and multimedia sciences, engineering and applied sciences, pathology/biology, psychiatry and behavioral science, jurisprudence, odontology, questioned documents, and toxicology. Similar submissions dealing with forensic aspects of other sciences and the social sciences are also accepted, as are submissions dealing with scientifically sound emerging science disciplines. The content and/or views expressed in the JFS are not necessarily those of the AAFS, the JFS Editorial Board, the organizations with which authors are affiliated, or the publisher of JFS. All manuscript submissions are double-blind peer-reviewed.