Skin Type Diversity in Skin Lesion Datasets: A Review.

IF 1.4 Q2 DERMATOLOGY

Current Dermatology Reports Pub Date : 2024-01-01 Epub Date: 2024-08-14 DOI:10.1007/s13671-024-00440-0

Neda Alipour, Ted Burke, Jane Courtney

{"title":"Skin Type Diversity in Skin Lesion Datasets: A Review.","authors":"Neda Alipour, Ted Burke, Jane Courtney","doi":"10.1007/s13671-024-00440-0","DOIUrl":null,"url":null,"abstract":"Purpose of review: Skin type diversity in image datasets refers to the representation of various skin types. This diversity allows for the verification of comparable performance of a trained model across different skin types. A widespread problem in datasets involving human skin is the lack of verifiable diversity in skin types, making it difficult to evaluate whether the performance of the trained models generalizes across different skin types. For example, the diversity issues in skin lesion datasets, which are used to train deep learning-based models, often result in lower accuracy for darker skin types that are typically under-represented in these datasets. Under-representation in datasets results in lower performance in deep learning models for under-represented skin types.Recent findings: This issue has been discussed in previous works; however, the reporting of skin types, and inherent diversity, have not been fully assessed. Some works report skin types but do not attempt to assess the representation of each skin type in datasets. Others, focusing on skin lesions, identify the issue but do not measure skin type diversity in the datasets examined.Summary: Effort is needed to address these shortcomings and move towards facilitating verifiable diversity. Building on previous works in skin lesion datasets, this review explores the general issue of skin type diversity by investigating and evaluating skin lesion datasets specifically. The main contributions of this work are an evaluation of publicly available skin lesion datasets and their metadata to assess the frequency and completeness of reporting of skin type and an investigation into the diversity and representation of each skin type within these datasets.Supplementary information: The online version contains material available at 10.1007/s13671-024-00440-0.","PeriodicalId":10838,"journal":{"name":"Current Dermatology Reports","volume":"13 3","pages":"198-210"},"PeriodicalIF":1.4000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343783/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Dermatology Reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13671-024-00440-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/14 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"DERMATOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose of review: Skin type diversity in image datasets refers to the representation of various skin types. This diversity allows for the verification of comparable performance of a trained model across different skin types. A widespread problem in datasets involving human skin is the lack of verifiable diversity in skin types, making it difficult to evaluate whether the performance of the trained models generalizes across different skin types. For example, the diversity issues in skin lesion datasets, which are used to train deep learning-based models, often result in lower accuracy for darker skin types that are typically under-represented in these datasets. Under-representation in datasets results in lower performance in deep learning models for under-represented skin types.

Recent findings: This issue has been discussed in previous works; however, the reporting of skin types, and inherent diversity, have not been fully assessed. Some works report skin types but do not attempt to assess the representation of each skin type in datasets. Others, focusing on skin lesions, identify the issue but do not measure skin type diversity in the datasets examined.

Summary: Effort is needed to address these shortcomings and move towards facilitating verifiable diversity. Building on previous works in skin lesion datasets, this review explores the general issue of skin type diversity by investigating and evaluating skin lesion datasets specifically. The main contributions of this work are an evaluation of publicly available skin lesion datasets and their metadata to assess the frequency and completeness of reporting of skin type and an investigation into the diversity and representation of each skin type within these datasets.

Supplementary information: The online version contains material available at 10.1007/s13671-024-00440-0.

Abstract Image

查看原文本刊更多论文

皮肤病变数据集中的皮肤类型多样性：综述。

审查目的：图像数据集中的皮肤类型多样性是指各种皮肤类型的代表性。这种多样性可以验证训练有素的模型在不同皮肤类型中的可比性能。在涉及人类皮肤的数据集中，一个普遍存在的问题是缺乏可验证的皮肤类型多样性，因此很难评估训练模型的性能是否能在不同皮肤类型中通用。例如，用于训练基于深度学习的模型的皮肤病变数据集的多样性问题往往会导致深色皮肤类型的准确率降低，而深色皮肤类型在这些数据集中通常代表性不足。数据集代表性不足导致深度学习模型对代表性不足的皮肤类型的性能较低：这一问题在以前的作品中已经讨论过；但是，皮肤类型的报告和固有的多样性还没有得到充分的评估。一些作品报告了皮肤类型，但并未尝试评估数据集中每种皮肤类型的代表性。小结：我们需要努力解决这些不足，促进可验证的多样性。在以往皮损数据集工作的基础上，本综述通过专门调查和评估皮损数据集，探讨了肤质类型多样性的一般问题。这项工作的主要贡献是对公开的皮损数据集及其元数据进行评估，以评估皮肤类型报告的频率和完整性，并调查这些数据集中每种皮肤类型的多样性和代表性：在线版本包含 10.1007/s13671-024-00440-0。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Dermatology Reports DERMATOLOGY-

CiteScore

3.30

自引率

6.20%

发文量

期刊介绍： This journal intends to review the most significant recent developments in the field of dermatology. By providing clear, insightful, balanced contributions by expert international authors, the journal aims to serve all those involved in the diagnosis, treatment, management, and prevention of dermatologic conditions. We accomplish this aim by appointing international authorities to serve as Section Editors in key subject areas across the field, such as epidemiology, surgery, pharmacology, clinical trial design, and pediatrics. Section Editors select topics for which leading experts contribute comprehensive review articles that emphasize new developments and recently published papers of major importance, highlighted by annotated reference lists. We also provide commentaries from well-known figures in the field, and an Editorial Board of more than 20 internationally diverse members suggests topics of special interest to their country/region and ensures that topics are current and include emerging research.