{"title":"Skin Type Diversity in Skin Lesion Datasets: A Review.","authors":"Neda Alipour, Ted Burke, Jane Courtney","doi":"10.1007/s13671-024-00440-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose of review: </strong>Skin type diversity in image datasets refers to the representation of various skin types. This diversity allows for the verification of comparable performance of a trained model across different skin types. A widespread problem in datasets involving human skin is the lack of verifiable diversity in skin types, making it difficult to evaluate whether the performance of the trained models generalizes across different skin types. For example, the diversity issues in skin lesion datasets, which are used to train deep learning-based models, often result in lower accuracy for darker skin types that are typically under-represented in these datasets. Under-representation in datasets results in lower performance in deep learning models for under-represented skin types.</p><p><strong>Recent findings: </strong>This issue has been discussed in previous works; however, the reporting of skin types, and inherent diversity, have not been fully assessed. Some works report skin types but do not attempt to assess the representation of each skin type in datasets. Others, focusing on skin lesions, identify the issue but do not measure skin type diversity in the datasets examined.</p><p><strong>Summary: </strong>Effort is needed to address these shortcomings and move towards facilitating verifiable diversity. Building on previous works in skin lesion datasets, this review explores the general issue of skin type diversity by investigating and evaluating skin lesion datasets specifically. The main contributions of this work are an evaluation of publicly available skin lesion datasets and their metadata to assess the frequency and completeness of reporting of skin type and an investigation into the diversity and representation of each skin type within these datasets.</p><p><strong>Supplementary information: </strong>The online version contains material available at 10.1007/s13671-024-00440-0.</p>","PeriodicalId":10838,"journal":{"name":"Current Dermatology Reports","volume":"13 3","pages":"198-210"},"PeriodicalIF":2.4000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343783/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Dermatology Reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13671-024-00440-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/14 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose of review: Skin type diversity in image datasets refers to the representation of various skin types. This diversity allows for the verification of comparable performance of a trained model across different skin types. A widespread problem in datasets involving human skin is the lack of verifiable diversity in skin types, making it difficult to evaluate whether the performance of the trained models generalizes across different skin types. For example, the diversity issues in skin lesion datasets, which are used to train deep learning-based models, often result in lower accuracy for darker skin types that are typically under-represented in these datasets. Under-representation in datasets results in lower performance in deep learning models for under-represented skin types.
Recent findings: This issue has been discussed in previous works; however, the reporting of skin types, and inherent diversity, have not been fully assessed. Some works report skin types but do not attempt to assess the representation of each skin type in datasets. Others, focusing on skin lesions, identify the issue but do not measure skin type diversity in the datasets examined.
Summary: Effort is needed to address these shortcomings and move towards facilitating verifiable diversity. Building on previous works in skin lesion datasets, this review explores the general issue of skin type diversity by investigating and evaluating skin lesion datasets specifically. The main contributions of this work are an evaluation of publicly available skin lesion datasets and their metadata to assess the frequency and completeness of reporting of skin type and an investigation into the diversity and representation of each skin type within these datasets.
Supplementary information: The online version contains material available at 10.1007/s13671-024-00440-0.
期刊介绍:
This journal intends to review the most significant recent developments in the field of dermatology. By providing clear, insightful, balanced contributions by expert international authors, the journal aims to serve all those involved in the diagnosis, treatment, management, and prevention of dermatologic conditions. We accomplish this aim by appointing international authorities to serve as Section Editors in key subject areas across the field, such as epidemiology, surgery, pharmacology, clinical trial design, and pediatrics. Section Editors select topics for which leading experts contribute comprehensive review articles that emphasize new developments and recently published papers of major importance, highlighted by annotated reference lists. We also provide commentaries from well-known figures in the field, and an Editorial Board of more than 20 internationally diverse members suggests topics of special interest to their country/region and ensures that topics are current and include emerging research.