用于病灶分割的大规模皮肤病理学数据集:模型开发和分析。

IF 2.3 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Yosep Chong, Daseul Park, Youngbin Ahn, Yoonjin Kwak, Seyeon Park, Seung Wan Back, Changwoo Lee, Gyeongsin Park, Mohammad Rizwan Alam, Binna Kim, Kee-Taek Jang, Nayoung Han, Chong Woo Yoo, Jonghyuck Lee, Cheol Lee, Young-Gon Kim
{"title":"用于病灶分割的大规模皮肤病理学数据集:模型开发和分析。","authors":"Yosep Chong, Daseul Park, Youngbin Ahn, Yoonjin Kwak, Seyeon Park, Seung Wan Back, Changwoo Lee, Gyeongsin Park, Mohammad Rizwan Alam, Binna Kim, Kee-Taek Jang, Nayoung Han, Chong Woo Yoo, Jonghyuck Lee, Cheol Lee, Young-Gon Kim","doi":"10.3346/jkms.2025.40.e220","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses. However, large-scale skin pathology image datasets for AI learning are often insufficient or limited to specific diseases. This study aimed to build and assess a large-scale dermatopathology image dataset for an AI model.</p><p><strong>Methods: </strong>We trained and evaluated a lesion segmentation model based on this dataset, which consisted of over 34,376 histopathology slide images collected from four institutions, including normal skin and six types of common skin lesion: epidermal cysts, seborrheic keratosis, Bowen disease/squamous cell carcinoma, basal cell carcinoma, melanocytic nevus, and malignant melanoma. Each image was accompanied by labeled data consisting of lesion area annotations and clinical information. To ensure the high quality and accuracy of the dataset, we employed data quality management methods, including syntactic accuracy, semantic accuracy, statistical diversity, and validity evaluation.</p><p><strong>Results: </strong>The results of the dataset quality assessment confirmed high quality, with syntactic accuracy and semantic accuracy at 0.99 and 0.95, respectively. Statistical diversity was verified to follow a natural distribution. The validity evaluation verified the strong performance of the segmentation model for each group of data, with a Dice score ranging from 80% to 91%.</p><p><strong>Conclusion: </strong>The results demonstrated that our constructed dataset provides a well-suited resource for deep learning training, offering a large-scale multi-institutional dermatopathology dataset that can drive advancements in AI-driven dermatopathology diagnosis.</p>","PeriodicalId":16249,"journal":{"name":"Journal of Korean Medical Science","volume":"40 35","pages":"e220"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12418205/pdf/","citationCount":"0","resultStr":"{\"title\":\"Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.\",\"authors\":\"Yosep Chong, Daseul Park, Youngbin Ahn, Yoonjin Kwak, Seyeon Park, Seung Wan Back, Changwoo Lee, Gyeongsin Park, Mohammad Rizwan Alam, Binna Kim, Kee-Taek Jang, Nayoung Han, Chong Woo Yoo, Jonghyuck Lee, Cheol Lee, Young-Gon Kim\",\"doi\":\"10.3346/jkms.2025.40.e220\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses. However, large-scale skin pathology image datasets for AI learning are often insufficient or limited to specific diseases. This study aimed to build and assess a large-scale dermatopathology image dataset for an AI model.</p><p><strong>Methods: </strong>We trained and evaluated a lesion segmentation model based on this dataset, which consisted of over 34,376 histopathology slide images collected from four institutions, including normal skin and six types of common skin lesion: epidermal cysts, seborrheic keratosis, Bowen disease/squamous cell carcinoma, basal cell carcinoma, melanocytic nevus, and malignant melanoma. Each image was accompanied by labeled data consisting of lesion area annotations and clinical information. To ensure the high quality and accuracy of the dataset, we employed data quality management methods, including syntactic accuracy, semantic accuracy, statistical diversity, and validity evaluation.</p><p><strong>Results: </strong>The results of the dataset quality assessment confirmed high quality, with syntactic accuracy and semantic accuracy at 0.99 and 0.95, respectively. Statistical diversity was verified to follow a natural distribution. The validity evaluation verified the strong performance of the segmentation model for each group of data, with a Dice score ranging from 80% to 91%.</p><p><strong>Conclusion: </strong>The results demonstrated that our constructed dataset provides a well-suited resource for deep learning training, offering a large-scale multi-institutional dermatopathology dataset that can drive advancements in AI-driven dermatopathology diagnosis.</p>\",\"PeriodicalId\":16249,\"journal\":{\"name\":\"Journal of Korean Medical Science\",\"volume\":\"40 35\",\"pages\":\"e220\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12418205/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Korean Medical Science\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3346/jkms.2025.40.e220\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Medical Science","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3346/jkms.2025.40.e220","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

摘要

背景:随着皮肤癌发病率的增加,病理学家的工作量急剧增加。与其他器官样本相比,皮肤样本的诊断,特别是复杂病变,如恶性黑色素瘤和黑素细胞病变,显示出更高的诊断变异性。因此,越来越需要基于人工智能(AI)的诊断辅助程序来支持皮肤科医生实现更一致的诊断。然而,用于人工智能学习的大规模皮肤病理图像数据集往往不足或仅限于特定疾病。本研究旨在为人工智能模型建立和评估大规模皮肤病理图像数据集。方法:我们训练并评估了基于该数据集的病变分割模型,该数据集由来自四个机构的34,376张组织病理学切片图像组成,包括正常皮肤和六种常见皮肤病变:表皮囊肿、脂溢性角化病、Bowen病/鳞状细胞癌、基底细胞癌、黑素细胞痣和恶性黑色素瘤。每张图像都附有标记数据,包括病变区域注释和临床信息。为了保证数据集的高质量和准确性,我们采用了数据质量管理方法,包括句法准确性、语义准确性、统计多样性和有效性评估。结果:数据集质量评估结果证实了高质量,句法准确度和语义准确度分别为0.99和0.95。统计多样性被证实遵循自然分布。有效性评估验证了分割模型对每组数据的强大性能,Dice得分在80%到91%之间。结论:结果表明,我们构建的数据集为深度学习训练提供了一个非常合适的资源,提供了一个大规模的多机构皮肤病理学数据集,可以推动人工智能驱动的皮肤病理学诊断的进步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.

Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.

Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.

Large-Scale Dermatopathology Dataset for Lesion Segmentation: Model Development and Analysis.

Background: With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses. However, large-scale skin pathology image datasets for AI learning are often insufficient or limited to specific diseases. This study aimed to build and assess a large-scale dermatopathology image dataset for an AI model.

Methods: We trained and evaluated a lesion segmentation model based on this dataset, which consisted of over 34,376 histopathology slide images collected from four institutions, including normal skin and six types of common skin lesion: epidermal cysts, seborrheic keratosis, Bowen disease/squamous cell carcinoma, basal cell carcinoma, melanocytic nevus, and malignant melanoma. Each image was accompanied by labeled data consisting of lesion area annotations and clinical information. To ensure the high quality and accuracy of the dataset, we employed data quality management methods, including syntactic accuracy, semantic accuracy, statistical diversity, and validity evaluation.

Results: The results of the dataset quality assessment confirmed high quality, with syntactic accuracy and semantic accuracy at 0.99 and 0.95, respectively. Statistical diversity was verified to follow a natural distribution. The validity evaluation verified the strong performance of the segmentation model for each group of data, with a Dice score ranging from 80% to 91%.

Conclusion: The results demonstrated that our constructed dataset provides a well-suited resource for deep learning training, offering a large-scale multi-institutional dermatopathology dataset that can drive advancements in AI-driven dermatopathology diagnosis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Korean Medical Science
Journal of Korean Medical Science 医学-医学:内科
CiteScore
7.80
自引率
8.90%
发文量
320
审稿时长
3-6 weeks
期刊介绍: The Journal of Korean Medical Science (JKMS) is an international, peer-reviewed Open Access journal of medicine published weekly in English. The Journal’s publisher is the Korean Academy of Medical Sciences (KAMS), Korean Medical Association (KMA). JKMS aims to publish evidence-based, scientific research articles from various disciplines of the medical sciences. The Journal welcomes articles of general interest to medical researchers especially when they contain original information. Articles on the clinical evaluation of drugs and other therapies, epidemiologic studies of the general population, studies on pathogenic organisms and toxic materials, and the toxicities and adverse effects of therapeutics are welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信