{"title":"早期胃癌和癌前病变的完整注释病理切片数据集。","authors":"Chunbao Wang, Jiusong Ge, Yi Niu, Caixia Ding, Yangyang Fan, Hongyun Chang, Zhe Yang, Caihong Ran, Xiali Teng, Xiaolin Wang, Lianlian Wu, Zeyu Gao, Chen Li","doi":"10.1038/s41597-025-05679-1","DOIUrl":null,"url":null,"abstract":"<p><p>Gastric cancer, a significant global health concern, exhibits high morbidity and mortality, especially in advanced stages. Timely diagnosis and intervention are crucial for improving patient outcomes, with Endoscopic Submucosal Dissection (ESD) playing a pivotal role in precise, minimally invasive early-stage treatments. Despite its importance, challenges include significant interobserver variability among pathologists and the intensive labor required for detailed pathological analysis of ESD specimens impede optimal outcomes. Artificial Intelligence (AI) offers promising solutions to these challenges, yet its advancement is limited by the scarcity of comprehensive, annotated pathological datasets. In this paper, we curate a fully annotated pathology slide dataset for ESD specimen examination. This dataset not only poses a challenging task for the computational pathology field but also enables precise detection of precancerous stages and accurate quantification of lesion distribution in patients with early-stage gastric lesions. Furthermore, it enhances the correlation between endoscopic findings and pathological interpretations, thereby advancing precision medicine strategies in the prevention and treatment of early gastric cancer.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1326"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12311038/pdf/","citationCount":"0","resultStr":"{\"title\":\"A fully annotated pathology slide dataset for early gastric cancer and precancerous lesions.\",\"authors\":\"Chunbao Wang, Jiusong Ge, Yi Niu, Caixia Ding, Yangyang Fan, Hongyun Chang, Zhe Yang, Caihong Ran, Xiali Teng, Xiaolin Wang, Lianlian Wu, Zeyu Gao, Chen Li\",\"doi\":\"10.1038/s41597-025-05679-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Gastric cancer, a significant global health concern, exhibits high morbidity and mortality, especially in advanced stages. Timely diagnosis and intervention are crucial for improving patient outcomes, with Endoscopic Submucosal Dissection (ESD) playing a pivotal role in precise, minimally invasive early-stage treatments. Despite its importance, challenges include significant interobserver variability among pathologists and the intensive labor required for detailed pathological analysis of ESD specimens impede optimal outcomes. Artificial Intelligence (AI) offers promising solutions to these challenges, yet its advancement is limited by the scarcity of comprehensive, annotated pathological datasets. In this paper, we curate a fully annotated pathology slide dataset for ESD specimen examination. This dataset not only poses a challenging task for the computational pathology field but also enables precise detection of precancerous stages and accurate quantification of lesion distribution in patients with early-stage gastric lesions. Furthermore, it enhances the correlation between endoscopic findings and pathological interpretations, thereby advancing precision medicine strategies in the prevention and treatment of early gastric cancer.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"12 1\",\"pages\":\"1326\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12311038/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-025-05679-1\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-05679-1","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A fully annotated pathology slide dataset for early gastric cancer and precancerous lesions.
Gastric cancer, a significant global health concern, exhibits high morbidity and mortality, especially in advanced stages. Timely diagnosis and intervention are crucial for improving patient outcomes, with Endoscopic Submucosal Dissection (ESD) playing a pivotal role in precise, minimally invasive early-stage treatments. Despite its importance, challenges include significant interobserver variability among pathologists and the intensive labor required for detailed pathological analysis of ESD specimens impede optimal outcomes. Artificial Intelligence (AI) offers promising solutions to these challenges, yet its advancement is limited by the scarcity of comprehensive, annotated pathological datasets. In this paper, we curate a fully annotated pathology slide dataset for ESD specimen examination. This dataset not only poses a challenging task for the computational pathology field but also enables precise detection of precancerous stages and accurate quantification of lesion distribution in patients with early-stage gastric lesions. Furthermore, it enhances the correlation between endoscopic findings and pathological interpretations, thereby advancing precision medicine strategies in the prevention and treatment of early gastric cancer.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.