Arthur Derathé, Fabian Reche, Sylvain Guy, Katia Charrière, Bertrand Trilling, Pierre Jannin, Alexandre Moreau-Gaudry, Bernard Gibaud, Sandrine Voros
{"title":"LapEx:一个新的多模态数据集,用于腹腔镜手术的上下文识别和实践评估。","authors":"Arthur Derathé, Fabian Reche, Sylvain Guy, Katia Charrière, Bertrand Trilling, Pierre Jannin, Alexandre Moreau-Gaudry, Bernard Gibaud, Sandrine Voros","doi":"10.1038/s41597-025-04588-7","DOIUrl":null,"url":null,"abstract":"<p><p>In Surgical Data Science (SDS), there is an increasing demand for large, realistic annotated datasets to facilitate the development of machine learning techniques. However, in laparoscopic surgery, most publicly available datasets focus on low-granularity procedural annotations (such as phases or steps) and image segmentation of instruments or specific organs, often using animal models that lack clinical realism. Furthermore, annotation variability is seldom evaluated. In this work, we compiled 30 sleeve gastrectomy procedures and performed three levels of annotations for a specific step of this procedure (the fundus dissection): a procedural annotation of fine-grained activities, a semantic segmentation of the laparoscopic images, and the assessment of a surgical skill, specifically the quality of exposition of the surgical scene. We also conducted a comprehensive annotation variability analysis, highlighting the complexity of these tasks and providing a baseline for evaluating machine learning models. The dataset is publicly available and serves as a valuable resource for advancing SDS research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"342"},"PeriodicalIF":6.9000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11865446/pdf/","citationCount":"0","resultStr":"{\"title\":\"LapEx: A new multimodal dataset for context recognition and practice assessment in laparoscopic surgery.\",\"authors\":\"Arthur Derathé, Fabian Reche, Sylvain Guy, Katia Charrière, Bertrand Trilling, Pierre Jannin, Alexandre Moreau-Gaudry, Bernard Gibaud, Sandrine Voros\",\"doi\":\"10.1038/s41597-025-04588-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In Surgical Data Science (SDS), there is an increasing demand for large, realistic annotated datasets to facilitate the development of machine learning techniques. However, in laparoscopic surgery, most publicly available datasets focus on low-granularity procedural annotations (such as phases or steps) and image segmentation of instruments or specific organs, often using animal models that lack clinical realism. Furthermore, annotation variability is seldom evaluated. In this work, we compiled 30 sleeve gastrectomy procedures and performed three levels of annotations for a specific step of this procedure (the fundus dissection): a procedural annotation of fine-grained activities, a semantic segmentation of the laparoscopic images, and the assessment of a surgical skill, specifically the quality of exposition of the surgical scene. We also conducted a comprehensive annotation variability analysis, highlighting the complexity of these tasks and providing a baseline for evaluating machine learning models. The dataset is publicly available and serves as a valuable resource for advancing SDS research.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"12 1\",\"pages\":\"342\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11865446/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-025-04588-7\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-04588-7","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
LapEx: A new multimodal dataset for context recognition and practice assessment in laparoscopic surgery.
In Surgical Data Science (SDS), there is an increasing demand for large, realistic annotated datasets to facilitate the development of machine learning techniques. However, in laparoscopic surgery, most publicly available datasets focus on low-granularity procedural annotations (such as phases or steps) and image segmentation of instruments or specific organs, often using animal models that lack clinical realism. Furthermore, annotation variability is seldom evaluated. In this work, we compiled 30 sleeve gastrectomy procedures and performed three levels of annotations for a specific step of this procedure (the fundus dissection): a procedural annotation of fine-grained activities, a semantic segmentation of the laparoscopic images, and the assessment of a surgical skill, specifically the quality of exposition of the surgical scene. We also conducted a comprehensive annotation variability analysis, highlighting the complexity of these tasks and providing a baseline for evaluating machine learning models. The dataset is publicly available and serves as a valuable resource for advancing SDS research.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.