Fiona R Kolbinger, Max Kirchner, Kevin Pfeiffer, Sebastian Bodenstedt, Alexander C Jenke, Julia Barthel, Matthias Carstens, Karolin Dehlke, Sophia Dietz, Sotirios Emmanouilidis, Guido Fitze, Martin Freitag, Fabian Holderried, Thorsten Jacobi, Weam Kanjo, Linda Leitermann, Sören Torge Mees, Steffen Pistorius, Conrad Prudlo, Astrid Seiberth, Jurek Schultz, Karolin Thiel, Daniel Ziehn, Stefanie Speidel, Jürgen Weitz, Jakob Nikolas Kather, Marius Distler, Oliver Lester Saldanha
{"title":"一个用于计算建模任务的多机构腹腔镜阑尾切除术视频数据集。","authors":"Fiona R Kolbinger, Max Kirchner, Kevin Pfeiffer, Sebastian Bodenstedt, Alexander C Jenke, Julia Barthel, Matthias Carstens, Karolin Dehlke, Sophia Dietz, Sotirios Emmanouilidis, Guido Fitze, Martin Freitag, Fabian Holderried, Thorsten Jacobi, Weam Kanjo, Linda Leitermann, Sören Torge Mees, Steffen Pistorius, Conrad Prudlo, Astrid Seiberth, Jurek Schultz, Karolin Thiel, Daniel Ziehn, Stefanie Speidel, Jürgen Weitz, Jakob Nikolas Kather, Marius Distler, Oliver Lester Saldanha","doi":"10.1101/2025.09.05.25335174","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The limited availability of diverse and representative training data poses a critical barrier to the development of clinically relevant computational tools for intraoperative surgical decision support. Surgical procedures are not routinely recorded, and annotation requires domain expertise, resulting in a scarcity of open-access surgical video datasets with high-quality annotations. Existing datasets are typically limited to single institutions and specific procedures, such as cholecystectomy, and rarely comprise patient-level metadata like demographic characteristics, disease history, or laboratory parameters.</p><p><strong>Methods: </strong>The Appendix300 dataset comprises 330 laparoscopic surgery recordings, including 325 full-length laparoscopic appendectomies and 5 control recordings from non-appendectomy procedures in pediatric and adult patients treated at five German centers. The dataset includes patient-level clinical metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), as well as standardized expert annotations of the laparoscopic grade of appendicitis.</p><p><strong>Results: </strong>Appendix300 currently represents the largest publicly available collection of surgical video data with patient metadata and the first curated dataset of laparoscopic appendectomies. It enables novel validation tasks for computer vision in surgery, including the classification of appendicitis severity and the detection of appendiceal perforation. Technical validation of the laparoscopic appendicitis grade annotations showed substantial interrater agreement (weighted Cohen's κ = 0.615).</p><p><strong>Conclusion: </strong>The Appendix300 dataset expands the scope of surgical data science by integrating video data with clinical and pathological metadata across institutions. It enables new and clinically relevant patient-level validation tasks for computer vision in laparoscopic surgery and facilitates decentralized learning approaches, overall enhancing the breadth and translational relevance of AI-based surgical video analysis.</p><p><strong>Dataset description: </strong>Appendix300 is a multi-institutional dataset comprising 330 laparoscopic surgery recordings, including 325 appendectomies and 5 control cases, detailed patient-level metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), and expert annotations of appendicitis severity. It enables novel validation tasks for surgical AI, such as inflammation grading and perforation detection, and supports decentralized learning across diverse patient populations.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12424895/pdf/","citationCount":"0","resultStr":"{\"title\":\"Appendix300: A multi-institutional laparoscopic appendectomy video dataset for computational modeling tasks.\",\"authors\":\"Fiona R Kolbinger, Max Kirchner, Kevin Pfeiffer, Sebastian Bodenstedt, Alexander C Jenke, Julia Barthel, Matthias Carstens, Karolin Dehlke, Sophia Dietz, Sotirios Emmanouilidis, Guido Fitze, Martin Freitag, Fabian Holderried, Thorsten Jacobi, Weam Kanjo, Linda Leitermann, Sören Torge Mees, Steffen Pistorius, Conrad Prudlo, Astrid Seiberth, Jurek Schultz, Karolin Thiel, Daniel Ziehn, Stefanie Speidel, Jürgen Weitz, Jakob Nikolas Kather, Marius Distler, Oliver Lester Saldanha\",\"doi\":\"10.1101/2025.09.05.25335174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The limited availability of diverse and representative training data poses a critical barrier to the development of clinically relevant computational tools for intraoperative surgical decision support. Surgical procedures are not routinely recorded, and annotation requires domain expertise, resulting in a scarcity of open-access surgical video datasets with high-quality annotations. Existing datasets are typically limited to single institutions and specific procedures, such as cholecystectomy, and rarely comprise patient-level metadata like demographic characteristics, disease history, or laboratory parameters.</p><p><strong>Methods: </strong>The Appendix300 dataset comprises 330 laparoscopic surgery recordings, including 325 full-length laparoscopic appendectomies and 5 control recordings from non-appendectomy procedures in pediatric and adult patients treated at five German centers. The dataset includes patient-level clinical metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), as well as standardized expert annotations of the laparoscopic grade of appendicitis.</p><p><strong>Results: </strong>Appendix300 currently represents the largest publicly available collection of surgical video data with patient metadata and the first curated dataset of laparoscopic appendectomies. It enables novel validation tasks for computer vision in surgery, including the classification of appendicitis severity and the detection of appendiceal perforation. Technical validation of the laparoscopic appendicitis grade annotations showed substantial interrater agreement (weighted Cohen's κ = 0.615).</p><p><strong>Conclusion: </strong>The Appendix300 dataset expands the scope of surgical data science by integrating video data with clinical and pathological metadata across institutions. It enables new and clinically relevant patient-level validation tasks for computer vision in laparoscopic surgery and facilitates decentralized learning approaches, overall enhancing the breadth and translational relevance of AI-based surgical video analysis.</p><p><strong>Dataset description: </strong>Appendix300 is a multi-institutional dataset comprising 330 laparoscopic surgery recordings, including 325 appendectomies and 5 control cases, detailed patient-level metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), and expert annotations of appendicitis severity. It enables novel validation tasks for surgical AI, such as inflammation grading and perforation detection, and supports decentralized learning across diverse patient populations.</p>\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12424895/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.09.05.25335174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.09.05.25335174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Appendix300: A multi-institutional laparoscopic appendectomy video dataset for computational modeling tasks.
Background: The limited availability of diverse and representative training data poses a critical barrier to the development of clinically relevant computational tools for intraoperative surgical decision support. Surgical procedures are not routinely recorded, and annotation requires domain expertise, resulting in a scarcity of open-access surgical video datasets with high-quality annotations. Existing datasets are typically limited to single institutions and specific procedures, such as cholecystectomy, and rarely comprise patient-level metadata like demographic characteristics, disease history, or laboratory parameters.
Methods: The Appendix300 dataset comprises 330 laparoscopic surgery recordings, including 325 full-length laparoscopic appendectomies and 5 control recordings from non-appendectomy procedures in pediatric and adult patients treated at five German centers. The dataset includes patient-level clinical metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), as well as standardized expert annotations of the laparoscopic grade of appendicitis.
Results: Appendix300 currently represents the largest publicly available collection of surgical video data with patient metadata and the first curated dataset of laparoscopic appendectomies. It enables novel validation tasks for computer vision in surgery, including the classification of appendicitis severity and the detection of appendiceal perforation. Technical validation of the laparoscopic appendicitis grade annotations showed substantial interrater agreement (weighted Cohen's κ = 0.615).
Conclusion: The Appendix300 dataset expands the scope of surgical data science by integrating video data with clinical and pathological metadata across institutions. It enables new and clinically relevant patient-level validation tasks for computer vision in laparoscopic surgery and facilitates decentralized learning approaches, overall enhancing the breadth and translational relevance of AI-based surgical video analysis.
Dataset description: Appendix300 is a multi-institutional dataset comprising 330 laparoscopic surgery recordings, including 325 appendectomies and 5 control cases, detailed patient-level metadata (demographics, medical history, clinical symptoms, laboratory parameters, and histopathological findings), and expert annotations of appendicitis severity. It enables novel validation tasks for surgical AI, such as inflammation grading and perforation detection, and supports decentralized learning across diverse patient populations.