{"title":"基于深度网络的腹腔镜视频手术工具检测","authors":"Praveen SR Konduri , G Siva Nageswara Rao","doi":"10.1016/j.knosys.2025.113517","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, deep learning has revolutionized significant advances in image classification, especially in Medical image (MI) processing. Surgical Data Science (SDS) has been developed as a scientific research field that aims to improve the health status of patients. Laparoscopic videos possess a highly significant information source that is integrally present in minimally invasive surgeries. Recognizing surgical tools based on the videos has promoted greater interest because of their significance. In most existing research, single-tool detection is carried out, but multiple-tool recognition is not concentrated well. However, multiple-tool recognition poses numerous challenges, including diverse lighting conditions, the appearance of multiple instruments in different representations, tissue blood, etc. Also, the detection speed of learning methodology is very low because of inherent complexities and improper handling of huge amounts of data. The proposed research introduces a novel DeepNet-Tool for automatic multi-tool classification in laparoscopy videos to address these existing challenges. This paper focuses on solving the spatial-temporal issues in detecting Surgical Tools (STs). The proposed model is implemented in Python, and the overall accuracy is 97.36 % with the Cholec80 dataset, 98.67 % with the EndoVis dataset, 99.73 % on EndoVis and 98.67 % on LapGyn4, respectively. Experimental outcomes of the proposed DeepNet-Tool showed higher effectiveness compared with other deep learning methods on the ST classification task. Thus, the proposed model has revealed the potential for clinical use in accurate ST classification.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"318 ","pages":"Article 113517"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deepnet-based surgical tools detection in laparoscopic videos\",\"authors\":\"Praveen SR Konduri , G Siva Nageswara Rao\",\"doi\":\"10.1016/j.knosys.2025.113517\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recently, deep learning has revolutionized significant advances in image classification, especially in Medical image (MI) processing. Surgical Data Science (SDS) has been developed as a scientific research field that aims to improve the health status of patients. Laparoscopic videos possess a highly significant information source that is integrally present in minimally invasive surgeries. Recognizing surgical tools based on the videos has promoted greater interest because of their significance. In most existing research, single-tool detection is carried out, but multiple-tool recognition is not concentrated well. However, multiple-tool recognition poses numerous challenges, including diverse lighting conditions, the appearance of multiple instruments in different representations, tissue blood, etc. Also, the detection speed of learning methodology is very low because of inherent complexities and improper handling of huge amounts of data. The proposed research introduces a novel DeepNet-Tool for automatic multi-tool classification in laparoscopy videos to address these existing challenges. This paper focuses on solving the spatial-temporal issues in detecting Surgical Tools (STs). The proposed model is implemented in Python, and the overall accuracy is 97.36 % with the Cholec80 dataset, 98.67 % with the EndoVis dataset, 99.73 % on EndoVis and 98.67 % on LapGyn4, respectively. Experimental outcomes of the proposed DeepNet-Tool showed higher effectiveness compared with other deep learning methods on the ST classification task. Thus, the proposed model has revealed the potential for clinical use in accurate ST classification.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"318 \",\"pages\":\"Article 113517\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125005635\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125005635","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Deepnet-based surgical tools detection in laparoscopic videos
Recently, deep learning has revolutionized significant advances in image classification, especially in Medical image (MI) processing. Surgical Data Science (SDS) has been developed as a scientific research field that aims to improve the health status of patients. Laparoscopic videos possess a highly significant information source that is integrally present in minimally invasive surgeries. Recognizing surgical tools based on the videos has promoted greater interest because of their significance. In most existing research, single-tool detection is carried out, but multiple-tool recognition is not concentrated well. However, multiple-tool recognition poses numerous challenges, including diverse lighting conditions, the appearance of multiple instruments in different representations, tissue blood, etc. Also, the detection speed of learning methodology is very low because of inherent complexities and improper handling of huge amounts of data. The proposed research introduces a novel DeepNet-Tool for automatic multi-tool classification in laparoscopy videos to address these existing challenges. This paper focuses on solving the spatial-temporal issues in detecting Surgical Tools (STs). The proposed model is implemented in Python, and the overall accuracy is 97.36 % with the Cholec80 dataset, 98.67 % with the EndoVis dataset, 99.73 % on EndoVis and 98.67 % on LapGyn4, respectively. Experimental outcomes of the proposed DeepNet-Tool showed higher effectiveness compared with other deep learning methods on the ST classification task. Thus, the proposed model has revealed the potential for clinical use in accurate ST classification.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.