Olivier Zanier , Aron Alakmeh , Raffaele Da Mutten , Alessandro Carretta , Matteo Zoli , Diego Mazzatenta , Carlo Serra , Luca Regli , Victor E. Staartjes
{"title":"Real-time intraoperative depth estimation in transsphenoidal surgery using deep learning: A feasibility study","authors":"Olivier Zanier , Aron Alakmeh , Raffaele Da Mutten , Alessandro Carretta , Matteo Zoli , Diego Mazzatenta , Carlo Serra , Luca Regli , Victor E. Staartjes","doi":"10.1016/j.jocn.2026.111910","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Endoscopic endonasal and transcranial approaches are used for the resection of various pathological lesions in neurosurgery, especially pituitary adenomas, craniopharyngiomas, chordomas, or meningiomas. The video feed provided by endoscopes is generally two-dimensional, which can hinder depth perception. Thus, generating three-dimensional imaging without the need for special endoscopes using deep learning might be beneficial for enhanced intraoperative orientation.</div></div><div><h3>Methods</h3><div>DINOv2 is a pre-trained deep-learning model published by Meta in 2023. One of its capabilities is to estimate the depth in two-dimensional images. In this study, we explore the application of DINOv2 to the video feed of eight transsphenoidal endonasal surgeries. The results were evaluated for quality by both a senior neurosurgeon and a resident neurosurgeon. Furthermore, depth estimations from a randomly selected subset of 488 images taken from the videos were semi-quantitatively compared against manual segmentations for the estimation of deep, intermediate, and superficial areas.</div></div><div><h3>Results</h3><div>Using DINOv2, numeric depth maps were generated, and colormaps were created for depth visualization. Although these colormaps were not perfect, they aligned well with the subjective assessment of depth in the video feed by a senior neurosurgeon as well as a resident neurosurgeon. Semi-quantitative validation of the model’s estimations yielded a mean overall DICE Similarity Index of 0.48. These semi-quantitative results should be interpreted with caution, as the cutoffs used for model depth predictions and manual segmentation are not standardized.</div></div><div><h3>Conclusions</h3><div>Through the application of DINOv2, we were able to estimate depth in endoscopic imaging from transsphenoidal endonasal surgeries by generating numeric maps and depth colormaps. This illustrates the potential of deep learning-based depth estimations, which in the future could contribute to improving intraoperative orientation. It also highlights the opportunities in using artificial intelligence to augment endoscopic video feeds.</div></div>","PeriodicalId":15487,"journal":{"name":"Journal of Clinical Neuroscience","volume":"147 ","pages":"Article 111910"},"PeriodicalIF":1.8000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0967586826000615","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/14 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
Endoscopic endonasal and transcranial approaches are used for the resection of various pathological lesions in neurosurgery, especially pituitary adenomas, craniopharyngiomas, chordomas, or meningiomas. The video feed provided by endoscopes is generally two-dimensional, which can hinder depth perception. Thus, generating three-dimensional imaging without the need for special endoscopes using deep learning might be beneficial for enhanced intraoperative orientation.
Methods
DINOv2 is a pre-trained deep-learning model published by Meta in 2023. One of its capabilities is to estimate the depth in two-dimensional images. In this study, we explore the application of DINOv2 to the video feed of eight transsphenoidal endonasal surgeries. The results were evaluated for quality by both a senior neurosurgeon and a resident neurosurgeon. Furthermore, depth estimations from a randomly selected subset of 488 images taken from the videos were semi-quantitatively compared against manual segmentations for the estimation of deep, intermediate, and superficial areas.
Results
Using DINOv2, numeric depth maps were generated, and colormaps were created for depth visualization. Although these colormaps were not perfect, they aligned well with the subjective assessment of depth in the video feed by a senior neurosurgeon as well as a resident neurosurgeon. Semi-quantitative validation of the model’s estimations yielded a mean overall DICE Similarity Index of 0.48. These semi-quantitative results should be interpreted with caution, as the cutoffs used for model depth predictions and manual segmentation are not standardized.
Conclusions
Through the application of DINOv2, we were able to estimate depth in endoscopic imaging from transsphenoidal endonasal surgeries by generating numeric maps and depth colormaps. This illustrates the potential of deep learning-based depth estimations, which in the future could contribute to improving intraoperative orientation. It also highlights the opportunities in using artificial intelligence to augment endoscopic video feeds.
期刊介绍:
This International journal, Journal of Clinical Neuroscience, publishes articles on clinical neurosurgery and neurology and the related neurosciences such as neuro-pathology, neuro-radiology, neuro-ophthalmology and neuro-physiology.
The journal has a broad International perspective, and emphasises the advances occurring in Asia, the Pacific Rim region, Europe and North America. The Journal acts as a focus for publication of major clinical and laboratory research, as well as publishing solicited manuscripts on specific subjects from experts, case reports and other information of interest to clinicians working in the clinical neurosciences.