Yanyun Pu;Chengyuan Zhu;Gongxin Yao;Yu Pan;Kaixiang Yang;Qinmin Yang
{"title":"Mamba-Unet-Depth: Enhancing Long-Range Dependency for Photon-Efficient Imaging","authors":"Yanyun Pu;Chengyuan Zhu;Gongxin Yao;Yu Pan;Kaixiang Yang;Qinmin Yang","doi":"10.1109/TCSII.2025.3580670","DOIUrl":null,"url":null,"abstract":"With the rapid development of single-photon LiDAR, accurate depth recovery remains a key challenge. Conventional deep learning methods, such as CNNs and ViTs, leverage convolution and self-attention to extract local and global features, respectively. However, these models struggle to capture long-range dependencies in depth images, especially under low signal-to-background ratio (SBR) conditions. To address this, we propose Mamba-Unet-Depth, a novel network inspired by the Mamba architecture, which models long sequences and global context efficiently. By combining the hierarchical representation capability of U-Net with Mamba’s sequential modeling strength, the proposed model uses skip connections to retain spatial details across scales, facilitating richer feature learning. This enables more effective extraction of both fine-grained and contextual depth cues in challenging LiDAR data. Experimental results on the NYU Depth v2 dataset show that Mamba-Unet-Depth outperforms existing baselines in depth prediction accuracy and robustness, achieving state-of-the-art performance.","PeriodicalId":13101,"journal":{"name":"IEEE Transactions on Circuits and Systems II: Express Briefs","volume":"72 8","pages":"1123-1127"},"PeriodicalIF":4.9000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems II: Express Briefs","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11039709/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
With the rapid development of single-photon LiDAR, accurate depth recovery remains a key challenge. Conventional deep learning methods, such as CNNs and ViTs, leverage convolution and self-attention to extract local and global features, respectively. However, these models struggle to capture long-range dependencies in depth images, especially under low signal-to-background ratio (SBR) conditions. To address this, we propose Mamba-Unet-Depth, a novel network inspired by the Mamba architecture, which models long sequences and global context efficiently. By combining the hierarchical representation capability of U-Net with Mamba’s sequential modeling strength, the proposed model uses skip connections to retain spatial details across scales, facilitating richer feature learning. This enables more effective extraction of both fine-grained and contextual depth cues in challenging LiDAR data. Experimental results on the NYU Depth v2 dataset show that Mamba-Unet-Depth outperforms existing baselines in depth prediction accuracy and robustness, achieving state-of-the-art performance.
期刊介绍:
TCAS II publishes brief papers in the field specified by the theory, analysis, design, and practical implementations of circuits, and the application of circuit techniques to systems and to signal processing. Included is the whole spectrum from basic scientific theory to industrial applications. The field of interest covered includes:
Circuits: Analog, Digital and Mixed Signal Circuits and Systems
Nonlinear Circuits and Systems, Integrated Sensors, MEMS and Systems on Chip, Nanoscale Circuits and Systems, Optoelectronic
Circuits and Systems, Power Electronics and Systems
Software for Analog-and-Logic Circuits and Systems
Control aspects of Circuits and Systems.