Shufflemono: Rethinking Lightweight Network for Self-Supervised Monocular Depth Estimation

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence and Soft Computing Research Pub Date : 2024-06-01 DOI:10.2478/jaiscr-2024-0011

Yingwei Feng, Zhiyong Hong, Liping Xiong, Zhiqiang Zeng, Jingmin Li

{"title":"Shufflemono: Rethinking Lightweight Network for Self-Supervised Monocular Depth Estimation","authors":"Yingwei Feng, Zhiyong Hong, Liping Xiong, Zhiqiang Zeng, Jingmin Li","doi":"10.2478/jaiscr-2024-0011","DOIUrl":null,"url":null,"abstract":"Abstract Self-supervised monocular depth estimation has been widely applied in autonomous driving and automated guided vehicles. It offers the advantages of low cost and extended effective distance compared with alternative methods. However, like automated guided vehicles, devices with limited computing resources struggle to leverage state-of-the-art large model structures. In recent years, researchers have acknowledged this issue and endeavored to reduce model size. Model lightweight techniques aim to decrease the number of parameters while maintaining satisfactory performance. In this paper, to enhance the model’s performance in lightweight scenarios, a novel approach to encompassing three key aspects is proposed: (1) utilizing LeakyReLU to involve more neurons in manifold representation; (2) employing large convolution for improved recognition of edges in lightweight models; (3) applying channel grouping and shuffling to maximize the model efficiency. Experimental results demonstrate that our proposed method achieves satisfactory outcomes on KITTI and Make3D benchmarks while having only 1.6M trainable parameters, representing a reduction of 27% compared with the previous smallest model, Lite-Mono-tiny, in monocular depth estimation.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Artificial Intelligence and Soft Computing Research","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.2478/jaiscr-2024-0011","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Self-supervised monocular depth estimation has been widely applied in autonomous driving and automated guided vehicles. It offers the advantages of low cost and extended effective distance compared with alternative methods. However, like automated guided vehicles, devices with limited computing resources struggle to leverage state-of-the-art large model structures. In recent years, researchers have acknowledged this issue and endeavored to reduce model size. Model lightweight techniques aim to decrease the number of parameters while maintaining satisfactory performance. In this paper, to enhance the model’s performance in lightweight scenarios, a novel approach to encompassing three key aspects is proposed: (1) utilizing LeakyReLU to involve more neurons in manifold representation; (2) employing large convolution for improved recognition of edges in lightweight models; (3) applying channel grouping and shuffling to maximize the model efficiency. Experimental results demonstrate that our proposed method achieves satisfactory outcomes on KITTI and Make3D benchmarks while having only 1.6M trainable parameters, representing a reduction of 27% compared with the previous smallest model, Lite-Mono-tiny, in monocular depth estimation.

查看原文本刊更多论文

Shufflemono：重新思考用于自我监督单目深度估计的轻量级网络

摘要自监督单目深度估计已广泛应用于自动驾驶和自动制导车辆。与其他方法相比，它具有成本低、有效距离长等优点。然而，与自动导航车辆一样，计算资源有限的设备很难利用最先进的大型模型结构。近年来，研究人员已经意识到这一问题，并努力缩小模型尺寸。模型轻量化技术旨在减少参数数量，同时保持令人满意的性能。为了提高模型在轻量级场景中的性能，本文提出了一种包含三个关键方面的新方法：(1) 利用 LeakyReLU 让更多神经元参与流形表示；(2) 利用大卷积提高轻量级模型中边缘的识别能力；(3) 应用通道分组和洗牌最大限度地提高模型效率。实验结果表明，我们提出的方法在 KITTI 和 Make3D 基准测试中取得了令人满意的结果，同时仅有 160 万个可训练参数，与之前最小的单目深度估计模型 Lite-Mono-tiny 相比，减少了 27%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Artificial Intelligence and Soft Computing Research COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

7.00

自引率

25.00%

发文量

审稿时长

24 weeks

期刊介绍： Journal of Artificial Intelligence and Soft Computing Research (available also at Sciendo (De Gruyter)) is a dynamically developing international journal focused on the latest scientific results and methods constituting traditional artificial intelligence methods and soft computing techniques. Our goal is to bring together scientists representing both approaches and various research communities.