ModelNet-O：用于遮挡感知点云分类的大规模合成数据集

IF 4.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Computer Vision and Image Understanding Pub Date : 2024-06-19 DOI:10.1016/j.cviu.2024.104060

Zhongbin Fang , Xia Li , Xiangtai Li , Shen Zhao , Mengyuan Liu

{"title":"ModelNet-O：用于遮挡感知点云分类的大规模合成数据集","authors":"Zhongbin Fang , Xia Li , Xiangtai Li , Shen Zhao , Mengyuan Liu","doi":"10.1016/j.cviu.2024.104060","DOIUrl":null,"url":null,"abstract":"<div><p>Recently, 3D point cloud classification has made significant progress with the help of many datasets. However, these datasets do not reflect the incomplete nature of real-world point clouds caused by <strong><em>occlusion</em></strong>, which limits the practical application of current methods. To bridge this gap, we propose ModelNet-O, <strong>a large-scale synthetic dataset of 123,041 samples</strong> that emulates real-world point clouds with self-occlusion caused by scanning from monocular cameras. ModelNet-O is <strong><em>10 times</em></strong> larger than existing datasets and offers more challenging cases to evaluate the robustness of existing methods. Our observation on ModelNet-O reveals that <strong>well-designed sparse structures can preserve structural information of point clouds under occlusion</strong>, motivating us to propose a robust point cloud processing method that leverages a critical point sampling (CPS) strategy in a multi-level manner. We term our method PointMLS. Through extensive experiments, we demonstrate that our PointMLS achieves state-of-the-art results on ModelNet-O and competitive results on regular datasets such as ModelNet40 and ScanObjectNN, and we also demonstrate its robustness and effectiveness. Code available: <span>https://github.com/fanglaosi/ModelNet-O_PointMLS</span><svg><path></path></svg>.</p></div>","PeriodicalId":50633,"journal":{"name":"Computer Vision and Image Understanding","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification\",\"authors\":\"Zhongbin Fang , Xia Li , Xiangtai Li , Shen Zhao , Mengyuan Liu\",\"doi\":\"10.1016/j.cviu.2024.104060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Recently, 3D point cloud classification has made significant progress with the help of many datasets. However, these datasets do not reflect the incomplete nature of real-world point clouds caused by <strong><em>occlusion</em></strong>, which limits the practical application of current methods. To bridge this gap, we propose ModelNet-O, <strong>a large-scale synthetic dataset of 123,041 samples</strong> that emulates real-world point clouds with self-occlusion caused by scanning from monocular cameras. ModelNet-O is <strong><em>10 times</em></strong> larger than existing datasets and offers more challenging cases to evaluate the robustness of existing methods. Our observation on ModelNet-O reveals that <strong>well-designed sparse structures can preserve structural information of point clouds under occlusion</strong>, motivating us to propose a robust point cloud processing method that leverages a critical point sampling (CPS) strategy in a multi-level manner. We term our method PointMLS. Through extensive experiments, we demonstrate that our PointMLS achieves state-of-the-art results on ModelNet-O and competitive results on regular datasets such as ModelNet40 and ScanObjectNN, and we also demonstrate its robustness and effectiveness. Code available: <span>https://github.com/fanglaosi/ModelNet-O_PointMLS</span><svg><path></path></svg>.</p></div>\",\"PeriodicalId\":50633,\"journal\":{\"name\":\"Computer Vision and Image Understanding\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Vision and Image Understanding\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1077314224001413\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision and Image Understanding","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077314224001413","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

最近，在许多数据集的帮助下，三维点云分类取得了重大进展。然而，这些数据集并不能反映真实世界点云因遮挡而造成的不完整性，这限制了当前方法的实际应用。为了弥补这一缺陷，我们提出了 ModelNet-O，这是一个由 123,041 个样本组成的大规模合成数据集，它模拟了真实世界中由单目摄像头扫描引起的自闭塞点云。ModelNet-O 比现有数据集大 10 倍，为评估现有方法的鲁棒性提供了更具挑战性的案例。我们对 ModelNet-O 的观察发现，精心设计的稀疏结构可以在遮挡情况下保留点云的结构信息，这促使我们提出了一种稳健的点云处理方法，该方法以多层次的方式利用临界点采样（CPS）策略。我们将这种方法称为 PointMLS。通过大量实验，我们证明了我们的 PointMLS 在 ModelNet-O 上取得了最先进的结果，在 ModelNet40 和 ScanObjectNN 等常规数据集上也取得了有竞争力的结果，我们还证明了它的鲁棒性和有效性。可用代码：https://github.com/fanglaosi/ModelNet-O_PointMLS。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification

查看原文本刊更多论文

ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification

Recently, 3D point cloud classification has made significant progress with the help of many datasets. However, these datasets do not reflect the incomplete nature of real-world point clouds caused by occlusion, which limits the practical application of current methods. To bridge this gap, we propose ModelNet-O, a large-scale synthetic dataset of 123,041 samples that emulates real-world point clouds with self-occlusion caused by scanning from monocular cameras. ModelNet-O is 10 times larger than existing datasets and offers more challenging cases to evaluate the robustness of existing methods. Our observation on ModelNet-O reveals that well-designed sparse structures can preserve structural information of point clouds under occlusion, motivating us to propose a robust point cloud processing method that leverages a critical point sampling (CPS) strategy in a multi-level manner. We term our method PointMLS. Through extensive experiments, we demonstrate that our PointMLS achieves state-of-the-art results on ModelNet-O and competitive results on regular datasets such as ModelNet40 and ScanObjectNN, and we also demonstrate its robustness and effectiveness. Code available: https://github.com/fanglaosi/ModelNet-O_PointMLS.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Vision and Image Understanding 工程技术-工程：电子与电气

CiteScore

7.80

自引率

4.40%

发文量

112

审稿时长

79 days

期刊介绍： The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views. Research Areas Include: • Theory • Early vision • Data structures and representations • Shape • Range • Motion • Matching and recognition • Architecture and languages • Vision systems