{"title":"基于点云的3D场景图预测","authors":"Fanfan Wu, Feihu Yan, Weimin Shi, Zhong Zhou","doi":"10.1016/j.vrih.2022.01.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>In this study, we propose a novel 3D scene graph prediction approach for scene understanding from point clouds.</p></div><div><h3>Methods</h3><p>It can automatically organize the entities of a scene in a graph, where objects are nodes and their relationships are modeled as edges. More specifically, we employ the DGCNN to capture the features of objects and their relationships in the scene. A Graph Attention Network (GAT) is introduced to exploit latent features obtained from the initial estimation to further refine the object arrangement in the graph structure. A one loss function modified from cross entropy with a variable weight is proposed to solve the multi-category problem in the prediction of object and predicate.</p></div><div><h3>Results</h3><p>Experiments reveal that the proposed approach performs favorably against the state-of-the-art methods in terms of predicate classification and relationship prediction and achieves comparable performance on object classification prediction.</p></div><div><h3>Conclusions</h3><p>The 3D scene graph prediction approach can form an abstract description of the scene space from point clouds.</p></div>","PeriodicalId":33538,"journal":{"name":"Virtual Reality Intelligent Hardware","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2096579622000055/pdf?md5=00217b3d733a606f1856c11825c17ba8&pid=1-s2.0-S2096579622000055-main.pdf","citationCount":"2","resultStr":"{\"title\":\"3D scene graph prediction from point clouds\",\"authors\":\"Fanfan Wu, Feihu Yan, Weimin Shi, Zhong Zhou\",\"doi\":\"10.1016/j.vrih.2022.01.005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>In this study, we propose a novel 3D scene graph prediction approach for scene understanding from point clouds.</p></div><div><h3>Methods</h3><p>It can automatically organize the entities of a scene in a graph, where objects are nodes and their relationships are modeled as edges. More specifically, we employ the DGCNN to capture the features of objects and their relationships in the scene. A Graph Attention Network (GAT) is introduced to exploit latent features obtained from the initial estimation to further refine the object arrangement in the graph structure. A one loss function modified from cross entropy with a variable weight is proposed to solve the multi-category problem in the prediction of object and predicate.</p></div><div><h3>Results</h3><p>Experiments reveal that the proposed approach performs favorably against the state-of-the-art methods in terms of predicate classification and relationship prediction and achieves comparable performance on object classification prediction.</p></div><div><h3>Conclusions</h3><p>The 3D scene graph prediction approach can form an abstract description of the scene space from point clouds.</p></div>\",\"PeriodicalId\":33538,\"journal\":{\"name\":\"Virtual Reality Intelligent Hardware\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2096579622000055/pdf?md5=00217b3d733a606f1856c11825c17ba8&pid=1-s2.0-S2096579622000055-main.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virtual Reality Intelligent Hardware\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2096579622000055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Reality Intelligent Hardware","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096579622000055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
In this study, we propose a novel 3D scene graph prediction approach for scene understanding from point clouds.
Methods
It can automatically organize the entities of a scene in a graph, where objects are nodes and their relationships are modeled as edges. More specifically, we employ the DGCNN to capture the features of objects and their relationships in the scene. A Graph Attention Network (GAT) is introduced to exploit latent features obtained from the initial estimation to further refine the object arrangement in the graph structure. A one loss function modified from cross entropy with a variable weight is proposed to solve the multi-category problem in the prediction of object and predicate.
Results
Experiments reveal that the proposed approach performs favorably against the state-of-the-art methods in terms of predicate classification and relationship prediction and achieves comparable performance on object classification prediction.
Conclusions
The 3D scene graph prediction approach can form an abstract description of the scene space from point clouds.