Moch I. Riansyah, T. A. Sardjono, E. M. Yuniarno, M. Purnomo
{"title":"基于体素的三维卷积神经网络人体方向预测","authors":"Moch I. Riansyah, T. A. Sardjono, E. M. Yuniarno, M. Purnomo","doi":"10.1109/ISITIA59021.2023.10221066","DOIUrl":null,"url":null,"abstract":"Robot interaction with humans requires intelligent robots that can understand human activities. The development of advanced 3D LiDAR sensors has greatly contributed to this capability. In this study, we specifically focus on the use of 3D LiDAR sensors to predict the orientation of the human body using 3D Convolutional Neural Networks (CNNs) based on voxelized datasets. The dataset used in this study was created using a 3D LiDAR sensor with 32-channel specifications. We divided the dataset into four categories representing different walking orientations. The goal was to explore the performance of four different 3D CNN architectures using independently generated datasets. Based on the experimental results and performance analysis, it was found that VGG16 outperformed the other three architectures in predicting body orientation. VGG16 achieved an accuracy of 0.95, which was higher than DenseNet121 with approximately 0.91, ResNet50V2 with 0.80, and ResNet50 with 0.73. In the future, this method will be developed with additional orientation and results of architectural testing so that it can be modified to be better for further research on understanding human activity by robots.","PeriodicalId":116682,"journal":{"name":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of Human Body Orientation based on Voxel Using 3D Convolutional Neural Network\",\"authors\":\"Moch I. Riansyah, T. A. Sardjono, E. M. Yuniarno, M. Purnomo\",\"doi\":\"10.1109/ISITIA59021.2023.10221066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Robot interaction with humans requires intelligent robots that can understand human activities. The development of advanced 3D LiDAR sensors has greatly contributed to this capability. In this study, we specifically focus on the use of 3D LiDAR sensors to predict the orientation of the human body using 3D Convolutional Neural Networks (CNNs) based on voxelized datasets. The dataset used in this study was created using a 3D LiDAR sensor with 32-channel specifications. We divided the dataset into four categories representing different walking orientations. The goal was to explore the performance of four different 3D CNN architectures using independently generated datasets. Based on the experimental results and performance analysis, it was found that VGG16 outperformed the other three architectures in predicting body orientation. VGG16 achieved an accuracy of 0.95, which was higher than DenseNet121 with approximately 0.91, ResNet50V2 with 0.80, and ResNet50 with 0.73. In the future, this method will be developed with additional orientation and results of architectural testing so that it can be modified to be better for further research on understanding human activity by robots.\",\"PeriodicalId\":116682,\"journal\":{\"name\":\"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISITIA59021.2023.10221066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA59021.2023.10221066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Human Body Orientation based on Voxel Using 3D Convolutional Neural Network
Robot interaction with humans requires intelligent robots that can understand human activities. The development of advanced 3D LiDAR sensors has greatly contributed to this capability. In this study, we specifically focus on the use of 3D LiDAR sensors to predict the orientation of the human body using 3D Convolutional Neural Networks (CNNs) based on voxelized datasets. The dataset used in this study was created using a 3D LiDAR sensor with 32-channel specifications. We divided the dataset into four categories representing different walking orientations. The goal was to explore the performance of four different 3D CNN architectures using independently generated datasets. Based on the experimental results and performance analysis, it was found that VGG16 outperformed the other three architectures in predicting body orientation. VGG16 achieved an accuracy of 0.95, which was higher than DenseNet121 with approximately 0.91, ResNet50V2 with 0.80, and ResNet50 with 0.73. In the future, this method will be developed with additional orientation and results of architectural testing so that it can be modified to be better for further research on understanding human activity by robots.