{"title":"Depthwise Over-Parameterized CNN for Voxel Human Pose Classification","authors":"O. V. Putra, Riandini, E. M. Yuniarno, M. Purnomo","doi":"10.1109/ISITIA59021.2023.10221054","DOIUrl":null,"url":null,"abstract":"Light Detection and Ranging (LiDAR) capture objects and backgrounds using a laser sensor, producing unstructured points in 3-dimensional called point clouds (PC). However, captured human pose PC is limited partially due to the LiDAR scan. The only information in the scanned area exists. Due to the inadequacy of PC data, it is challenging to classify such data. In this paper, we proposed a solution to overcome those problems. It is a novel depthwise over-parameterized (DOConv) embedded into a simple CNN. The raw PCs are converted into a 3D voxel in the input layer. In the convolutional (Conv) layer, the regular Conv is substituted with a-three layered DOConv. Lastly, to assess the performance of our model, we commence an evaluation with multiple classifier algorithms in ModelNet40 and our human pose dataset. Accuracy, loss, recall, precision, F1-scores, and Geometric mean are engaged as performance indicators. To sum up, our model outperformed all compared classifiers in accuracy for the primary dataset by 87.06 % and ModelNet40 by 68.68%.","PeriodicalId":116682,"journal":{"name":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA59021.2023.10221054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Light Detection and Ranging (LiDAR) capture objects and backgrounds using a laser sensor, producing unstructured points in 3-dimensional called point clouds (PC). However, captured human pose PC is limited partially due to the LiDAR scan. The only information in the scanned area exists. Due to the inadequacy of PC data, it is challenging to classify such data. In this paper, we proposed a solution to overcome those problems. It is a novel depthwise over-parameterized (DOConv) embedded into a simple CNN. The raw PCs are converted into a 3D voxel in the input layer. In the convolutional (Conv) layer, the regular Conv is substituted with a-three layered DOConv. Lastly, to assess the performance of our model, we commence an evaluation with multiple classifier algorithms in ModelNet40 and our human pose dataset. Accuracy, loss, recall, precision, F1-scores, and Geometric mean are engaged as performance indicators. To sum up, our model outperformed all compared classifiers in accuracy for the primary dataset by 87.06 % and ModelNet40 by 68.68%.