{"title":"An Efficient Pelican optimization based CNN-BiLSTM to Detect and Classify 3D Objects","authors":"Ramana Rajendran, B. Murugan","doi":"10.1109/ICKECS56523.2022.10060768","DOIUrl":null,"url":null,"abstract":"The lack of appropriate shape representation makes it complex to recognize the 3D shapes accurately and it is a hot topic in the field of Computer Vision (CV). This paper presents a Pelican optimized Convolutional Neural Network (CNN)-Bidirectional Long Short Term Memory (BiLSTM) to recognize the different objects in a particular scene. The CNN-BilSTM architecture is formed by placing two BiLSTM architectures below the CNN network and integrating the outputs via a fully connected layer. The pelican optimization algorithm is mainly incorporated to optimize the different hyperparameters associated with the CNN-BiLSTM architecture such as number of layers, batch size, number of layers, dropout, etc. The experiments are conducted using the ScanNet dataset which comprises both 2D and 3D data along with the labeled voxels. The proposed methodology offers improved results when compared with the existing techniques in terms of confusion matrix, accuracy, precision, and recall.","PeriodicalId":171432,"journal":{"name":"2022 International Conference on Knowledge Engineering and Communication Systems (ICKES)","volume":"205 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Knowledge Engineering and Communication Systems (ICKES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKECS56523.2022.10060768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The lack of appropriate shape representation makes it complex to recognize the 3D shapes accurately and it is a hot topic in the field of Computer Vision (CV). This paper presents a Pelican optimized Convolutional Neural Network (CNN)-Bidirectional Long Short Term Memory (BiLSTM) to recognize the different objects in a particular scene. The CNN-BilSTM architecture is formed by placing two BiLSTM architectures below the CNN network and integrating the outputs via a fully connected layer. The pelican optimization algorithm is mainly incorporated to optimize the different hyperparameters associated with the CNN-BiLSTM architecture such as number of layers, batch size, number of layers, dropout, etc. The experiments are conducted using the ScanNet dataset which comprises both 2D and 3D data along with the labeled voxels. The proposed methodology offers improved results when compared with the existing techniques in terms of confusion matrix, accuracy, precision, and recall.