An Efficient Pelican optimization based CNN-BiLSTM to Detect and Classify 3D Objects

2022 International Conference on Knowledge Engineering and Communication Systems (ICKES) Pub Date : 2022-12-28 DOI:10.1109/ICKECS56523.2022.10060768

Ramana Rajendran, B. Murugan

引用次数: 0

Abstract

The lack of appropriate shape representation makes it complex to recognize the 3D shapes accurately and it is a hot topic in the field of Computer Vision (CV). This paper presents a Pelican optimized Convolutional Neural Network (CNN)-Bidirectional Long Short Term Memory (BiLSTM) to recognize the different objects in a particular scene. The CNN-BilSTM architecture is formed by placing two BiLSTM architectures below the CNN network and integrating the outputs via a fully connected layer. The pelican optimization algorithm is mainly incorporated to optimize the different hyperparameters associated with the CNN-BiLSTM architecture such as number of layers, batch size, number of layers, dropout, etc. The experiments are conducted using the ScanNet dataset which comprises both 2D and 3D data along with the labeled voxels. The proposed methodology offers improved results when compared with the existing techniques in terms of confusion matrix, accuracy, precision, and recall.

查看原文本刊更多论文

基于Pelican优化的CNN-BiLSTM三维目标检测与分类

缺乏合适的形状表示使得三维形状的准确识别变得复杂，是计算机视觉领域的研究热点。本文提出了一种鹈鹕优化卷积神经网络(CNN)-双向长短期记忆(BiLSTM)来识别特定场景中的不同物体。CNN- BiLSTM架构是通过在CNN网络下方放置两个BiLSTM架构，并通过一个全连接层集成输出而形成的。鹈鹕优化算法主要用于优化与CNN-BiLSTM架构相关的不同超参数，如层数、批大小、层数、dropout等。实验使用ScanNet数据集进行，该数据集包括2D和3D数据以及标记的体素。该方法在混淆矩阵、准确度、精密度和召回率等方面均优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Knowledge Engineering and Communication Systems (ICKES)

自引率

0.00%

发文量