Spatial feature recognition and layout method based on improved CenterNet and LSTM frameworks

IF 1.6 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Yuxuan Gu, Fengyu Liu, Xiaodi Yi, Lewei Yang, Yunshu Wang
{"title":"Spatial feature recognition and layout method based on improved CenterNet and LSTM frameworks","authors":"Yuxuan Gu,&nbsp;Fengyu Liu,&nbsp;Xiaodi Yi,&nbsp;Lewei Yang,&nbsp;Yunshu Wang","doi":"10.4218/etrij.2024-0192","DOIUrl":null,"url":null,"abstract":"<p>Existing spatial feature recognition and layout methods primarily identify spatial components manually, which is time-consuming and inefficient, and the constraint relationship between objects in space can be difficult to observe. Consequently, this study introduces an advanced spatial feature recognition and layout methodology employing enhanced CenterNet and LSTM (Long Short-Term Memory) frameworks, which is bifurcated into two major components—first, HCenterNet-based feature recognition enhances feature extraction through an attention mechanism and feature fusion technology, refining the identification of small targets within complex background areas; second, a GA-BiLSTM (Genetic Algorithm - Bidirectional LSTM)-based spatial layout model uses a bidirectional LSTM network optimized with a genetic algorithm (GA), aimed at fine-tuning the network parameters to yield more accurate spatial layouts. Experiments verified that compared with the CenterNet model, the recognition performance of the proposed HCenterNet-DIoU model improved by 7.44%. Moreover, the GA-BiLSTM model improved the overall layout accuracy by 10.08% compared with the LSTM model. Time cost analysis also confirmed that the proposed model could meet the real-time requirements.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 4","pages":"721-736"},"PeriodicalIF":1.6000,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0192","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ETRI Journal","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.4218/etrij.2024-0192","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Existing spatial feature recognition and layout methods primarily identify spatial components manually, which is time-consuming and inefficient, and the constraint relationship between objects in space can be difficult to observe. Consequently, this study introduces an advanced spatial feature recognition and layout methodology employing enhanced CenterNet and LSTM (Long Short-Term Memory) frameworks, which is bifurcated into two major components—first, HCenterNet-based feature recognition enhances feature extraction through an attention mechanism and feature fusion technology, refining the identification of small targets within complex background areas; second, a GA-BiLSTM (Genetic Algorithm - Bidirectional LSTM)-based spatial layout model uses a bidirectional LSTM network optimized with a genetic algorithm (GA), aimed at fine-tuning the network parameters to yield more accurate spatial layouts. Experiments verified that compared with the CenterNet model, the recognition performance of the proposed HCenterNet-DIoU model improved by 7.44%. Moreover, the GA-BiLSTM model improved the overall layout accuracy by 10.08% compared with the LSTM model. Time cost analysis also confirmed that the proposed model could meet the real-time requirements.

Abstract Image

基于改进CenterNet和LSTM框架的空间特征识别与布局方法
现有的空间特征识别和布局方法主要是手工识别空间成分,耗时长、效率低,且空间中物体之间的约束关系难以观察。基于此,本研究引入了一种基于增强的CenterNet和LSTM(长短期记忆)框架的先进空间特征识别和布局方法,该方法分为两个主要部分:首先,基于hcenternet的特征识别通过注意机制和特征融合技术增强了特征提取,细化了复杂背景区域内小目标的识别;其次,基于GA- bilstm (Genetic Algorithm - Bidirectional LSTM)的空间布局模型采用遗传算法优化的双向LSTM网络,对网络参数进行微调,得到更精确的空间布局。实验验证,与CenterNet模型相比,提出的HCenterNet-DIoU模型的识别性能提高了7.44%。与LSTM模型相比,GA-BiLSTM模型总体布局精度提高了10.08%。时间成本分析也证实了所提出的模型能够满足实时性要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ETRI Journal
ETRI Journal 工程技术-电信学
CiteScore
4.00
自引率
7.10%
发文量
98
审稿时长
6.9 months
期刊介绍: ETRI Journal is an international, peer-reviewed multidisciplinary journal published bimonthly in English. The main focus of the journal is to provide an open forum to exchange innovative ideas and technology in the fields of information, telecommunications, and electronics. Key topics of interest include high-performance computing, big data analytics, cloud computing, multimedia technology, communication networks and services, wireless communications and mobile computing, material and component technology, as well as security. With an international editorial committee and experts from around the world as reviewers, ETRI Journal publishes high-quality research papers on the latest and best developments from the global community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信