基于多视角感知扩散的脑电图驱动三维目标解码

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Xin Xiang, Wenhui Zhou, Guojun Dai
{"title":"基于多视角感知扩散的脑电图驱动三维目标解码","authors":"Xin Xiang,&nbsp;Wenhui Zhou,&nbsp;Guojun Dai","doi":"10.1016/j.engappai.2025.111180","DOIUrl":null,"url":null,"abstract":"<div><div>With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: <span><span>https://github.com/xiangxinhello/EEG_to_3D</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"156 ","pages":"Article 111180"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Electroencephalography-driven three-dimensional object decoding with multi-view perception diffusion\",\"authors\":\"Xin Xiang,&nbsp;Wenhui Zhou,&nbsp;Guojun Dai\",\"doi\":\"10.1016/j.engappai.2025.111180\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: <span><span>https://github.com/xiangxinhello/EEG_to_3D</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"156 \",\"pages\":\"Article 111180\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625011819\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625011819","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

随着人工智能和神经科学的快速发展,已有研究表明,被试观察到的二维视觉刺激可以从脑电图(EEG)信号中重建出来。然而,很少有研究试图从二维视觉刺激引起的脑电图信号中解码三维视觉对象。神经科学研究已经证实,大脑可以从2D视觉刺激中感知3D形式和线索,这种感知活动反映在脑电图信号中。为了解决这一差距,本文通过利用结合神经辐射场(NeRF)表示的多视图感知扩散模型,探索了从EEG信号中解码高保真3D物体。提出的EEG-to-3D方法采用两阶段学习过程。第一阶段通过提出将脑电信号重构与语义分类任务相结合的多任务优化策略,捕获隐含3D感知表征的潜在脑电信号。第二阶段,以潜在脑电图编码为条件,对多视图感知扩散模型进行微调,约束和优化NeRF模型的参数,从而生成语义和视点一致的3D物体多视图。实验结果证明了该方法在脑电信号三维目标解码中的有效性和优越性。源代码可以在https://github.com/xiangxinhello/EEG_to_3D上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Electroencephalography-driven three-dimensional object decoding with multi-view perception diffusion
With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: https://github.com/xiangxinhello/EEG_to_3D.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信