{"title":"基于多视角感知扩散的脑电图驱动三维目标解码","authors":"Xin Xiang, Wenhui Zhou, Guojun Dai","doi":"10.1016/j.engappai.2025.111180","DOIUrl":null,"url":null,"abstract":"<div><div>With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: <span><span>https://github.com/xiangxinhello/EEG_to_3D</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"156 ","pages":"Article 111180"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Electroencephalography-driven three-dimensional object decoding with multi-view perception diffusion\",\"authors\":\"Xin Xiang, Wenhui Zhou, Guojun Dai\",\"doi\":\"10.1016/j.engappai.2025.111180\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: <span><span>https://github.com/xiangxinhello/EEG_to_3D</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"156 \",\"pages\":\"Article 111180\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625011819\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625011819","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Electroencephalography-driven three-dimensional object decoding with multi-view perception diffusion
With the rapid development of artificial intelligence and neuroscience, existing research has shown that two-dimensional (2D) visual stimuli observed by subjects can be reconstructed from Electroencephalography (EEG) signals. However, few studies have attempted to decode three-dimensional (3D) visual objects from EEG signals evoked by 2D visual stimuli. Neuroscience research has confirmed that the brain can perceive 3D forms and cues from 2D visual stimuli, and this perceptual activity is reflected in EEG signals. To address this gap, this paper explores high-fidelity 3D object decoding from EEG signals by leveraging a multi-view perception diffusion model combined with Neural Radiance Field (NeRF) representations. The proposed EEG-to-3D method employs a two-stage learning process. The first stage captures latent EEG codes that imply 3D perceptual representations, by presenting a multi-task optimization strategy that combines EEG signal reconstruction with semantic classification tasks. In the second stage, a multi-view perception diffusion model, conditioned on latent EEG codes, is fine-tuned to constrain and optimize the parameters of the NeRF model, thereby generating multi-views of 3D objects with both semantic and viewpoint consistency. Experimental results demonstrate the effectiveness and superiority of the proposed method for 3D object decoding from EEG signals. The source codes are publicly available at: https://github.com/xiangxinhello/EEG_to_3D.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.