Yoelvis Moreno-Alcayde, V Javier Traver, Luis A Leiva
{"title":"通过脑电图预测注视和凝视位置。","authors":"Yoelvis Moreno-Alcayde, V Javier Traver, Luis A Leiva","doi":"10.1007/s11517-025-03362-6","DOIUrl":null,"url":null,"abstract":"<p><p>Brain signals carry cognitive information that can be relevant in downstream tasks, but what about eye-gaze? Although this can be estimated with eye-trackers, it can be very convenient in practice to do it without extra equipment. We consider the challenging tasks of fixation prediction and gaze estimation from electroencephalography (EEG) using deep learning models. We argue that there are three critical design criteria when designing neural architectures for EEG: (1) the spatial and temporal dimensions of the data, (2) the local vs global nature of the data processing, and (3) the overall structure and order with which the steps (1) and (2) are orchestrated. We propose two model architectures, based on Transformers and LSTMs, with different variants in this large design space, and compare them with recent state-of-the-art (SOTA) approaches under two constraints: reduced EEG signal length and reduced set of EEG channels. Our Transformer-based model outperforms the LSTM-only model, but it turns out to be more sensitive with short signal lengths and with less number of channels. Interestingly, our results are similar or slightly better than SOTA, and the models are trained from scratch (i.e., without pre-training or fine-tuning). Our findings provide useful insights for advancing in eye-from-EEG tasks.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting fixations and gaze location from EEG.\",\"authors\":\"Yoelvis Moreno-Alcayde, V Javier Traver, Luis A Leiva\",\"doi\":\"10.1007/s11517-025-03362-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Brain signals carry cognitive information that can be relevant in downstream tasks, but what about eye-gaze? Although this can be estimated with eye-trackers, it can be very convenient in practice to do it without extra equipment. We consider the challenging tasks of fixation prediction and gaze estimation from electroencephalography (EEG) using deep learning models. We argue that there are three critical design criteria when designing neural architectures for EEG: (1) the spatial and temporal dimensions of the data, (2) the local vs global nature of the data processing, and (3) the overall structure and order with which the steps (1) and (2) are orchestrated. We propose two model architectures, based on Transformers and LSTMs, with different variants in this large design space, and compare them with recent state-of-the-art (SOTA) approaches under two constraints: reduced EEG signal length and reduced set of EEG channels. Our Transformer-based model outperforms the LSTM-only model, but it turns out to be more sensitive with short signal lengths and with less number of channels. Interestingly, our results are similar or slightly better than SOTA, and the models are trained from scratch (i.e., without pre-training or fine-tuning). Our findings provide useful insights for advancing in eye-from-EEG tasks.</p>\",\"PeriodicalId\":49840,\"journal\":{\"name\":\"Medical & Biological Engineering & Computing\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical & Biological Engineering & Computing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11517-025-03362-6\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03362-6","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Brain signals carry cognitive information that can be relevant in downstream tasks, but what about eye-gaze? Although this can be estimated with eye-trackers, it can be very convenient in practice to do it without extra equipment. We consider the challenging tasks of fixation prediction and gaze estimation from electroencephalography (EEG) using deep learning models. We argue that there are three critical design criteria when designing neural architectures for EEG: (1) the spatial and temporal dimensions of the data, (2) the local vs global nature of the data processing, and (3) the overall structure and order with which the steps (1) and (2) are orchestrated. We propose two model architectures, based on Transformers and LSTMs, with different variants in this large design space, and compare them with recent state-of-the-art (SOTA) approaches under two constraints: reduced EEG signal length and reduced set of EEG channels. Our Transformer-based model outperforms the LSTM-only model, but it turns out to be more sensitive with short signal lengths and with less number of channels. Interestingly, our results are similar or slightly better than SOTA, and the models are trained from scratch (i.e., without pre-training or fine-tuning). Our findings provide useful insights for advancing in eye-from-EEG tasks.
期刊介绍:
Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging.
MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field.
MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).