Using Ear-EEG to Decode Auditory Attention in Multiple-speaker Environment

Haolin Zhu, Yujie Yan, Xiran Xu, Zhongshu Ge, Pei Tian, Xihong Wu, Jing Chen
{"title":"Using Ear-EEG to Decode Auditory Attention in Multiple-speaker Environment","authors":"Haolin Zhu, Yujie Yan, Xiran Xu, Zhongshu Ge, Pei Tian, Xihong Wu, Jing Chen","doi":"arxiv-2409.08710","DOIUrl":null,"url":null,"abstract":"Auditory Attention Decoding (AAD) can help to determine the identity of the\nattended speaker during an auditory selective attention task, by analyzing and\nprocessing measurements of electroencephalography (EEG) data. Most studies on\nAAD are based on scalp-EEG signals in two-speaker scenarios, which are far from\nreal application. Ear-EEG has recently gained significant attention due to its\nmotion tolerance and invisibility during data acquisition, making it easy to\nincorporate with other devices for applications. In this work, participants\nselectively attended to one of the four spatially separated speakers' speech in\nan anechoic room. The EEG data were concurrently collected from a scalp-EEG\nsystem and an ear-EEG system (cEEGrids). Temporal response functions (TRFs) and\nstimulus reconstruction (SR) were utilized using ear-EEG data. Results showed\nthat the attended speech TRFs were stronger than each unattended speech and\ndecoding accuracy was 41.3\\% in the 60s (chance level of 25\\%). To further\ninvestigate the impact of electrode placement and quantity, SR was utilized in\nboth scalp-EEG and ear-EEG, revealing that while the number of electrodes had a\nminor effect, their positioning had a significant influence on the decoding\naccuracy. One kind of auditory spatial attention detection (ASAD) method,\nSTAnet, was testified with this ear-EEG database, resulting in 93.1% in\n1-second decoding window. The implementation code and database for our work are\navailable on GitHub: https://github.com/zhl486/Ear_EEG_code.git and Zenodo:\nhttps://zenodo.org/records/10803261.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Auditory Attention Decoding (AAD) can help to determine the identity of the attended speaker during an auditory selective attention task, by analyzing and processing measurements of electroencephalography (EEG) data. Most studies on AAD are based on scalp-EEG signals in two-speaker scenarios, which are far from real application. Ear-EEG has recently gained significant attention due to its motion tolerance and invisibility during data acquisition, making it easy to incorporate with other devices for applications. In this work, participants selectively attended to one of the four spatially separated speakers' speech in an anechoic room. The EEG data were concurrently collected from a scalp-EEG system and an ear-EEG system (cEEGrids). Temporal response functions (TRFs) and stimulus reconstruction (SR) were utilized using ear-EEG data. Results showed that the attended speech TRFs were stronger than each unattended speech and decoding accuracy was 41.3\% in the 60s (chance level of 25\%). To further investigate the impact of electrode placement and quantity, SR was utilized in both scalp-EEG and ear-EEG, revealing that while the number of electrodes had a minor effect, their positioning had a significant influence on the decoding accuracy. One kind of auditory spatial attention detection (ASAD) method, STAnet, was testified with this ear-EEG database, resulting in 93.1% in 1-second decoding window. The implementation code and database for our work are available on GitHub: https://github.com/zhl486/Ear_EEG_code.git and Zenodo: https://zenodo.org/records/10803261.
利用耳电子脑电图解码多扬声器环境中的听觉注意力
听觉注意力解码(AAD)可以通过分析和处理脑电图(EEG)数据的测量结果,帮助确定在听觉选择性注意力任务中被注意的说话者的身份。大多数关于 AAD 的研究都是基于双扬声器场景下的头皮脑电信号,与实际应用相去甚远。最近,耳部电子脑电图(Ear-EEG)因其运动耐受性和数据采集过程中的隐蔽性而备受关注,这使得它很容易与其他设备结合应用。在这项工作中,参与者在消声室中选择性地聆听四位空间上分离的演讲者中的一位。脑电图数据由头皮脑电图系统和耳部脑电图系统(cEEGrids)同时采集。利用耳电子脑电图数据进行了时间响应函数(TRF)和刺激重建(SR)。结果表明,听过的语音 TRFs 比未听过的语音更强,在 60 年代的解码准确率为 41.3%(偶然水平为 25%)。为了进一步研究电极位置和数量的影响,研究人员在头皮电子脑电图和耳部电子脑电图中都使用了SR,结果表明,虽然电极数量的影响较小,但电极的位置对解码准确性有显著影响。一种名为 STAnet 的听觉空间注意力检测(ASAD)方法在该耳部电子脑电图数据库中进行了测试,结果表明,在 1 秒钟的解码窗口内,解码率达到 93.1%。我们工作的实现代码和数据库可在 GitHub: https://github.com/zhl486/Ear_EEG_code.git 和 Zenodo:https://zenodo.org/records/10803261 上获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信