Subject-independent auditory spatial attention detection based on brain topology modeling and feature distribution alignment

IF 2.5 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
{"title":"Subject-independent auditory spatial attention detection based on brain topology modeling and feature distribution alignment","authors":"","doi":"10.1016/j.heares.2024.109104","DOIUrl":null,"url":null,"abstract":"<div><p>Auditory spatial attention detection (ASAD) seeks to determine which speaker in a surround sound field a listener is focusing on based on the one’s brain biosignals. Although existing studies have achieved ASAD from a single-trial electroencephalogram (EEG), the huge inter-subject variability makes them generally perform poorly in cross-subject scenarios. Besides, most ASAD methods do not take full advantage of topological relationships between EEG channels, which are crucial for high-quality ASAD. Recently, some advanced studies have introduced graph-based brain topology modeling into ASAD, but how to calculate edge weights in a graph to better capture actual brain connectivity is worthy of further investigation. To address these issues, we propose a new ASAD method in this paper. First, we model a multi-channel EEG segment as a graph, where differential entropy serves as the node feature, and a static adjacency matrix is generated based on inter-channel mutual information to quantify brain functional connectivity. Then, different subjects’ EEG graphs are encoded into a shared embedding space through a total variation graph neural network. Meanwhile, feature distribution alignment based on multi-kernel maximum mean discrepancy is adopted to learn subject-invariant patterns. Note that we align EEG embeddings of different subjects to reference distributions rather than align them to each other for the purpose of privacy preservation. A series of experiments on open datasets demonstrate that the proposed model outperforms state-of-the-art ASAD models in cross-subject scenarios with relatively low computational complexity, and feature distribution alignment improves the generalizability of the proposed model to a new subject.</p></div>","PeriodicalId":12881,"journal":{"name":"Hearing Research","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hearing Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378595524001576","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Auditory spatial attention detection (ASAD) seeks to determine which speaker in a surround sound field a listener is focusing on based on the one’s brain biosignals. Although existing studies have achieved ASAD from a single-trial electroencephalogram (EEG), the huge inter-subject variability makes them generally perform poorly in cross-subject scenarios. Besides, most ASAD methods do not take full advantage of topological relationships between EEG channels, which are crucial for high-quality ASAD. Recently, some advanced studies have introduced graph-based brain topology modeling into ASAD, but how to calculate edge weights in a graph to better capture actual brain connectivity is worthy of further investigation. To address these issues, we propose a new ASAD method in this paper. First, we model a multi-channel EEG segment as a graph, where differential entropy serves as the node feature, and a static adjacency matrix is generated based on inter-channel mutual information to quantify brain functional connectivity. Then, different subjects’ EEG graphs are encoded into a shared embedding space through a total variation graph neural network. Meanwhile, feature distribution alignment based on multi-kernel maximum mean discrepancy is adopted to learn subject-invariant patterns. Note that we align EEG embeddings of different subjects to reference distributions rather than align them to each other for the purpose of privacy preservation. A series of experiments on open datasets demonstrate that the proposed model outperforms state-of-the-art ASAD models in cross-subject scenarios with relatively low computational complexity, and feature distribution alignment improves the generalizability of the proposed model to a new subject.

基于大脑拓扑建模和特征分布配准的受试者无关听觉空间注意力检测
听觉空间注意力检测(ASAD)旨在根据听者的大脑生物信号来确定其注意力集中在环绕声场中的哪个扬声器上。虽然现有研究已通过单次脑电图(EEG)实现了听觉空间注意力检测,但由于受试者之间存在巨大差异,因此这些方法在跨受试者场景中的表现普遍较差。此外,大多数 ASAD 方法都没有充分利用脑电图通道之间的拓扑关系,而拓扑关系对高质量 ASAD 至关重要。最近,一些先进的研究将基于图的大脑拓扑建模引入了 ASAD,但如何计算图中的边权重以更好地捕捉实际的大脑连接性值得进一步研究。针对这些问题,我们在本文中提出了一种新的 ASAD 方法。首先,我们将多通道脑电图片段建模为一个图,以差分熵作为节点特征,并根据通道间互信息生成静态邻接矩阵,以量化大脑功能连接性。然后,通过总变异图神经网络将不同受试者的脑电图图编码到共享的嵌入空间中。同时,采用基于多核最大均值差异的特征分布对齐来学习主体不变模式。需要注意的是,为了保护隐私,我们将不同主体的脑电图嵌入对齐到参考分布,而不是相互对齐。在开放数据集上进行的一系列实验表明,所提出的模型在跨主体场景中的表现优于最先进的 ASAD 模型,而且计算复杂度相对较低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Hearing Research
Hearing Research 医学-耳鼻喉科学
CiteScore
5.30
自引率
14.30%
发文量
163
审稿时长
75 days
期刊介绍: The aim of the journal is to provide a forum for papers concerned with basic peripheral and central auditory mechanisms. Emphasis is on experimental and clinical studies, but theoretical and methodological papers will also be considered. The journal publishes original research papers, review and mini- review articles, rapid communications, method/protocol and perspective articles. Papers submitted should deal with auditory anatomy, physiology, psychophysics, imaging, modeling and behavioural studies in animals and humans, as well as hearing aids and cochlear implants. Papers dealing with the vestibular system are also considered for publication. Papers on comparative aspects of hearing and on effects of drugs and environmental contaminants on hearing function will also be considered. Clinical papers will be accepted when they contribute to the understanding of normal and pathological hearing functions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信