Subject-independent auditory spatial attention detection based on brain topology modeling and feature distribution alignment

IF 2.5 2区医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY

Hearing Research Pub Date : 2024-08-14 DOI:10.1016/j.heares.2024.109104

Yixiang Niu, Ning Chen, Hongqing Zhu, Guangqiang Li, Yibo Chen

{"title":"Subject-independent auditory spatial attention detection based on brain topology modeling and feature distribution alignment","authors":"Yixiang Niu, Ning Chen, Hongqing Zhu, Guangqiang Li, Yibo Chen","doi":"10.1016/j.heares.2024.109104","DOIUrl":null,"url":null,"abstract":"<div><p>Auditory spatial attention detection (ASAD) seeks to determine which speaker in a surround sound field a listener is focusing on based on the one’s brain biosignals. Although existing studies have achieved ASAD from a single-trial electroencephalogram (EEG), the huge inter-subject variability makes them generally perform poorly in cross-subject scenarios. Besides, most ASAD methods do not take full advantage of topological relationships between EEG channels, which are crucial for high-quality ASAD. Recently, some advanced studies have introduced graph-based brain topology modeling into ASAD, but how to calculate edge weights in a graph to better capture actual brain connectivity is worthy of further investigation. To address these issues, we propose a new ASAD method in this paper. First, we model a multi-channel EEG segment as a graph, where differential entropy serves as the node feature, and a static adjacency matrix is generated based on inter-channel mutual information to quantify brain functional connectivity. Then, different subjects’ EEG graphs are encoded into a shared embedding space through a total variation graph neural network. Meanwhile, feature distribution alignment based on multi-kernel maximum mean discrepancy is adopted to learn subject-invariant patterns. Note that we align EEG embeddings of different subjects to reference distributions rather than align them to each other for the purpose of privacy preservation. A series of experiments on open datasets demonstrate that the proposed model outperforms state-of-the-art ASAD models in cross-subject scenarios with relatively low computational complexity, and feature distribution alignment improves the generalizability of the proposed model to a new subject.</p></div>","PeriodicalId":12881,"journal":{"name":"Hearing Research","volume":"453 ","pages":"Article 109104"},"PeriodicalIF":2.5000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hearing Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378595524001576","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Auditory spatial attention detection (ASAD) seeks to determine which speaker in a surround sound field a listener is focusing on based on the one’s brain biosignals. Although existing studies have achieved ASAD from a single-trial electroencephalogram (EEG), the huge inter-subject variability makes them generally perform poorly in cross-subject scenarios. Besides, most ASAD methods do not take full advantage of topological relationships between EEG channels, which are crucial for high-quality ASAD. Recently, some advanced studies have introduced graph-based brain topology modeling into ASAD, but how to calculate edge weights in a graph to better capture actual brain connectivity is worthy of further investigation. To address these issues, we propose a new ASAD method in this paper. First, we model a multi-channel EEG segment as a graph, where differential entropy serves as the node feature, and a static adjacency matrix is generated based on inter-channel mutual information to quantify brain functional connectivity. Then, different subjects’ EEG graphs are encoded into a shared embedding space through a total variation graph neural network. Meanwhile, feature distribution alignment based on multi-kernel maximum mean discrepancy is adopted to learn subject-invariant patterns. Note that we align EEG embeddings of different subjects to reference distributions rather than align them to each other for the purpose of privacy preservation. A series of experiments on open datasets demonstrate that the proposed model outperforms state-of-the-art ASAD models in cross-subject scenarios with relatively low computational complexity, and feature distribution alignment improves the generalizability of the proposed model to a new subject.

查看原文本刊更多论文

基于大脑拓扑建模和特征分布配准的受试者无关听觉空间注意力检测

听觉空间注意力检测（ASAD）旨在根据听者的大脑生物信号来确定其注意力集中在环绕声场中的哪个扬声器上。虽然现有研究已通过单次脑电图（EEG）实现了听觉空间注意力检测，但由于受试者之间存在巨大差异，因此这些方法在跨受试者场景中的表现普遍较差。此外，大多数 ASAD 方法都没有充分利用脑电图通道之间的拓扑关系，而拓扑关系对高质量 ASAD 至关重要。最近，一些先进的研究将基于图的大脑拓扑建模引入了 ASAD，但如何计算图中的边权重以更好地捕捉实际的大脑连接性值得进一步研究。针对这些问题，我们在本文中提出了一种新的 ASAD 方法。首先，我们将多通道脑电图片段建模为一个图，以差分熵作为节点特征，并根据通道间互信息生成静态邻接矩阵，以量化大脑功能连接性。然后，通过总变异图神经网络将不同受试者的脑电图图编码到共享的嵌入空间中。同时，采用基于多核最大均值差异的特征分布对齐来学习主体不变模式。需要注意的是，为了保护隐私，我们将不同主体的脑电图嵌入对齐到参考分布，而不是相互对齐。在开放数据集上进行的一系列实验表明，所提出的模型在跨主体场景中的表现优于最先进的 ASAD 模型，而且计算复杂度相对较低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Hearing Research 医学-耳鼻喉科学

CiteScore

5.30

自引率

14.30%

发文量

163

审稿时长

75 days

期刊介绍： The aim of the journal is to provide a forum for papers concerned with basic peripheral and central auditory mechanisms. Emphasis is on experimental and clinical studies, but theoretical and methodological papers will also be considered. The journal publishes original research papers, review and mini- review articles, rapid communications, method/protocol and perspective articles. Papers submitted should deal with auditory anatomy, physiology, psychophysics, imaging, modeling and behavioural studies in animals and humans, as well as hearing aids and cochlear implants. Papers dealing with the vestibular system are also considered for publication. Papers on comparative aspects of hearing and on effects of drugs and environmental contaminants on hearing function will also be considered. Clinical papers will be accepted when they contribute to the understanding of normal and pathological hearing functions.