探索阿尔茨海默氏痴呆症检测的高光谱时间分辨率

2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI:10.1109/SPCOM55316.2022.9840847

Nayan Anand Vats, Purva Barche, Mirishkar Sai Ganesh, A. Vuppala

{"title":"探索阿尔茨海默氏痴呆症检测的高光谱时间分辨率","authors":"Nayan Anand Vats, Purva Barche, Mirishkar Sai Ganesh, A. Vuppala","doi":"10.1109/SPCOM55316.2022.9840847","DOIUrl":null,"url":null,"abstract":"Alzheimer’s Dementia is a progressive neurological disorder characterized by cognitive impairment. It affects memory, thinking skills, language, and the ability to perform simple tasks. Detection of Alzheimer’s Dementia from the speech is considered a primitive task, as most speech cues are preserved in it. Studies in the literature focused mainly on the lexical features and few acoustic features for detecting Alzheimer’s disease. The present work explores the single frequency filtering cepstral coefficients (SFCC) for the automatic detection of Alzheimer’s disease. In contrast to STFTs, the proposed feature has better temporal and spectral resolution and captures the transient part more appropriately. This offers a very compact and efficient way to derive the formant structure in the speech signal. The experiments were conducted on the ADReSSo dataset, using the support vector machine classifier. The classification performance was compared with several baseline features like Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), linear prediction cepstral coefficient (LPCC), Mel frequency cepstral coefficients of LP-residual (MFCC-WR), ZFF signal (MFCC-ZF) and eGeMAPS (openSMILE). The experiments conducted on Alzheimer’s Dementia classification task show that the proposed feature performs better than conventional MFCCs. Among all the features, SFCC offers the best classification accuracy of 65.1% and 60.6% for dementia detection on cross-validation and test data, respectively. The combination of baseline features with SFCC features further improved the performance.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring High Spectro-Temporal Resolution for Alzheimer’s Dementia Detection\",\"authors\":\"Nayan Anand Vats, Purva Barche, Mirishkar Sai Ganesh, A. Vuppala\",\"doi\":\"10.1109/SPCOM55316.2022.9840847\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Alzheimer’s Dementia is a progressive neurological disorder characterized by cognitive impairment. It affects memory, thinking skills, language, and the ability to perform simple tasks. Detection of Alzheimer’s Dementia from the speech is considered a primitive task, as most speech cues are preserved in it. Studies in the literature focused mainly on the lexical features and few acoustic features for detecting Alzheimer’s disease. The present work explores the single frequency filtering cepstral coefficients (SFCC) for the automatic detection of Alzheimer’s disease. In contrast to STFTs, the proposed feature has better temporal and spectral resolution and captures the transient part more appropriately. This offers a very compact and efficient way to derive the formant structure in the speech signal. The experiments were conducted on the ADReSSo dataset, using the support vector machine classifier. The classification performance was compared with several baseline features like Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), linear prediction cepstral coefficient (LPCC), Mel frequency cepstral coefficients of LP-residual (MFCC-WR), ZFF signal (MFCC-ZF) and eGeMAPS (openSMILE). The experiments conducted on Alzheimer’s Dementia classification task show that the proposed feature performs better than conventional MFCCs. Among all the features, SFCC offers the best classification accuracy of 65.1% and 60.6% for dementia detection on cross-validation and test data, respectively. The combination of baseline features with SFCC features further improved the performance.\",\"PeriodicalId\":246982,\"journal\":{\"name\":\"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPCOM55316.2022.9840847\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM55316.2022.9840847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

老年痴呆症是一种以认知障碍为特征的进行性神经系统疾病。它会影响记忆力、思维能力、语言和执行简单任务的能力。从言语中检测阿尔茨海默氏症被认为是一项原始任务，因为大多数言语线索都保存在言语中。文献研究主要集中在阿尔茨海默病的词汇特征，很少有声学特征。本研究探讨了用于阿尔茨海默病自动检测的单频滤波倒谱系数(SFCC)。与stft相比，所提出的特征具有更好的时间和光谱分辨率，并且更合适地捕获瞬态部分。这提供了一种非常紧凑和有效的方法来推导语音信号的形成峰结构。实验在ADReSSo数据集上进行，使用支持向量机分类器。与Mel-频率倒谱系数(MFCC)、感知线性预测(PLP)、线性预测倒谱系数(LPCC)、Mel- lp -残差频率倒谱系数(MFCC- wr)、ZFF信号(MFCC- zf)和eGeMAPS (openSMILE)等基线特征进行分类性能比较。在阿尔茨海默氏痴呆症分类任务上进行的实验表明，所提出的特征比传统的MFCCs具有更好的性能。其中，在交叉验证和测试数据上，SFCC对痴呆的检测准确率最高，分别为65.1%和60.6%。基线特征与SFCC特征的结合进一步提高了性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Exploring High Spectro-Temporal Resolution for Alzheimer’s Dementia Detection

Alzheimer’s Dementia is a progressive neurological disorder characterized by cognitive impairment. It affects memory, thinking skills, language, and the ability to perform simple tasks. Detection of Alzheimer’s Dementia from the speech is considered a primitive task, as most speech cues are preserved in it. Studies in the literature focused mainly on the lexical features and few acoustic features for detecting Alzheimer’s disease. The present work explores the single frequency filtering cepstral coefficients (SFCC) for the automatic detection of Alzheimer’s disease. In contrast to STFTs, the proposed feature has better temporal and spectral resolution and captures the transient part more appropriately. This offers a very compact and efficient way to derive the formant structure in the speech signal. The experiments were conducted on the ADReSSo dataset, using the support vector machine classifier. The classification performance was compared with several baseline features like Mel-frequency cepstral coefficients (MFCC), perceptual linear prediction (PLP), linear prediction cepstral coefficient (LPCC), Mel frequency cepstral coefficients of LP-residual (MFCC-WR), ZFF signal (MFCC-ZF) and eGeMAPS (openSMILE). The experiments conducted on Alzheimer’s Dementia classification task show that the proposed feature performs better than conventional MFCCs. Among all the features, SFCC offers the best classification accuracy of 65.1% and 60.6% for dementia detection on cross-validation and test data, respectively. The combination of baseline features with SFCC features further improved the performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)

自引率

0.00%

发文量