Julia Kimball , Ashley Abi Chaker , Alp Canbulat , Ipsit Vahia
{"title":"96. 利用机器学习模型通过语音分析检测早期阿尔茨海默病","authors":"Julia Kimball , Ashley Abi Chaker , Alp Canbulat , Ipsit Vahia","doi":"10.1016/j.jagp.2025.04.098","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>There is an urgent need for novel approaches that may facilitate early detection of Alzheimer's disease and thus, create targets for effective intervention and management. Current diagnostic methods often rely on expensive and/or time-consuming procedures such as brain imaging and cognitive assessments. A novel approach proposes leveraging AI/ML models to detect AD early through the analysis of spontaneous speech and language use. This method holds the potential to advance the process of AD diagnosis by offering a non-invasive, cost-effective, and easily accessible screening tool that may identify subtle variations in linguistic (and by extension, neurocognitive) function that may not yet be identified by standard screening tools. Here, we explore the range of deep learning models that have been applied to language and cognition. We also compare their analytic approaches and available results, with a view to identifying which approach may translate most readily to clinical care.</div></div><div><h3>Methods</h3><div>We used a multi-faceted approach that included a literature review, brainstorming sessions with an interdisciplinary team and field experts, and targeted internet searches for relevant web-based resources. The focus of our search was to compile studies that explored the development and application of AI algorithms to identify subtle changes in speech patterns, linguistic features, and acoustic properties associated with the early stages of AD. We considered, but did not apply a traditional biomedical search algorithm, since the literature in this space is often found outside of the biomedical literature, and because this is an exploratory project. We noted that by analyzing extensive datasets of speech samples from both healthy individuals and those with AD, all the identified studies sought to establish robust predictive models for early detection. We further examined whether confounding variables present in current linguistic AD models, such as those arising from language barriers, are also present in trained deep learning models.</div></div><div><h3>Results</h3><div>Our investigation demonstrated the consistent application across the literature, of a multimodal system, encompassing both neural networks and traditional analysis models, which were fine-tuned for the early detection of Alzheimer's disease. Among these, the ADReSS dataset emerged as the most effective, with the ensemble method achieving the highest accuracy in predicting Alzheimer’s disease based on speech patterns. However, we noted a crucial limitation: the model’s training relied solely on English speech data. This restriction introduces bias and hinders generalizability. Languages exhibit distinct phonetic structures, accents, and rhythms, potentially causing a model trained exclusively on English to misinterpret speech from other languages. Furthermore, while deep neural networks excel at discerning complex patterns, their internal workings often remain opaque, making it challenging to ascertain the precise rationale behind specific predictions.</div></div><div><h3>Conclusions</h3><div>Our review identifies a notable body of literature that outlines a range of deep learning models that have already been applied to identifying cognitive changes through the use of language. With large language models gaining rapid popularity, there is a tremendous opportunity to gather data samples from natural language, and by pairing the right model with the right type of language data, powerful new screening tools may be developed. Our work points to two key areas for future prioritization: 1) developing models trained on diverse languages and 2) expanding existing datasets to encompass a wider range of linguistic variations, including various dialects and demographics. These advancements will contribute to more equitable and reliable speech-based Alzheimer's disease detection tools.</div></div>","PeriodicalId":55534,"journal":{"name":"American Journal of Geriatric Psychiatry","volume":"33 10","pages":"Page S71"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"96. USING MACHINE LEARNING MODELS TO DETECT EARLY ALZHEIMER’S DISEASE THROUGH SPEECH ANALYSIS\",\"authors\":\"Julia Kimball , Ashley Abi Chaker , Alp Canbulat , Ipsit Vahia\",\"doi\":\"10.1016/j.jagp.2025.04.098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Introduction</h3><div>There is an urgent need for novel approaches that may facilitate early detection of Alzheimer's disease and thus, create targets for effective intervention and management. Current diagnostic methods often rely on expensive and/or time-consuming procedures such as brain imaging and cognitive assessments. A novel approach proposes leveraging AI/ML models to detect AD early through the analysis of spontaneous speech and language use. This method holds the potential to advance the process of AD diagnosis by offering a non-invasive, cost-effective, and easily accessible screening tool that may identify subtle variations in linguistic (and by extension, neurocognitive) function that may not yet be identified by standard screening tools. Here, we explore the range of deep learning models that have been applied to language and cognition. We also compare their analytic approaches and available results, with a view to identifying which approach may translate most readily to clinical care.</div></div><div><h3>Methods</h3><div>We used a multi-faceted approach that included a literature review, brainstorming sessions with an interdisciplinary team and field experts, and targeted internet searches for relevant web-based resources. The focus of our search was to compile studies that explored the development and application of AI algorithms to identify subtle changes in speech patterns, linguistic features, and acoustic properties associated with the early stages of AD. We considered, but did not apply a traditional biomedical search algorithm, since the literature in this space is often found outside of the biomedical literature, and because this is an exploratory project. We noted that by analyzing extensive datasets of speech samples from both healthy individuals and those with AD, all the identified studies sought to establish robust predictive models for early detection. We further examined whether confounding variables present in current linguistic AD models, such as those arising from language barriers, are also present in trained deep learning models.</div></div><div><h3>Results</h3><div>Our investigation demonstrated the consistent application across the literature, of a multimodal system, encompassing both neural networks and traditional analysis models, which were fine-tuned for the early detection of Alzheimer's disease. Among these, the ADReSS dataset emerged as the most effective, with the ensemble method achieving the highest accuracy in predicting Alzheimer’s disease based on speech patterns. However, we noted a crucial limitation: the model’s training relied solely on English speech data. This restriction introduces bias and hinders generalizability. Languages exhibit distinct phonetic structures, accents, and rhythms, potentially causing a model trained exclusively on English to misinterpret speech from other languages. Furthermore, while deep neural networks excel at discerning complex patterns, their internal workings often remain opaque, making it challenging to ascertain the precise rationale behind specific predictions.</div></div><div><h3>Conclusions</h3><div>Our review identifies a notable body of literature that outlines a range of deep learning models that have already been applied to identifying cognitive changes through the use of language. With large language models gaining rapid popularity, there is a tremendous opportunity to gather data samples from natural language, and by pairing the right model with the right type of language data, powerful new screening tools may be developed. Our work points to two key areas for future prioritization: 1) developing models trained on diverse languages and 2) expanding existing datasets to encompass a wider range of linguistic variations, including various dialects and demographics. These advancements will contribute to more equitable and reliable speech-based Alzheimer's disease detection tools.</div></div>\",\"PeriodicalId\":55534,\"journal\":{\"name\":\"American Journal of Geriatric Psychiatry\",\"volume\":\"33 10\",\"pages\":\"Page S71\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American Journal of Geriatric Psychiatry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1064748125002088\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GERIATRICS & GERONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Geriatric Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1064748125002088","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
96. USING MACHINE LEARNING MODELS TO DETECT EARLY ALZHEIMER’S DISEASE THROUGH SPEECH ANALYSIS
Introduction
There is an urgent need for novel approaches that may facilitate early detection of Alzheimer's disease and thus, create targets for effective intervention and management. Current diagnostic methods often rely on expensive and/or time-consuming procedures such as brain imaging and cognitive assessments. A novel approach proposes leveraging AI/ML models to detect AD early through the analysis of spontaneous speech and language use. This method holds the potential to advance the process of AD diagnosis by offering a non-invasive, cost-effective, and easily accessible screening tool that may identify subtle variations in linguistic (and by extension, neurocognitive) function that may not yet be identified by standard screening tools. Here, we explore the range of deep learning models that have been applied to language and cognition. We also compare their analytic approaches and available results, with a view to identifying which approach may translate most readily to clinical care.
Methods
We used a multi-faceted approach that included a literature review, brainstorming sessions with an interdisciplinary team and field experts, and targeted internet searches for relevant web-based resources. The focus of our search was to compile studies that explored the development and application of AI algorithms to identify subtle changes in speech patterns, linguistic features, and acoustic properties associated with the early stages of AD. We considered, but did not apply a traditional biomedical search algorithm, since the literature in this space is often found outside of the biomedical literature, and because this is an exploratory project. We noted that by analyzing extensive datasets of speech samples from both healthy individuals and those with AD, all the identified studies sought to establish robust predictive models for early detection. We further examined whether confounding variables present in current linguistic AD models, such as those arising from language barriers, are also present in trained deep learning models.
Results
Our investigation demonstrated the consistent application across the literature, of a multimodal system, encompassing both neural networks and traditional analysis models, which were fine-tuned for the early detection of Alzheimer's disease. Among these, the ADReSS dataset emerged as the most effective, with the ensemble method achieving the highest accuracy in predicting Alzheimer’s disease based on speech patterns. However, we noted a crucial limitation: the model’s training relied solely on English speech data. This restriction introduces bias and hinders generalizability. Languages exhibit distinct phonetic structures, accents, and rhythms, potentially causing a model trained exclusively on English to misinterpret speech from other languages. Furthermore, while deep neural networks excel at discerning complex patterns, their internal workings often remain opaque, making it challenging to ascertain the precise rationale behind specific predictions.
Conclusions
Our review identifies a notable body of literature that outlines a range of deep learning models that have already been applied to identifying cognitive changes through the use of language. With large language models gaining rapid popularity, there is a tremendous opportunity to gather data samples from natural language, and by pairing the right model with the right type of language data, powerful new screening tools may be developed. Our work points to two key areas for future prioritization: 1) developing models trained on diverse languages and 2) expanding existing datasets to encompass a wider range of linguistic variations, including various dialects and demographics. These advancements will contribute to more equitable and reliable speech-based Alzheimer's disease detection tools.
期刊介绍:
The American Journal of Geriatric Psychiatry is the leading source of information in the rapidly evolving field of geriatric psychiatry. This esteemed journal features peer-reviewed articles covering topics such as the diagnosis and classification of psychiatric disorders in older adults, epidemiological and biological correlates of mental health in the elderly, and psychopharmacology and other somatic treatments. Published twelve times a year, the journal serves as an authoritative resource for professionals in the field.