Cognitive status assessment of older adults - test administration by conversational artificial intelligence (AI) chatbot: proof-of-concept investigation.
Anastasia Serafimovska, Katrina Swavley, Alice Zhang Qian Ao, Kirsten L Challinor, Tony Florio
{"title":"Cognitive status assessment of older adults - test administration by conversational artificial intelligence (AI) chatbot: proof-of-concept investigation.","authors":"Anastasia Serafimovska, Katrina Swavley, Alice Zhang Qian Ao, Kirsten L Challinor, Tony Florio","doi":"10.1080/13803395.2025.2542248","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The Telephone Interview for Cognitive Status-Modified (TICS-M) is a widely utilized tool for remotely assessing cognitive function, particularly among community-dwelling older adults who are unable to attend in-person evaluations. In healthcare, AI has the potential to enhance service delivery by increasing efficiency, expanding accessibility, and reducing the cost per service. Using a conversational AI chatbot, we automated administration of TICS-M (traditionally administered by psychologists), referring to this chatbot-administered version as TICS-M-AI. The aim was to investigate proof-of-concept for chatbot automation of cognitive assessment. We report three studies evaluating psychometric properties of TICS-M-AI and an additional study on safety.</p><p><strong>Method: </strong>Study1: Concurrent validity of the TICS-M-AI was assessed by administration of the TICS-M (by Psychologist) and the TICS-M-AI to the same participants (<i>n</i> = 100), one week apart. Study 2: Test-retest reliability was assessed by administering the TICS-M-AI twice to each participant, one week apart (<i>n</i> = 82) and comparing results. Study 3: Construct validity was assessed by attempted replication, using TICS-M-AI data (<i>n</i> = 264), of a previously published study by Lindgren et al. (2019) of item response patterns observed using data obtained by traditional clinician administered TICS-M. Study 4: Safety was assessed by comparing rates of reported assessment-related distress between TICS-M (<i>n</i> = 100) and TICS-M-AI (<i>n</i> = 264) administrations.</p><p><strong>Results: </strong>TICS-M-AI concurrent validity (<i>r</i> = 0.81, 88% classification agreement, κ = 0.73) with the TICS-M and good test-retest reliability (<i>r</i> = 0.76, ICC = 0.72, 83% agreement, κ = 0.65). Using the TICS-M-AI we replicated Lindgren et al. (2019) result which used the TICS-M.</p><p><strong>Conclusions: </strong>TICS-M-AI administered by an AI chatbot performed well compared to traditional TICS-M administration by a psychologist. TICS-M-AI is reliable, valid, and equally safe with added advantages of lower cost, scalability, and broader accessibility. Future research should address generalizability across diverse populations and refine AI adaptability.</p>","PeriodicalId":15382,"journal":{"name":"Journal of clinical and experimental neuropsychology","volume":" ","pages":"472-484"},"PeriodicalIF":1.7000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of clinical and experimental neuropsychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1080/13803395.2025.2542248","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The Telephone Interview for Cognitive Status-Modified (TICS-M) is a widely utilized tool for remotely assessing cognitive function, particularly among community-dwelling older adults who are unable to attend in-person evaluations. In healthcare, AI has the potential to enhance service delivery by increasing efficiency, expanding accessibility, and reducing the cost per service. Using a conversational AI chatbot, we automated administration of TICS-M (traditionally administered by psychologists), referring to this chatbot-administered version as TICS-M-AI. The aim was to investigate proof-of-concept for chatbot automation of cognitive assessment. We report three studies evaluating psychometric properties of TICS-M-AI and an additional study on safety.
Method: Study1: Concurrent validity of the TICS-M-AI was assessed by administration of the TICS-M (by Psychologist) and the TICS-M-AI to the same participants (n = 100), one week apart. Study 2: Test-retest reliability was assessed by administering the TICS-M-AI twice to each participant, one week apart (n = 82) and comparing results. Study 3: Construct validity was assessed by attempted replication, using TICS-M-AI data (n = 264), of a previously published study by Lindgren et al. (2019) of item response patterns observed using data obtained by traditional clinician administered TICS-M. Study 4: Safety was assessed by comparing rates of reported assessment-related distress between TICS-M (n = 100) and TICS-M-AI (n = 264) administrations.
Results: TICS-M-AI concurrent validity (r = 0.81, 88% classification agreement, κ = 0.73) with the TICS-M and good test-retest reliability (r = 0.76, ICC = 0.72, 83% agreement, κ = 0.65). Using the TICS-M-AI we replicated Lindgren et al. (2019) result which used the TICS-M.
Conclusions: TICS-M-AI administered by an AI chatbot performed well compared to traditional TICS-M administration by a psychologist. TICS-M-AI is reliable, valid, and equally safe with added advantages of lower cost, scalability, and broader accessibility. Future research should address generalizability across diverse populations and refine AI adaptability.
期刊介绍:
Journal of Clinical and Experimental Neuropsychology ( JCEN) publishes research on the neuropsychological consequences of brain disease, disorders, and dysfunction, and aims to promote the integration of theories, methods, and research findings in clinical and experimental neuropsychology. The primary emphasis of JCEN is to publish original empirical research pertaining to brain-behavior relationships and neuropsychological manifestations of brain disease. Theoretical and methodological papers, critical reviews of content areas, and theoretically-relevant case studies are also welcome.