Katharina Schultebraucks, Vijay Yadav, Isaac R Galatzer-Levy
{"title":"利用基于机器学习的计算机视觉和语音分析来获得创伤幸存者认知功能的数字生物标志物。","authors":"Katharina Schultebraucks, Vijay Yadav, Isaac R Galatzer-Levy","doi":"10.1159/000512394","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual's clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources.</p><p><strong>Methods: </strong>We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains.</p><p><strong>Results: </strong>Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (<i>R</i><sup>2</sup> = 0.52), processing speed (<i>R</i><sup>2</sup> = 0.42), emotional bias (<i>R</i><sup>2</sup> = 0.52), sustained attention (<i>R</i><sup>2</sup> = 0.51), controlled attention (<i>R</i><sup>2</sup> = 0.44), cognitive flexibility (<i>R</i><sup>2</sup> = 0.43), cognitive inhibition (<i>R</i><sup>2</sup> = 0.64), and executive functioning (<i>R</i><sup>2</sup> = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains.</p><p><strong>Conclusions: </strong>The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"5 1","pages":"16-23"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1159/000512394","citationCount":"11","resultStr":"{\"title\":\"Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors.\",\"authors\":\"Katharina Schultebraucks, Vijay Yadav, Isaac R Galatzer-Levy\",\"doi\":\"10.1159/000512394\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual's clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources.</p><p><strong>Methods: </strong>We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains.</p><p><strong>Results: </strong>Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (<i>R</i><sup>2</sup> = 0.52), processing speed (<i>R</i><sup>2</sup> = 0.42), emotional bias (<i>R</i><sup>2</sup> = 0.52), sustained attention (<i>R</i><sup>2</sup> = 0.51), controlled attention (<i>R</i><sup>2</sup> = 0.44), cognitive flexibility (<i>R</i><sup>2</sup> = 0.43), cognitive inhibition (<i>R</i><sup>2</sup> = 0.64), and executive functioning (<i>R</i><sup>2</sup> = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains.</p><p><strong>Conclusions: </strong>The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.</p>\",\"PeriodicalId\":11242,\"journal\":{\"name\":\"Digital Biomarkers\",\"volume\":\"5 1\",\"pages\":\"16-23\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1159/000512394\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Biomarkers\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1159/000512394\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2021/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Biomarkers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000512394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors.
Background: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual's clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources.
Methods: We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains.
Results: Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (R2 = 0.52), processing speed (R2 = 0.42), emotional bias (R2 = 0.52), sustained attention (R2 = 0.51), controlled attention (R2 = 0.44), cognitive flexibility (R2 = 0.43), cognitive inhibition (R2 = 0.64), and executive functioning (R2 = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains.
Conclusions: The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.