通过基于视觉的骨骼-光学融合来区分脑卒中后患者和健康人。

IF 5.2 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Xiao Han, Ziyan Wang, Liping Li, Kongfa Hu
{"title":"通过基于视觉的骨骼-光学融合来区分脑卒中后患者和健康人。","authors":"Xiao Han, Ziyan Wang, Liping Li, Kongfa Hu","doi":"10.1186/s12984-025-01726-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>At present, the analysis of abnormal gait in post-stroke patients predominantly relies on wearable devices. However, with the advancements in computer vision technology, the integration of deep learning algorithms has introduced new possibilities for research. In particular, multi-modal fusion technology can effectively combine various modalities obtained through vision-based approaches, enabling more comprehensive and accurate representation of abnormal gait information in post-stroke patients.</p><p><strong>Methods: </strong>The study recruited 70 post-stroke patients and 70 healthy individuals to capture video recordings of their gait. We used Human Pose Estimation (HPE) to extract skeleton points from each frame and computed the optical flow information of these points and the corresponding angular variations of the lower limbs. Additionally, depth space features were extracted using ResNet-50 and subsequently integrated. For classification, a Long Short-Term Memory (LSTM) network was employed to analyze the fused features.</p><p><strong>Results: </strong>To evaluate the effectiveness of the feature extraction method, we tested it on both an open dataset and a self-collected clinical dataset, comparing it with CNN-RNN and Vision Transformer (ViT). The results from the LSTM network, after inputting the fused features, demonstrated optimal performance with 2 layers and 128 hidden units, achieving accuracies of 0.8794±0.0447 and 0.8778±0.0347, respectively.</p><p><strong>Conclusion: </strong>It was found that optical flow information calculated based on skeleton points, combined with variations in knee flexion and ankle dorsiflexion angles, improved the interpretability of the analytical framework. This improvement enables clinicians to gain a clearer understanding of the model's decision-making process, thereby increasing their confidence in its outputs. By employing a multi-modal fusion approach, information from different modalities is integrated, which not only broadens the analytical perspectives but also facilitates clinicians' deeper insights into the patient's gait characteristics.</p>","PeriodicalId":16384,"journal":{"name":"Journal of NeuroEngineering and Rehabilitation","volume":"22 1","pages":"209"},"PeriodicalIF":5.2000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12509394/pdf/","citationCount":"0","resultStr":"{\"title\":\"Differentiating post-stroke patients from healthy individuals via vision-based skeleton-optical fusion.\",\"authors\":\"Xiao Han, Ziyan Wang, Liping Li, Kongfa Hu\",\"doi\":\"10.1186/s12984-025-01726-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>At present, the analysis of abnormal gait in post-stroke patients predominantly relies on wearable devices. However, with the advancements in computer vision technology, the integration of deep learning algorithms has introduced new possibilities for research. In particular, multi-modal fusion technology can effectively combine various modalities obtained through vision-based approaches, enabling more comprehensive and accurate representation of abnormal gait information in post-stroke patients.</p><p><strong>Methods: </strong>The study recruited 70 post-stroke patients and 70 healthy individuals to capture video recordings of their gait. We used Human Pose Estimation (HPE) to extract skeleton points from each frame and computed the optical flow information of these points and the corresponding angular variations of the lower limbs. Additionally, depth space features were extracted using ResNet-50 and subsequently integrated. For classification, a Long Short-Term Memory (LSTM) network was employed to analyze the fused features.</p><p><strong>Results: </strong>To evaluate the effectiveness of the feature extraction method, we tested it on both an open dataset and a self-collected clinical dataset, comparing it with CNN-RNN and Vision Transformer (ViT). The results from the LSTM network, after inputting the fused features, demonstrated optimal performance with 2 layers and 128 hidden units, achieving accuracies of 0.8794±0.0447 and 0.8778±0.0347, respectively.</p><p><strong>Conclusion: </strong>It was found that optical flow information calculated based on skeleton points, combined with variations in knee flexion and ankle dorsiflexion angles, improved the interpretability of the analytical framework. This improvement enables clinicians to gain a clearer understanding of the model's decision-making process, thereby increasing their confidence in its outputs. By employing a multi-modal fusion approach, information from different modalities is integrated, which not only broadens the analytical perspectives but also facilitates clinicians' deeper insights into the patient's gait characteristics.</p>\",\"PeriodicalId\":16384,\"journal\":{\"name\":\"Journal of NeuroEngineering and Rehabilitation\",\"volume\":\"22 1\",\"pages\":\"209\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12509394/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of NeuroEngineering and Rehabilitation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1186/s12984-025-01726-5\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of NeuroEngineering and Rehabilitation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s12984-025-01726-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

背景:目前,脑卒中后患者异常步态的分析主要依靠可穿戴设备。然而,随着计算机视觉技术的进步,深度学习算法的集成为研究带来了新的可能性。特别是多模态融合技术可以有效地将基于视觉的方法获得的各种模态结合起来,使脑卒中后患者异常步态信息能够更全面、更准确地表征。方法:研究招募了70名中风后患者和70名健康人,对他们的步态进行视频记录。利用人体姿态估计(Human Pose Estimation, HPE)从每帧图像中提取骨架点,并计算这些点的光流信息和相应的下肢角度变化。此外,利用ResNet-50提取深度空间特征并进行整合。在分类方面,采用长短期记忆(LSTM)网络对融合特征进行分析。结果:为了评估特征提取方法的有效性,我们在开放数据集和自收集临床数据集上对其进行了测试,并将其与CNN-RNN和Vision Transformer (ViT)进行了比较。输入融合特征后的LSTM网络具有2层128个隐藏单元的最优性能,准确率分别为0.8794±0.0447和0.8778±0.0347。结论:基于骨骼点计算的光流信息,结合膝关节屈曲和踝关节背屈角度的变化,提高了分析框架的可解释性。这一改进使临床医生能够更清楚地了解模型的决策过程,从而增加他们对其输出的信心。通过采用多模态融合方法,整合了不同模态的信息,不仅拓宽了分析视角,而且有助于临床医生更深入地了解患者的步态特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Differentiating post-stroke patients from healthy individuals via vision-based skeleton-optical fusion.

Differentiating post-stroke patients from healthy individuals via vision-based skeleton-optical fusion.

Differentiating post-stroke patients from healthy individuals via vision-based skeleton-optical fusion.

Differentiating post-stroke patients from healthy individuals via vision-based skeleton-optical fusion.

Background: At present, the analysis of abnormal gait in post-stroke patients predominantly relies on wearable devices. However, with the advancements in computer vision technology, the integration of deep learning algorithms has introduced new possibilities for research. In particular, multi-modal fusion technology can effectively combine various modalities obtained through vision-based approaches, enabling more comprehensive and accurate representation of abnormal gait information in post-stroke patients.

Methods: The study recruited 70 post-stroke patients and 70 healthy individuals to capture video recordings of their gait. We used Human Pose Estimation (HPE) to extract skeleton points from each frame and computed the optical flow information of these points and the corresponding angular variations of the lower limbs. Additionally, depth space features were extracted using ResNet-50 and subsequently integrated. For classification, a Long Short-Term Memory (LSTM) network was employed to analyze the fused features.

Results: To evaluate the effectiveness of the feature extraction method, we tested it on both an open dataset and a self-collected clinical dataset, comparing it with CNN-RNN and Vision Transformer (ViT). The results from the LSTM network, after inputting the fused features, demonstrated optimal performance with 2 layers and 128 hidden units, achieving accuracies of 0.8794±0.0447 and 0.8778±0.0347, respectively.

Conclusion: It was found that optical flow information calculated based on skeleton points, combined with variations in knee flexion and ankle dorsiflexion angles, improved the interpretability of the analytical framework. This improvement enables clinicians to gain a clearer understanding of the model's decision-making process, thereby increasing their confidence in its outputs. By employing a multi-modal fusion approach, information from different modalities is integrated, which not only broadens the analytical perspectives but also facilitates clinicians' deeper insights into the patient's gait characteristics.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of NeuroEngineering and Rehabilitation
Journal of NeuroEngineering and Rehabilitation 工程技术-工程:生物医学
CiteScore
9.60
自引率
3.90%
发文量
122
审稿时长
24 months
期刊介绍: Journal of NeuroEngineering and Rehabilitation considers manuscripts on all aspects of research that result from cross-fertilization of the fields of neuroscience, biomedical engineering, and physical medicine & rehabilitation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信