Kévin Réby, Idris Dulau, Guillaume Dubrasquet, M. Beurton-Aimar
{"title":"用于物理康复评估的图形转换器","authors":"Kévin Réby, Idris Dulau, Guillaume Dubrasquet, M. Beurton-Aimar","doi":"10.1109/FG57933.2023.10042778","DOIUrl":null,"url":null,"abstract":"Physical rehabilitation is a medical specialty that focuses on the restoration of body functions in the safest and most effective way possible. During a rehabilitation exercise session, patients' behavior reflects their health status and is an important indicator of the treatment outcome. Building an automatic system for the evaluation of human motion quality in an objective and reliable way can be used in medicine to establish a differential diagnosis, to choose the adequate treatment, or for patient monitoring. Deep learning has become the state-of-the-art in Human Action and Human Behavior Recognition from videos. Most of the state-of-the-art model architectures are CNNs based and often use RNNs to calculate temporal dependencies among frames, and ignore the topological structure of the human body. In this work, we propose to use a Graph Transformer network with spatial and temporal attention mechanisms for physical rehabilitation evaluation. First, we used a standard Transformer network with a selfattention mechanism, then we take advantage of graph skeletons as inputs for a two-stream spatio-temporal graph network with both spatial and temporal attention mechanisms. We used our model on UI-PRMD, a benchmark dataset that provides skeleton data using motion capture systems. Our results show that our attention-based based ST-GCN models outperform the state-of-the-art methods on quality score prediction and binary classification.","PeriodicalId":318766,"journal":{"name":"2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Graph Transformer for Physical Rehabilitation Evaluation\",\"authors\":\"Kévin Réby, Idris Dulau, Guillaume Dubrasquet, M. Beurton-Aimar\",\"doi\":\"10.1109/FG57933.2023.10042778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Physical rehabilitation is a medical specialty that focuses on the restoration of body functions in the safest and most effective way possible. During a rehabilitation exercise session, patients' behavior reflects their health status and is an important indicator of the treatment outcome. Building an automatic system for the evaluation of human motion quality in an objective and reliable way can be used in medicine to establish a differential diagnosis, to choose the adequate treatment, or for patient monitoring. Deep learning has become the state-of-the-art in Human Action and Human Behavior Recognition from videos. Most of the state-of-the-art model architectures are CNNs based and often use RNNs to calculate temporal dependencies among frames, and ignore the topological structure of the human body. In this work, we propose to use a Graph Transformer network with spatial and temporal attention mechanisms for physical rehabilitation evaluation. First, we used a standard Transformer network with a selfattention mechanism, then we take advantage of graph skeletons as inputs for a two-stream spatio-temporal graph network with both spatial and temporal attention mechanisms. We used our model on UI-PRMD, a benchmark dataset that provides skeleton data using motion capture systems. Our results show that our attention-based based ST-GCN models outperform the state-of-the-art methods on quality score prediction and binary classification.\",\"PeriodicalId\":318766,\"journal\":{\"name\":\"2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FG57933.2023.10042778\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FG57933.2023.10042778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Graph Transformer for Physical Rehabilitation Evaluation
Physical rehabilitation is a medical specialty that focuses on the restoration of body functions in the safest and most effective way possible. During a rehabilitation exercise session, patients' behavior reflects their health status and is an important indicator of the treatment outcome. Building an automatic system for the evaluation of human motion quality in an objective and reliable way can be used in medicine to establish a differential diagnosis, to choose the adequate treatment, or for patient monitoring. Deep learning has become the state-of-the-art in Human Action and Human Behavior Recognition from videos. Most of the state-of-the-art model architectures are CNNs based and often use RNNs to calculate temporal dependencies among frames, and ignore the topological structure of the human body. In this work, we propose to use a Graph Transformer network with spatial and temporal attention mechanisms for physical rehabilitation evaluation. First, we used a standard Transformer network with a selfattention mechanism, then we take advantage of graph skeletons as inputs for a two-stream spatio-temporal graph network with both spatial and temporal attention mechanisms. We used our model on UI-PRMD, a benchmark dataset that provides skeleton data using motion capture systems. Our results show that our attention-based based ST-GCN models outperform the state-of-the-art methods on quality score prediction and binary classification.