{"title":"基于空间和位置编码变压器网络的骨骼运动预测","authors":"Lingchao Mi, Rui Ding, Xiaodong Zhang","doi":"10.1117/12.2667366","DOIUrl":null,"url":null,"abstract":"Many transformer modules, have been applied to computer vision. However, the transformer can extract the distal connections of human skeleton points and apply the attention mechanism to the problem of predicting human motion pose. We introduce a transformer module in the joint dimension. In addition, the Encoder module of the transformer is improved. Finally, our method achieves impressive results on benchmark datasets, including short- and long-term predictions of FNTU, confirming its effectiveness and efficiency.","PeriodicalId":137914,"journal":{"name":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Skeleton-based human motion prediction via spatio and position encoding transformer network\",\"authors\":\"Lingchao Mi, Rui Ding, Xiaodong Zhang\",\"doi\":\"10.1117/12.2667366\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many transformer modules, have been applied to computer vision. However, the transformer can extract the distal connections of human skeleton points and apply the attention mechanism to the problem of predicting human motion pose. We introduce a transformer module in the joint dimension. In addition, the Encoder module of the transformer is improved. Finally, our method achieves impressive results on benchmark datasets, including short- and long-term predictions of FNTU, confirming its effectiveness and efficiency.\",\"PeriodicalId\":137914,\"journal\":{\"name\":\"International Conference on Artificial Intelligence, Virtual Reality, and Visualization\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Artificial Intelligence, Virtual Reality, and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2667366\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2667366","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Skeleton-based human motion prediction via spatio and position encoding transformer network
Many transformer modules, have been applied to computer vision. However, the transformer can extract the distal connections of human skeleton points and apply the attention mechanism to the problem of predicting human motion pose. We introduce a transformer module in the joint dimension. In addition, the Encoder module of the transformer is improved. Finally, our method achieves impressive results on benchmark datasets, including short- and long-term predictions of FNTU, confirming its effectiveness and efficiency.