{"title":"Skeleton-based human motion prediction via spatio and position encoding transformer network","authors":"Lingchao Mi, Rui Ding, Xiaodong Zhang","doi":"10.1117/12.2667366","DOIUrl":null,"url":null,"abstract":"Many transformer modules, have been applied to computer vision. However, the transformer can extract the distal connections of human skeleton points and apply the attention mechanism to the problem of predicting human motion pose. We introduce a transformer module in the joint dimension. In addition, the Encoder module of the transformer is improved. Finally, our method achieves impressive results on benchmark datasets, including short- and long-term predictions of FNTU, confirming its effectiveness and efficiency.","PeriodicalId":137914,"journal":{"name":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2667366","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Many transformer modules, have been applied to computer vision. However, the transformer can extract the distal connections of human skeleton points and apply the attention mechanism to the problem of predicting human motion pose. We introduce a transformer module in the joint dimension. In addition, the Encoder module of the transformer is improved. Finally, our method achieves impressive results on benchmark datasets, including short- and long-term predictions of FNTU, confirming its effectiveness and efficiency.