Human Motion Synthesis: A Diffusion Approach for Motion Stitching and In-Betweening

Michael Adewole, Oluwaseyi Giwa, Favour Nerrise, Martins Osifeko, Ajibola Oyedeji

arXiv:2409.06791 · arXiv - CS - Human-Computer Interaction · Published 2024-09-10
Human motion generation is an important area of research in many fields. In
this work, we tackle the problem of motion stitching and in-betweening. Current
methods either require manual effort or are incapable of handling longer
sequences. To address these challenges, we propose a diffusion model with a
transformer-based denoiser to generate realistic human motion. Our method
demonstrates strong performance in generating in-betweening sequences,
transforming a variable number of input poses into smooth, realistic motion
sequences of 75 frames at 15 fps, for a total duration of 5 seconds. We
evaluate our method using quantitative metrics such as Fréchet Inception
Distance (FID), Diversity, and Multimodality, along with visual assessments of
the generated outputs.
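To illustrate the kind of pipeline the abstract describes, the sketch below shows a reverse-diffusion sampling loop for conditional motion in-betweening. This is not the authors' implementation: the linear noise schedule, the joint count, the number of diffusion steps, and the stand-in `denoiser` function are all assumptions made for this sketch; a real system would use a trained transformer-based denoiser attending over the frame sequence.

```python
import numpy as np

FRAMES, JOINTS, DIM = 75, 22, 3   # 75 frames at 15 fps -> 5 s; joint count is illustrative
T = 50                            # number of diffusion steps (assumed)

# Standard DDPM-style quantities from a linear beta schedule (assumed schedule).
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

def denoiser(x_t, t, keyframes, mask):
    """Stand-in for the transformer-based denoiser: predicts the noise in x_t.
    A real model would attend over all frames and condition on the keyframes."""
    return x_t * 0.1  # placeholder prediction, not a trained network

def sample_inbetween(keyframes, mask, rng):
    """Sample a (FRAMES, JOINTS, DIM) motion; `mask` marks frames fixed to input poses."""
    x = rng.standard_normal((FRAMES, JOINTS, DIM))
    for t in reversed(range(T)):
        eps = denoiser(x, t, keyframes, mask)
        coef = betas[t] / np.sqrt(1.0 - alpha_bars[t])
        x = (x - coef * eps) / np.sqrt(alphas[t])
        if t > 0:
            x += np.sqrt(betas[t]) * rng.standard_normal(x.shape)
        # Inpainting-style conditioning: overwrite the known frames with the given poses,
        # so the generated motion stays pinned to the input keyframes.
        x[mask] = keyframes[mask]
    return x

rng = np.random.default_rng(0)
keyframes = np.zeros((FRAMES, JOINTS, DIM))       # known poses (zeros as dummy data)
mask = np.zeros(FRAMES, dtype=bool)
mask[[0, FRAMES - 1]] = True                      # condition on the first and last pose
motion = sample_inbetween(keyframes, mask, rng)
print(motion.shape)  # (75, 22, 3)
```

The inpainting-style conditioning used here (overwriting known frames at each denoising step) is one common way to turn an unconditional motion diffusion model into an in-betweening model; whether the paper uses this mechanism or explicit conditioning inside the denoiser is not stated in the abstract.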