Zijian Zhang, Muqing Wu, Honghao Qi, Tianyi Ma, Min Zhao
{"title":"SMPL-IKS: A Mixed Analytical-Neural Inverse Kinematics Solver for 3D Human Mesh Recovery","authors":"Zijian Zhang, Muqing Wu, Honghao Qi, Tianyi Ma, Min Zhao","doi":"10.1007/s11263-025-02574-5","DOIUrl":null,"url":null,"abstract":"<p>We present SMPL-IKS, a mixed analytical-neural inverse kinematics solver that operates on the well-known Skinned Multi-Person Linear model (SMPL) to recover human mesh from 3D skeleton. The key challenges in the task are threefold: (1) Shape Mismatching, (2) Error Accumulation, and (3) Rotation Ambiguity. Unlike previous methods that rely on costly vertex up-sampling or iterative optimization, SMPL-IKS directly regresses the SMPL parameters (<i>i.e.</i>, shape and pose parameters) in a clean and efficient way. Specifically, we propose to infer <i>skeleton-to-mesh</i> via three explicit mappings viz. <i>Shape Inverse (SI)</i>, <i>Inverse kinematics (IK)</i>, and <i>Pose Refinement (PR)</i>. SI maps bone length to shape parameters, IK maps bone direction to pose parameters, and PR addresses errors accumulated along the kinematic tree. SMPL-IKS is general and thus extensible to MANO or SMPL-H models. Extensive experiments are conducted on various benchmarks for body-only, hand-only, and body-hand scenarios. Our model surpasses state-of-the-art methods by a large margin while being much more efficient. Data and code are available at https://github.com/Z-Z-J/SMPL-IKS.</p>","PeriodicalId":13752,"journal":{"name":"International Journal of Computer Vision","volume":"31 1","pages":""},"PeriodicalIF":9.3000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Vision","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11263-025-02574-5","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
We present SMPL-IKS, a mixed analytical-neural inverse kinematics solver that operates on the well-known Skinned Multi-Person Linear model (SMPL) to recover human mesh from 3D skeleton. The key challenges in the task are threefold: (1) Shape Mismatching, (2) Error Accumulation, and (3) Rotation Ambiguity. Unlike previous methods that rely on costly vertex up-sampling or iterative optimization, SMPL-IKS directly regresses the SMPL parameters (i.e., shape and pose parameters) in a clean and efficient way. Specifically, we propose to infer skeleton-to-mesh via three explicit mappings viz. Shape Inverse (SI), Inverse kinematics (IK), and Pose Refinement (PR). SI maps bone length to shape parameters, IK maps bone direction to pose parameters, and PR addresses errors accumulated along the kinematic tree. SMPL-IKS is general and thus extensible to MANO or SMPL-H models. Extensive experiments are conducted on various benchmarks for body-only, hand-only, and body-hand scenarios. Our model surpasses state-of-the-art methods by a large margin while being much more efficient. Data and code are available at https://github.com/Z-Z-J/SMPL-IKS.
期刊介绍:
The International Journal of Computer Vision (IJCV) serves as a platform for sharing new research findings in the rapidly growing field of computer vision. It publishes 12 issues annually and presents high-quality, original contributions to the science and engineering of computer vision. The journal encompasses various types of articles to cater to different research outputs.
Regular articles, which span up to 25 journal pages, focus on significant technical advancements that are of broad interest to the field. These articles showcase substantial progress in computer vision.
Short articles, limited to 10 pages, offer a swift publication path for novel research outcomes. They provide a quicker means for sharing new findings with the computer vision community.
Survey articles, comprising up to 30 pages, offer critical evaluations of the current state of the art in computer vision or offer tutorial presentations of relevant topics. These articles provide comprehensive and insightful overviews of specific subject areas.
In addition to technical articles, the journal also includes book reviews, position papers, and editorials by prominent scientific figures. These contributions serve to complement the technical content and provide valuable perspectives.
The journal encourages authors to include supplementary material online, such as images, video sequences, data sets, and software. This additional material enhances the understanding and reproducibility of the published research.
Overall, the International Journal of Computer Vision is a comprehensive publication that caters to researchers in this rapidly growing field. It covers a range of article types, offers additional online resources, and facilitates the dissemination of impactful research.