{"title":"EVA: Key values eclosion with space anchor used in hand pose estimation and shape reconstruction","authors":"Xuefeng Li , Xiangbo Lin","doi":"10.1016/j.ins.2025.122003","DOIUrl":null,"url":null,"abstract":"<div><div>3D hand pose estimation and shape reconstruction from single RGB image face challenges of self-occlusion, object occlusion, and depth ambiguity. Previous methods tried efforts to detect relevant information from images directly. Differently, this paper considers the task as a union of detection and generation. A novel framework called Key Value Eclosion is proposed. It utilizes powerful Diffusion generation strategies to gradually generate and refine occluded joints, vertices, and depth, using visible 2D joint locations as clues. To make the latent codes more comprehensive for hand shape reconstruction, 2D image features are transformed into 3D space using the proposed Space Anchor based feature inverse projection strategy. Integrating the Space Anchor based feature inverse projection into the Key Values Eclosion framework, a complete hand pose estimation and shape reconstruction model called EVA is constructed. The EVA model demonstrates excellent accuracy on both aligned and unaligned metrics using the HO-3D and DexYCB datasets. Especially, the improvement on Mean Error and Trans&Scale metrics are about 30%~50%, compared to state-of-the-art methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"706 ","pages":"Article 122003"},"PeriodicalIF":8.1000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525001355","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
3D hand pose estimation and shape reconstruction from single RGB image face challenges of self-occlusion, object occlusion, and depth ambiguity. Previous methods tried efforts to detect relevant information from images directly. Differently, this paper considers the task as a union of detection and generation. A novel framework called Key Value Eclosion is proposed. It utilizes powerful Diffusion generation strategies to gradually generate and refine occluded joints, vertices, and depth, using visible 2D joint locations as clues. To make the latent codes more comprehensive for hand shape reconstruction, 2D image features are transformed into 3D space using the proposed Space Anchor based feature inverse projection strategy. Integrating the Space Anchor based feature inverse projection into the Key Values Eclosion framework, a complete hand pose estimation and shape reconstruction model called EVA is constructed. The EVA model demonstrates excellent accuracy on both aligned and unaligned metrics using the HO-3D and DexYCB datasets. Especially, the improvement on Mean Error and Trans&Scale metrics are about 30%~50%, compared to state-of-the-art methods.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.