CLDM-Palm:一种基于bsamzier曲线的高保真掌纹生成的可控潜在扩散模型

IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yuanpan Zhu, Donghuai Jia, Kevin Chu, Wenshuang Zhi, Weide Li, Shukai Chen
{"title":"CLDM-Palm:一种基于bsamzier曲线的高保真掌纹生成的可控潜在扩散模型","authors":"Yuanpan Zhu,&nbsp;Donghuai Jia,&nbsp;Kevin Chu,&nbsp;Wenshuang Zhi,&nbsp;Weide Li,&nbsp;Shukai Chen","doi":"10.1007/s10489-025-06923-2","DOIUrl":null,"url":null,"abstract":"<div><p>Using limited real data to synthesize realistic palmprints and expand training samples for recognition models has become a promising direction in palmprint recognition. However, the pseudo-palmprints generated by existing models still exhibit significant discrepancies from real ones, particularly in crease structures and fine-grained details. In this paper, we first introduce Latent Diffusion Models (LDM) as the backbone to improve the quality of palmprint generation. Secondly, to incorporate Bézier curves as control conditions into the model, we propose the Palm-to-Bézier Module (P2BM), which maps real palmprints to their corresponding Bézier-style pseudo-Bézier curves. These curves establish the connection between real palmprints and Bézier curves, which are used as conditional inputs during diffusion model training. At inference time, Bézier curves can be provided as conditions to generate high-resolution, fine-grained, and highly realistic synthetic palmprints. Thirdly, to enable Bézier curves to better model palmprint creases, we propose 12 Bézier curves templates based on real crease distribution priors. With only 10-step Denoising Diffusion Implicit Models (DDIM) sampling, our method achieves a significantly lower Fréchet Inception Distance (FID) compared to existing palmprint generation approaches. Moreover, the recognition models trained on the synthetic palmprints generated by our model achieve new state-of-the-art results in both Fisher Discriminant Ratio (FDR) and <i>TAR@FAR</i> metrics. Under a 1:1 train-test fine-tuning setting, our model improves average <i>TAR@FAR</i>=<span>\\(10^{-6}\\)</span> performance by over <span>\\(10\\%\\)</span> compared to prior methods. We name our model CLDM-Palm (Controllable Latent Diffusion Model-Palm).</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 15","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CLDM-Palm: A controllable latent diffusion model for high-fidelity palmprint generation based on Bézier curves\",\"authors\":\"Yuanpan Zhu,&nbsp;Donghuai Jia,&nbsp;Kevin Chu,&nbsp;Wenshuang Zhi,&nbsp;Weide Li,&nbsp;Shukai Chen\",\"doi\":\"10.1007/s10489-025-06923-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Using limited real data to synthesize realistic palmprints and expand training samples for recognition models has become a promising direction in palmprint recognition. However, the pseudo-palmprints generated by existing models still exhibit significant discrepancies from real ones, particularly in crease structures and fine-grained details. In this paper, we first introduce Latent Diffusion Models (LDM) as the backbone to improve the quality of palmprint generation. Secondly, to incorporate Bézier curves as control conditions into the model, we propose the Palm-to-Bézier Module (P2BM), which maps real palmprints to their corresponding Bézier-style pseudo-Bézier curves. These curves establish the connection between real palmprints and Bézier curves, which are used as conditional inputs during diffusion model training. At inference time, Bézier curves can be provided as conditions to generate high-resolution, fine-grained, and highly realistic synthetic palmprints. Thirdly, to enable Bézier curves to better model palmprint creases, we propose 12 Bézier curves templates based on real crease distribution priors. With only 10-step Denoising Diffusion Implicit Models (DDIM) sampling, our method achieves a significantly lower Fréchet Inception Distance (FID) compared to existing palmprint generation approaches. Moreover, the recognition models trained on the synthetic palmprints generated by our model achieve new state-of-the-art results in both Fisher Discriminant Ratio (FDR) and <i>TAR@FAR</i> metrics. Under a 1:1 train-test fine-tuning setting, our model improves average <i>TAR@FAR</i>=<span>\\\\(10^{-6}\\\\)</span> performance by over <span>\\\\(10\\\\%\\\\)</span> compared to prior methods. We name our model CLDM-Palm (Controllable Latent Diffusion Model-Palm).</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 15\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-025-06923-2\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06923-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

利用有限的真实数据合成真实掌纹,扩大识别模型的训练样本,已成为掌纹识别研究的一个有前景的方向。然而,现有模型生成的伪掌纹与真实掌纹仍然存在显著差异,特别是在折痕结构和细粒度细节方面。本文首先引入潜在扩散模型(Latent Diffusion Models, LDM)作为主干来提高掌纹生成的质量。其次,为了将bsamzier曲线作为控制条件纳入模型,我们提出了palm -to- bsamzier模块(P2BM),该模块将真实掌纹映射到相应的bsamzier型伪bsamzier曲线。这些曲线建立了真实掌纹和bsamzier曲线之间的联系,bsamzier曲线作为扩散模型训练过程中的条件输入。在推断时,b zier曲线可以作为条件提供,以生成高分辨率、细粒度和高度逼真的合成掌纹。第三,为了更好地模拟掌纹折痕,我们提出了12个基于真实折痕分布先验的bsamzier曲线模板。与现有的掌纹生成方法相比,我们的方法仅使用10步去噪扩散隐式模型(DDIM)采样,实现了显着降低的fr起始距离(FID)。此外,在我们的模型生成的合成掌纹上训练的识别模型在Fisher判别比(FDR)和TAR@FAR指标上都取得了最新的结果。在1:1训练测试微调设置下,与之前的方法相比,我们的模型将平均TAR@FAR= \(10^{-6}\)性能提高了\(10\%\)以上。我们将模型命名为CLDM-Palm(可控潜伏扩散模型- palm)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CLDM-Palm: A controllable latent diffusion model for high-fidelity palmprint generation based on Bézier curves

Using limited real data to synthesize realistic palmprints and expand training samples for recognition models has become a promising direction in palmprint recognition. However, the pseudo-palmprints generated by existing models still exhibit significant discrepancies from real ones, particularly in crease structures and fine-grained details. In this paper, we first introduce Latent Diffusion Models (LDM) as the backbone to improve the quality of palmprint generation. Secondly, to incorporate Bézier curves as control conditions into the model, we propose the Palm-to-Bézier Module (P2BM), which maps real palmprints to their corresponding Bézier-style pseudo-Bézier curves. These curves establish the connection between real palmprints and Bézier curves, which are used as conditional inputs during diffusion model training. At inference time, Bézier curves can be provided as conditions to generate high-resolution, fine-grained, and highly realistic synthetic palmprints. Thirdly, to enable Bézier curves to better model palmprint creases, we propose 12 Bézier curves templates based on real crease distribution priors. With only 10-step Denoising Diffusion Implicit Models (DDIM) sampling, our method achieves a significantly lower Fréchet Inception Distance (FID) compared to existing palmprint generation approaches. Moreover, the recognition models trained on the synthetic palmprints generated by our model achieve new state-of-the-art results in both Fisher Discriminant Ratio (FDR) and TAR@FAR metrics. Under a 1:1 train-test fine-tuning setting, our model improves average TAR@FAR=\(10^{-6}\) performance by over \(10\%\) compared to prior methods. We name our model CLDM-Palm (Controllable Latent Diffusion Model-Palm).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Intelligence
Applied Intelligence 工程技术-计算机:人工智能
CiteScore
6.60
自引率
20.80%
发文量
1361
审稿时长
5.9 months
期刊介绍: With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信