Qiyao Hu , Jingyu Wang , Xianlin Peng , Tengfei Li , Rui Cao
{"title":"传统中国画元素的三维重建图像生成","authors":"Qiyao Hu , Jingyu Wang , Xianlin Peng , Tengfei Li , Rui Cao","doi":"10.1016/j.engappai.2025.111417","DOIUrl":null,"url":null,"abstract":"<div><div>This paper presents a comprehensive pipeline for generating detailed three-dimensional (3D) models from single images of traditional Chinese painting elements. This task is particularly challenging due to the lack of 3D datasets for Chinese paintings and the limited research on their 3D reconstruction. As a result, direct access to multiple views is precluded. We propose a novel method for the <strong>3D</strong> reconstruction of <strong>T</strong>raditional <strong>C</strong>hinese <strong>P</strong>ainting <strong>E</strong>lements, termed <strong>TCPE-3D</strong>, which has three components of the process. The first component is a multi-view synthesis module named One To Six (OTX) - Multi-View Generating (MVG) Module. This module creates six fixed-view images through a series of preprocessing steps. These images are used to generate the Local Light Field Fusion (LLFF) dataset within the Neural Radiance Fields (NeRF) synthesis module. This process leads to the creation of detailed mesh structures in the final Mesh Generation module. Comparison with several state-of-the-art 3D reconstruction methods shows that our framework achieves better visualization results and higher technical specifications. Additionally, it solves the Janus problem encountered by other algorithms for Chinese painting data. Our dataset is available at <span><span>https://github.com/LPDLG/3DTCP-Dataset</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"158 ","pages":"Article 111417"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Three-dimensional reconstruction image generation of traditional Chinese painting elements\",\"authors\":\"Qiyao Hu , Jingyu Wang , Xianlin Peng , Tengfei Li , Rui Cao\",\"doi\":\"10.1016/j.engappai.2025.111417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper presents a comprehensive pipeline for generating detailed three-dimensional (3D) models from single images of traditional Chinese painting elements. This task is particularly challenging due to the lack of 3D datasets for Chinese paintings and the limited research on their 3D reconstruction. As a result, direct access to multiple views is precluded. We propose a novel method for the <strong>3D</strong> reconstruction of <strong>T</strong>raditional <strong>C</strong>hinese <strong>P</strong>ainting <strong>E</strong>lements, termed <strong>TCPE-3D</strong>, which has three components of the process. The first component is a multi-view synthesis module named One To Six (OTX) - Multi-View Generating (MVG) Module. This module creates six fixed-view images through a series of preprocessing steps. These images are used to generate the Local Light Field Fusion (LLFF) dataset within the Neural Radiance Fields (NeRF) synthesis module. This process leads to the creation of detailed mesh structures in the final Mesh Generation module. Comparison with several state-of-the-art 3D reconstruction methods shows that our framework achieves better visualization results and higher technical specifications. Additionally, it solves the Janus problem encountered by other algorithms for Chinese painting data. Our dataset is available at <span><span>https://github.com/LPDLG/3DTCP-Dataset</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"158 \",\"pages\":\"Article 111417\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625014198\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625014198","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种从传统中国画元素单幅图像生成精细三维模型的综合流水线。由于缺乏中国绘画的三维数据集以及对其三维重建的研究有限,这项任务尤其具有挑战性。因此,直接访问多个视图被排除在外。我们提出了一种新的中国传统绘画元素的三维重建方法,称为TCPE-3D,它有三个组成部分的过程。第一个组件是名为One To Six (OTX) - multi-view Generating (MVG) module的多视图合成模块。本模块通过一系列预处理步骤创建六张固定视图图像。这些图像用于在神经辐射场合成模块中生成局部光场融合(LLFF)数据集。这个过程导致在最终的网格生成模块中创建详细的网格结构。与几种最先进的三维重建方法的对比表明,我们的框架具有更好的可视化效果和更高的技术指标。此外,它还解决了其他算法在处理中国画数据时遇到的Janus问题。我们的数据集可以在https://github.com/LPDLG/3DTCP-Dataset上找到。
Three-dimensional reconstruction image generation of traditional Chinese painting elements
This paper presents a comprehensive pipeline for generating detailed three-dimensional (3D) models from single images of traditional Chinese painting elements. This task is particularly challenging due to the lack of 3D datasets for Chinese paintings and the limited research on their 3D reconstruction. As a result, direct access to multiple views is precluded. We propose a novel method for the 3D reconstruction of Traditional Chinese Painting Elements, termed TCPE-3D, which has three components of the process. The first component is a multi-view synthesis module named One To Six (OTX) - Multi-View Generating (MVG) Module. This module creates six fixed-view images through a series of preprocessing steps. These images are used to generate the Local Light Field Fusion (LLFF) dataset within the Neural Radiance Fields (NeRF) synthesis module. This process leads to the creation of detailed mesh structures in the final Mesh Generation module. Comparison with several state-of-the-art 3D reconstruction methods shows that our framework achieves better visualization results and higher technical specifications. Additionally, it solves the Janus problem encountered by other algorithms for Chinese painting data. Our dataset is available at https://github.com/LPDLG/3DTCP-Dataset.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.