Yu Du , Mingyang Zhao , Jian Shi , Lubin Fan , Dong-Ming Yan
{"title":"Efficient roof reconstruction from a single aerial image","authors":"Yu Du , Mingyang Zhao , Jian Shi , Lubin Fan , Dong-Ming Yan","doi":"10.1016/j.cag.2025.104355","DOIUrl":"10.1016/j.cag.2025.104355","url":null,"abstract":"<div><div>We present a novel and fast LoD2 (level of detail) modeling method for generating high-quality 3D polygonal meshes from <em>single aerial images</em>. Our approach initially detects line segments to extract polygons as vectorized contours for target roofs. Subsequently, the topological structures of 2D roofs are generated and then optimized by solving geometric constraints. Finally, refined 2D roof structures are lifted to 3D meshes using a geometry-preserving extrusion algorithm. The proposed approach achieves superior quality compared to existing point cloud-based algorithms. In addition, our pipeline allows for interactions between steps. With simple user adjustments, our method can handle complicated roof cases, significantly improving efficiency compared to manual modeling software.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104355"},"PeriodicalIF":2.8,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144851886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chengzhang Zhu , Hangyu Ji , Hanzhen Liu , Shuhan Tang , Xiaokuan Kang , Yalong Xiao
{"title":"DM-SegNet: Dual-Mamba architecture for 3D medical image segmentation with global context modeling","authors":"Chengzhang Zhu , Hangyu Ji , Hanzhen Liu , Shuhan Tang , Xiaokuan Kang , Yalong Xiao","doi":"10.1016/j.cag.2025.104334","DOIUrl":"10.1016/j.cag.2025.104334","url":null,"abstract":"<div><div>Accurate 3D medical image segmentation demands architectures capable of reconciling global context modeling with spatial topology preservation. While State Space Models (SSMs) like Mamba show potential for sequence modeling, existing medical SSMs suffer from encoder–decoder incompatibility: the encoder’s 1D sequence flattening compromises spatial structures, while conventional decoders fail to leverage Mamba’s state propagation. We present DM-SegNet, a Dual-Mamba architecture integrating directional state transitions with anatomy-aware hierarchical decoding. The core innovations include a quadri-directional spatial Mamba module employing four-directional 3D scanning to maintain anatomical spatial coherence, a gated spatial convolution layer that enhances spatially sensitive feature representation prior to state modeling, and a Mamba-driven decoding framework enabling bidirectional state synchronization across scales. Extensive evaluation on two clinically significant benchmarks demonstrates the efficacy of DM-SegNet: achieving state-of-the-art Dice Similarity Coefficient (DSC) of 85.44% on the Synapse dataset for abdominal organ segmentation and 90.22% on the BraTS2023 dataset for brain tumor segmentation.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104334"},"PeriodicalIF":2.8,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144860264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christopher Otto , Prashanth Chandran , Sebastian Weiss , Markus Gross , Gaspard Zoss , Derek Bradley
{"title":"Multimodal Conditional 3D Face Geometry Generation","authors":"Christopher Otto , Prashanth Chandran , Sebastian Weiss , Markus Gross , Gaspard Zoss , Derek Bradley","doi":"10.1016/j.cag.2025.104325","DOIUrl":"10.1016/j.cag.2025.104325","url":null,"abstract":"<div><div>We present a new method for multimodal conditional 3D face geometry generation that allows user-friendly control over the output identity and expression via a number of different conditioning signals. Within a single model, we demonstrate 3D faces generated from artistic sketches, portrait photos, Canny edges, FLAME face model parameters, 2D face landmarks, or text prompts. Our approach is based on a diffusion process that generates 3D geometry in a 2D parameterized UV domain. Geometry generation passes each conditioning signal through a set of cross-attention layers (IP-Adapter), one set for each user-defined conditioning signal. The result is an easy-to-use 3D face generation tool that produces topology-consistent, high-quality geometry with fine-grain user control.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104325"},"PeriodicalIF":2.8,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144880353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gisele Holtz , Waldemar Celes , Luiz Fernando Martha
{"title":"Semi-implicit representation of discrete black oil reservoir models","authors":"Gisele Holtz , Waldemar Celes , Luiz Fernando Martha","doi":"10.1016/j.cag.2025.104373","DOIUrl":"10.1016/j.cag.2025.104373","url":null,"abstract":"<div><div>Numerical black oil reservoir simulation is widely employed in the oil and gas industry for planning and predicting field explorations. Reservoir simulation relies on a topologically structured grid of hexahedral cells, modeling complex geological features. Visualization of these large-scale simulated discrete models is essential for inspection and analysis but presents challenges due to their size and geometric complexity. While conventional methods focus on rendering external faces, advanced techniques, like volume rendering, require full model representation on the GPU and an efficient point-location algorithm. Previous works proposed GPU-based model representations, but real-time applications remain limited by performance. Simplified alternatives improve speed but at the cost of accuracy. This work introduces a compact semi-implicit representation of discrete reservoir models that combines a topological coordinate field and a depth map to enable the implementation of fast and accurate visualization algorithms. Computational experiments demonstrate the advantage of the proposed representation regarding memory usage and performance while preserving the visual quality of rendered images.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104373"},"PeriodicalIF":2.8,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144829925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Geodesic heatmap-based segmentation-free framework for robust tooth landmark detection","authors":"Weijie Liu, Shaojie Zhuang, Yeying Fan, Guangshun Wei, Yuanfeng Zhou","doi":"10.1016/j.cag.2025.104332","DOIUrl":"10.1016/j.cag.2025.104332","url":null,"abstract":"<div><div>Automatic tooth landmark detection is a crucial component in orthodontic treatment, aiding in tooth morphology assessment, treatment planning, and oral health monitoring. However, challenges remain due to diverse landmark types, varying landmark quantities, anatomical variations, and dental abnormalities. To address these issues, this paper proposes a robust two-stage framework for precise landmark detection. In the first stage, an adaptive partitioning strategy employs a lightweight network to predict tooth centroids, which are then used to partition the dental points into localized patches, eliminating the need for precise segmentation. In the second stage, a geodesic distance-based heatmap is introduced to improve landmark detection accuracy. Furthermore, an anatomy-aware spatial augmentation strategy is proposed to simulate clinically challenging scenarios, thereby improving the model’s learning capability and its robustness, particularly in cases of abnormal teeth. Extensive experimental results on a public dataset demonstrate the superiority of our method, with significant improvements over state-of-the-art approaches.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104332"},"PeriodicalIF":2.8,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144851887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TSlicer: An optimal topology-based slicing algorithm for Z-monotone 3D meshes","authors":"Ricardo Dutra da Silva , Henrique Romaniuk Ramalho , Rodrigo Minetto , Neri Volpato , Jorge Stolfi","doi":"10.1016/j.cag.2025.104372","DOIUrl":"10.1016/j.cag.2025.104372","url":null,"abstract":"<div><div>We address a computational problem that is an essential step in computer graphics, 3D printing, and many other processes: namely, the slicing of a 3D polygonal structured mesh model (as can be extracted from an STL, OBJ, or 3MF file) by a set of parallel planes. We describe <span>TSlicer</span>, a sweep-plane algorithm that exploits the topological information provided by the mesh data structure to reduce the number of intersection tests. The output is a set of polygons on each cutting plane. The topological information allows us to produce the sides of these polygons directly in the proper sequence and orientation. Furthermore, a key optimization is proposed to a topological data structure to speed up the traversal of meshes with any Z-monotone polygons as faces. We show that <span>TSlicer</span> is optimal in the asymptotic worst-case sense, and, according to experiments, substantially faster than a previous method for slicing unstructured triangle list models, as provided by STL files. The source code and mesh models used in this study are available on GitHub.<span><span><sup>1</sup></span></span></div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104372"},"PeriodicalIF":2.8,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144858075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Swapnil Nagnath Mahajan , Karthik Krishna M. , Ramanathan Muthuganapathy
{"title":"OrthoCAD-322K: A cross-modal approach for retrieving 3D CAD models from orthographic views using a graph-based framework on a developed large-scale dataset","authors":"Swapnil Nagnath Mahajan , Karthik Krishna M. , Ramanathan Muthuganapathy","doi":"10.1016/j.cag.2025.104357","DOIUrl":"10.1016/j.cag.2025.104357","url":null,"abstract":"<div><div>Despite the widespread adoption of 3D CAD systems, 2D orthographic drawings remain integral to engineering workflows. However, millions of legacy drawings lack corresponding 3D models, hindering their integration into modern simulation, manufacturing, and digital twin systems. Existing methods for 2D to 3D CAD retrieval often fall short of meeting the structural precision required for engineering-grade drawings. We propose a cross-modal retrieval framework that aligns vector-based 2D DXF (Drawing Exchange Format) views with 3D CAD models using contrastive learning. Our architecture integrates a Graphormer-based encoder for 2D input and a PointNet-based encoder for 3D CAD models. We introduce a novel proximity-based spatial encoding to enhance structural precision and robustness across varying view configurations. Using the filtered subset (<span><math><mo>∼</mo></math></span>283K) of the newly developed large-scale dataset OrthoCAD-322K, extensive ablation and comparison studies demonstrate the robustness and generalization of the model in different input conditions and architectures. Source code is available at <span><span>https://github.com/Swapnil-Mahajan-MS/OrthoCAD-322K</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104357"},"PeriodicalIF":2.8,"publicationDate":"2025-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144842131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xingyu Liu , Jiehui Zhou , Xumeng Wang , Kam-Kwai Wong , Wei Zhang , Juntian Zhang , Minfeng Zhu , Wei Chen
{"title":"CausalPrism: A visual analytics approach for subgroup-based causal heterogeneity exploration","authors":"Xingyu Liu , Jiehui Zhou , Xumeng Wang , Kam-Kwai Wong , Wei Zhang , Juntian Zhang , Minfeng Zhu , Wei Chen","doi":"10.1016/j.cag.2025.104356","DOIUrl":"10.1016/j.cag.2025.104356","url":null,"abstract":"<div><div>In causal inference, estimating Heterogeneous Treatment Effects (HTEs) from observational data is critical for understanding how different subgroups respond to treatments, with broad applications such as precision medicine and targeted advertising. However, existing work on HTE, subgroup discovery, and causal visualization is insufficient to address two challenges: first, the sheer number of potential subgroups and the necessity to balance multiple objectives (<em>e.g.</em>, high effects and low variances) pose a considerable analytical challenge. Second, effective subgroup analysis has to follow the analysis goal specified by users and provide causal results with verification. To this end, we propose a visual analytics approach for subgroup-based causal heterogeneity exploration. Specifically, we first formulate causal subgroup discovery as a constrained multi-objective optimization problem and adopt a heuristic genetic algorithm to learn the Pareto front of optimal subgroups described by interpretable rules. Combining with this model, we develop a prototype system, <em>CausalPrism</em>, that incorporates tabular visualization, multi-attribute rankings, and uncertainty plots to support users in interactively exploring and sorting subgroups and explaining treatment effects. Quantitative experiments validate that the proposed model can efficiently mine causal subgroups that outperform state-of-the-art HTE and subgroup discovery methods, and case studies and expert interviews demonstrate the effectiveness and usability of the system. Code is available at <span><span>OSF</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"131 ","pages":"Article 104356"},"PeriodicalIF":2.8,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144830159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hui Chen , Yutong Fang , Chaoqun Wang , Chengju Chen , Xiao Wen , Peng Dai , Hongxin Zhang
{"title":"NovalisArch+: Efficient hybrid procedural-AIGC for high-fidelity 3D city generation","authors":"Hui Chen , Yutong Fang , Chaoqun Wang , Chengju Chen , Xiao Wen , Peng Dai , Hongxin Zhang","doi":"10.1016/j.cag.2025.104326","DOIUrl":"10.1016/j.cag.2025.104326","url":null,"abstract":"<div><div>3D city modeling is a key area of computer vision and powers smart city applications. However, existing methods, such as procedural modeling and deep learning-based approaches, either require heavy manual intervention in asset creation or suffer from ”blind zone effects” caused by viewpoint limitations, thereby limiting the cost-effectiveness and interactivity of urban scene generation. To address these challenges, we introduce NovalisArch+, a lightweight 3D reconstruction framework that synergistically integrates procedural content generation (PCG) and AI-generated content (AIGC) to enable asset-efficient urban modeling. By encoding region-specific aesthetic rules for Chinese cities and unifying GIS data analysis, rule-driven architectural generation, and AIGC-driven physically based rendering (PBR) material synthesis, NovalisArch+ establishes a scalable and efficient pipeline for constructing semantically coherent, stylistically consistent 3D urban environments.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104326"},"PeriodicalIF":2.8,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144860265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A parameterization method for B-spline curve interpolation via supervised regression","authors":"Shangyi Lin, Jieqing Feng","doi":"10.1016/j.cag.2025.104360","DOIUrl":"10.1016/j.cag.2025.104360","url":null,"abstract":"<div><div>Parameterization plays a crucial role in the quality of B-spline curve interpolation, but automatically selecting an appropriate method for diverse data distributions remains challenging. A recent classification-based hybrid parameterization approach addresses this issue, statistically outperforming alternative methods, but it comes with relatively high computational costs. In this work, an automatic parameterization method via supervised regression is proposed for B-spline curve interpolation. A regressor is first trained on a dataset of randomly generated data point sequences (each of length four), with optimal parameters from those given by classical methods used as labels. The regressor then estimates the optimal local parameters for each set of four consecutive data points based on their local geometric distribution. Global parameters that closely match the local ones are computed through a merging process. Since local parameters are directly generated by the regressor, the proposed method is more efficient than the classification-based hybrid approach. Additionally, since regressors are inherently more flexible than classifiers, the proposed regression-based method is compatible with any existing or new parameterization method — rather than being limited to just the three representative methods used in the classification-based approach — and is capable of producing better results. Experimental results demonstrate that the proposed method efficiently produces superior interpolation curves compared to existing techniques, even outperforming the previous classification-based approach with an idealized theoretical classifier.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104360"},"PeriodicalIF":2.8,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144829919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}