Tian Feng , Long Li , Weitao Li , Bo Li , Junao Shen
{"title":"CaRoLS: Condition-adaptive multi-level road layout synthesis","authors":"Tian Feng , Long Li , Weitao Li , Bo Li , Junao Shen","doi":"10.1016/j.cag.2025.104451","DOIUrl":null,"url":null,"abstract":"<div><div>Synthesizing road layouts, which define the spatial structure of cities, is critical for many urban applications. Conventional deep learning methods, however, struggle to handle both unconditional and conditional inputs, and rarely capture the multi-level complexity of real road networks. We propose CaRoLS, a unified two-stage method for condition-adaptive multi-level road layout synthesis. Specifically, the Multi-level Layout Reconstruction stage uses a pre-trained variational autoencoder to encode a real-world road layout into a latent representation and then reconstructs the image. The Condition-adaptive Representation Generation stage employs a diffusion model to generate a latent representation from Gaussian noise, or from noise combined with an optional conditioning image containing natural and socio-economic information. This design balances computational efficiency with the ability to model continuous data. To further enhance output quality, we introduce a Condition-aware Decoder Block module that integrates global context and local details, replacing the standard U-Net decoder blocks in the diffusion model. Experiments on an Australian metropolitan dataset show that CaRoLS outperforms representative general and specialized synthesis methods. Compared to the current state-of-the-art methods, improvements reach up to 36.47% and 4.05% in image and topological metrics for the unconditional mode, and 56.25% and 3.18% in the conditional mode. These results demonstrate that CaRoLS generates multi-level road layouts with strong structural fidelity and high connectivity, and provides a unified pipeline for both unconditional and conditional synthesis.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"133 ","pages":"Article 104451"},"PeriodicalIF":2.8000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Graphics-Uk","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0097849325002924","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Synthesizing road layouts, which define the spatial structure of cities, is critical for many urban applications. Conventional deep learning methods, however, struggle to handle both unconditional and conditional inputs, and rarely capture the multi-level complexity of real road networks. We propose CaRoLS, a unified two-stage method for condition-adaptive multi-level road layout synthesis. Specifically, the Multi-level Layout Reconstruction stage uses a pre-trained variational autoencoder to encode a real-world road layout into a latent representation and then reconstructs the image. The Condition-adaptive Representation Generation stage employs a diffusion model to generate a latent representation from Gaussian noise, or from noise combined with an optional conditioning image containing natural and socio-economic information. This design balances computational efficiency with the ability to model continuous data. To further enhance output quality, we introduce a Condition-aware Decoder Block module that integrates global context and local details, replacing the standard U-Net decoder blocks in the diffusion model. Experiments on an Australian metropolitan dataset show that CaRoLS outperforms representative general and specialized synthesis methods. Compared to the current state-of-the-art methods, improvements reach up to 36.47% and 4.05% in image and topological metrics for the unconditional mode, and 56.25% and 3.18% in the conditional mode. These results demonstrate that CaRoLS generates multi-level road layouts with strong structural fidelity and high connectivity, and provides a unified pipeline for both unconditional and conditional synthesis.
期刊介绍:
Computers & Graphics is dedicated to disseminate information on research and applications of computer graphics (CG) techniques. The journal encourages articles on:
1. Research and applications of interactive computer graphics. We are particularly interested in novel interaction techniques and applications of CG to problem domains.
2. State-of-the-art papers on late-breaking, cutting-edge research on CG.
3. Information on innovative uses of graphics principles and technologies.
4. Tutorial papers on both teaching CG principles and innovative uses of CG in education.