{"title":"Generating floor plans with deep learning: A cross-validation assessment over different dataset sizes","authors":"Ricardo C Rodrigues, R. Duarte","doi":"10.1177/14780771221120842","DOIUrl":null,"url":null,"abstract":"The advent of deep learning has enabled a series of opportunities; one of them is the ability to tackle subjective factors on the floor plan design and make predictions though spatial semantic maps. Nonetheless, the amount available of data grows exponentially on a daily basis, in this sense, this research seeks to investigate deep generative methods of floor plan design and its relationship between data volume, with training time, quality and diversity in the outputs; in other words, what is the amount of data required to rapidly train models that return optimal results. In our research, we used a variation of the Conditional Generative Adversarial Network algorithm, that is, Pix2pix, and a dataset of approximately 80 thousand images to train 10 models and evaluate their performance through a series of computational metrics. The results show that the potential of this data-driven method depends not only on the diversity of the training set but also on the linearity of the distribution; therefore, high-dimensional datasets did not achieve good results. It is also concluded that models trained on small sets of data (800 images) may return excellent results if given the correct training instructions (Hyperparameters), but the best baseline to this generative task is in the mid-term, using around 20 to 30 thousand images with a linear distribution. Finally, it is presented standard guidelines for dataset design, and the impact of data curation along the entire process.","PeriodicalId":45139,"journal":{"name":"International Journal of Architectural Computing","volume":"20 1","pages":"630 - 644"},"PeriodicalIF":1.6000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Architectural Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/14780771221120842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ARCHITECTURE","Score":null,"Total":0}
引用次数: 1
Abstract
The advent of deep learning has enabled a series of opportunities; one of them is the ability to tackle subjective factors on the floor plan design and make predictions though spatial semantic maps. Nonetheless, the amount available of data grows exponentially on a daily basis, in this sense, this research seeks to investigate deep generative methods of floor plan design and its relationship between data volume, with training time, quality and diversity in the outputs; in other words, what is the amount of data required to rapidly train models that return optimal results. In our research, we used a variation of the Conditional Generative Adversarial Network algorithm, that is, Pix2pix, and a dataset of approximately 80 thousand images to train 10 models and evaluate their performance through a series of computational metrics. The results show that the potential of this data-driven method depends not only on the diversity of the training set but also on the linearity of the distribution; therefore, high-dimensional datasets did not achieve good results. It is also concluded that models trained on small sets of data (800 images) may return excellent results if given the correct training instructions (Hyperparameters), but the best baseline to this generative task is in the mid-term, using around 20 to 30 thousand images with a linear distribution. Finally, it is presented standard guidelines for dataset design, and the impact of data curation along the entire process.