运用计算机视觉的建筑摄影美学尺度:局部与整体

Victor Sardenberg, M. Becker
{"title":"运用计算机视觉的建筑摄影美学尺度:局部与整体","authors":"Victor Sardenberg, M. Becker","doi":"10.47330/dcio.2022.ggnl1577","DOIUrl":null,"url":null,"abstract":"The existing methods for solution space navigation require numerical values to score solutions. The authors introduce a method of quantitative aesthetic evaluation utilizing Computer Vision (CV) as a criterion to navigate solution spaces. Therefore, aesthetics can complement structural, environmental, and other quantitative criteria. The work stands in the extended history of quantifying the visual aesthetic experience. Some precedents are: Birkhoff [1933] and Max Bense [1965] built an approach with experiments to empirically support a measure, whereas Birkin [2010], Ostwald, and Vaughan [2016] devised the first computational methods working on vector drawings. Our research automates and accelerates aesthetic quantification by utilizing CV to extract computable datasets from images. We are especially keen on architectural images as a shorthand to assign an aesthetic value to design, aiming to navigate the solution space in architecture. This work devises a method for rearranging parts in architectural images focusing on formal aspects, in opposition to semantic segmentation where objects unrelated to architectural design (cars, persons, sky…) are quantified to score images [Verma and Jana and Ramamritham 2018]. It uses Maximally Stable Extremal Regions (MSER) [Matas 2004] to recognize architectural parts because it is superior to similar methods such as SimpleBlobDetector in this task. Our method disassembles the parts in a diagram of scaled parts (Fig. 2) to analyze them in isolation, and a diagram of connectivity graph (Fig. 3), to evaluate relationships. These diagrams are examined to compare photos of buildings, cars, and trees to assess the applicability of such a method to a range of objects. Parts and connections are thus quantified, and these values are inputted in a refined version of Birkhoff’s formula to calculate an aesthetic score for each image for navigating the solution space. Finally, it tests the method to draw comparisons between the discrete and continuous paradigms (Fig. 1) in the contemporary discourse of architecture, comparing Zaha Hadid Architects` Heydar Aliyev Centre and Gilles Retsin´s Diamonds House to argue that there is a difference between the aesthetic effects of continuous and discrete designs, besides their distinction in tectonic logic. The method proved to be an efficient procedure for comparatively quantifying the aesthetic judgment of architectural images, enabling designers to incorporate aesthetics as a complementary criterion for solution space navigation in computational design. The method of computational aesthetic measure for solution space navigation and its calibrations via crowdsourced evaluation of images is further detailed in a paper by the authors being published at the 2022 eCAADe conference.","PeriodicalId":129906,"journal":{"name":"Design Computation Input/Output 2022","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Aesthetic Measure of Architectural Photography utilizing Computer Vision: Parts-from-Wholes\",\"authors\":\"Victor Sardenberg, M. Becker\",\"doi\":\"10.47330/dcio.2022.ggnl1577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The existing methods for solution space navigation require numerical values to score solutions. The authors introduce a method of quantitative aesthetic evaluation utilizing Computer Vision (CV) as a criterion to navigate solution spaces. Therefore, aesthetics can complement structural, environmental, and other quantitative criteria. The work stands in the extended history of quantifying the visual aesthetic experience. Some precedents are: Birkhoff [1933] and Max Bense [1965] built an approach with experiments to empirically support a measure, whereas Birkin [2010], Ostwald, and Vaughan [2016] devised the first computational methods working on vector drawings. Our research automates and accelerates aesthetic quantification by utilizing CV to extract computable datasets from images. We are especially keen on architectural images as a shorthand to assign an aesthetic value to design, aiming to navigate the solution space in architecture. This work devises a method for rearranging parts in architectural images focusing on formal aspects, in opposition to semantic segmentation where objects unrelated to architectural design (cars, persons, sky…) are quantified to score images [Verma and Jana and Ramamritham 2018]. It uses Maximally Stable Extremal Regions (MSER) [Matas 2004] to recognize architectural parts because it is superior to similar methods such as SimpleBlobDetector in this task. Our method disassembles the parts in a diagram of scaled parts (Fig. 2) to analyze them in isolation, and a diagram of connectivity graph (Fig. 3), to evaluate relationships. These diagrams are examined to compare photos of buildings, cars, and trees to assess the applicability of such a method to a range of objects. Parts and connections are thus quantified, and these values are inputted in a refined version of Birkhoff’s formula to calculate an aesthetic score for each image for navigating the solution space. Finally, it tests the method to draw comparisons between the discrete and continuous paradigms (Fig. 1) in the contemporary discourse of architecture, comparing Zaha Hadid Architects` Heydar Aliyev Centre and Gilles Retsin´s Diamonds House to argue that there is a difference between the aesthetic effects of continuous and discrete designs, besides their distinction in tectonic logic. The method proved to be an efficient procedure for comparatively quantifying the aesthetic judgment of architectural images, enabling designers to incorporate aesthetics as a complementary criterion for solution space navigation in computational design. The method of computational aesthetic measure for solution space navigation and its calibrations via crowdsourced evaluation of images is further detailed in a paper by the authors being published at the 2022 eCAADe conference.\",\"PeriodicalId\":129906,\"journal\":{\"name\":\"Design Computation Input/Output 2022\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Design Computation Input/Output 2022\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47330/dcio.2022.ggnl1577\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Design Computation Input/Output 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47330/dcio.2022.ggnl1577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

现有的解空间导航方法需要数值来对解进行评分。作者介绍了一种利用计算机视觉(CV)作为导航解空间的标准的定量美学评价方法。因此,美学可以补充结构、环境和其他定量标准。这部作品代表了量化视觉审美体验的漫长历史。一些先例是:Birkhoff[1933]和Max Bense[1965]用实验建立了一种方法来经验地支持一种测量,而Birkin[2010]、Ostwald和Vaughan[2016]设计了第一个用于矢量图的计算方法。我们的研究通过利用CV从图像中提取可计算的数据集来自动化和加速美学量化。我们特别热衷于将建筑图像作为一种简写,为设计赋予美学价值,旨在引导建筑的解决方案空间。这项工作设计了一种方法来重新排列建筑图像中的部分,侧重于形式方面,而不是语义分割,其中与建筑设计无关的对象(汽车,人物,天空……)被量化以对图像进行评分[Verma和Jana和Ramamritham 2018]。它使用最大稳定极值区域(maximum Stable extreme region, MSER) [Matas 2004]来识别建筑部件,因为它在此任务中优于类似的方法,如SimpleBlobDetector。我们的方法在缩放部件图(图2)和连接图(图3)中拆卸部件,以单独分析它们,以评估关系。这些图被用来比较建筑物、汽车和树木的照片,以评估这种方法对一系列对象的适用性。因此,部分和连接被量化,这些值被输入到Birkhoff公式的精炼版本中,以计算每个图像的美学分数,以导航解决方案空间。最后,它测试了在当代建筑话语中对离散和连续范式进行比较的方法(图1),比较了扎哈·哈迪德建筑师事务所的盖达尔·阿利耶夫中心和Gilles Retsin的钻石之家,以证明除了构造逻辑上的区别之外,连续和离散设计的美学效果之间存在差异。该方法被证明是一种比较量化建筑图像审美判断的有效方法,使设计师能够将美学作为计算设计中解空间导航的补充标准。在2022年eCAADe会议上发表的一篇论文中,作者进一步详细介绍了求解空间导航的计算美学测量方法及其通过众包图像评估的校准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Aesthetic Measure of Architectural Photography utilizing Computer Vision: Parts-from-Wholes
The existing methods for solution space navigation require numerical values to score solutions. The authors introduce a method of quantitative aesthetic evaluation utilizing Computer Vision (CV) as a criterion to navigate solution spaces. Therefore, aesthetics can complement structural, environmental, and other quantitative criteria. The work stands in the extended history of quantifying the visual aesthetic experience. Some precedents are: Birkhoff [1933] and Max Bense [1965] built an approach with experiments to empirically support a measure, whereas Birkin [2010], Ostwald, and Vaughan [2016] devised the first computational methods working on vector drawings. Our research automates and accelerates aesthetic quantification by utilizing CV to extract computable datasets from images. We are especially keen on architectural images as a shorthand to assign an aesthetic value to design, aiming to navigate the solution space in architecture. This work devises a method for rearranging parts in architectural images focusing on formal aspects, in opposition to semantic segmentation where objects unrelated to architectural design (cars, persons, sky…) are quantified to score images [Verma and Jana and Ramamritham 2018]. It uses Maximally Stable Extremal Regions (MSER) [Matas 2004] to recognize architectural parts because it is superior to similar methods such as SimpleBlobDetector in this task. Our method disassembles the parts in a diagram of scaled parts (Fig. 2) to analyze them in isolation, and a diagram of connectivity graph (Fig. 3), to evaluate relationships. These diagrams are examined to compare photos of buildings, cars, and trees to assess the applicability of such a method to a range of objects. Parts and connections are thus quantified, and these values are inputted in a refined version of Birkhoff’s formula to calculate an aesthetic score for each image for navigating the solution space. Finally, it tests the method to draw comparisons between the discrete and continuous paradigms (Fig. 1) in the contemporary discourse of architecture, comparing Zaha Hadid Architects` Heydar Aliyev Centre and Gilles Retsin´s Diamonds House to argue that there is a difference between the aesthetic effects of continuous and discrete designs, besides their distinction in tectonic logic. The method proved to be an efficient procedure for comparatively quantifying the aesthetic judgment of architectural images, enabling designers to incorporate aesthetics as a complementary criterion for solution space navigation in computational design. The method of computational aesthetic measure for solution space navigation and its calibrations via crowdsourced evaluation of images is further detailed in a paper by the authors being published at the 2022 eCAADe conference.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信