Tian Gan , Rayan Succar , Simone Macrì , Manuel Ruiz Marín , Maurizio Porfiri
{"title":"从城市数据中发现因果关系,其中城市规模与信息理论相结合","authors":"Tian Gan , Rayan Succar , Simone Macrì , Manuel Ruiz Marín , Maurizio Porfiri","doi":"10.1016/j.cities.2025.105980","DOIUrl":null,"url":null,"abstract":"<div><div>Causal discovery from urban data offers an unprecedented opportunity for research and practice in urban science. Whether they implement traditional regression analyses or more sophisticated tools for time series analysis, existing approaches to causal discovery consider cities to be statistically equivalent – an assumption that is seldom met in real urban systems, where cities dramatically differ in population size, land area, etc. In this study, we embrace the heterogeneity between cities within a novel approach to causal discovery that integrates urban scaling and information theory, towards the discovery of associations between urban processes. Our approach takes as input time series of salient urban variables for causal discovery, along with a set of static features that capture the heterogeneities within the system. Using a Cobb-Douglas function, we extract features-adjusted metropolitan indicators (FsAMIs) that mitigate spurious dependencies among variables due to the underlying variations in spatial features. Through the application of transfer entropy on FsAMIs, we ultimately perform model-free causal inference between urban processes. We validate our approach on synthetic datasets representative of a pair of time-varying urban mock variables and on two real-world open urban datasets related to climate change and infectious diseases. Results demonstrate that our framework outperforms the state of the art across diverse scenarios, minimizing false (positive and negative) inferences. Our methodology offers a powerful tool that advances data-driven urban policy and planning, leveraging open urban datasets to gain robust insight into relationships among urban variables.</div></div>","PeriodicalId":48405,"journal":{"name":"Cities","volume":"162 ","pages":"Article 105980"},"PeriodicalIF":6.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Causal discovery from city data, where urban scaling meets information theory\",\"authors\":\"Tian Gan , Rayan Succar , Simone Macrì , Manuel Ruiz Marín , Maurizio Porfiri\",\"doi\":\"10.1016/j.cities.2025.105980\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Causal discovery from urban data offers an unprecedented opportunity for research and practice in urban science. Whether they implement traditional regression analyses or more sophisticated tools for time series analysis, existing approaches to causal discovery consider cities to be statistically equivalent – an assumption that is seldom met in real urban systems, where cities dramatically differ in population size, land area, etc. In this study, we embrace the heterogeneity between cities within a novel approach to causal discovery that integrates urban scaling and information theory, towards the discovery of associations between urban processes. Our approach takes as input time series of salient urban variables for causal discovery, along with a set of static features that capture the heterogeneities within the system. Using a Cobb-Douglas function, we extract features-adjusted metropolitan indicators (FsAMIs) that mitigate spurious dependencies among variables due to the underlying variations in spatial features. Through the application of transfer entropy on FsAMIs, we ultimately perform model-free causal inference between urban processes. We validate our approach on synthetic datasets representative of a pair of time-varying urban mock variables and on two real-world open urban datasets related to climate change and infectious diseases. Results demonstrate that our framework outperforms the state of the art across diverse scenarios, minimizing false (positive and negative) inferences. Our methodology offers a powerful tool that advances data-driven urban policy and planning, leveraging open urban datasets to gain robust insight into relationships among urban variables.</div></div>\",\"PeriodicalId\":48405,\"journal\":{\"name\":\"Cities\",\"volume\":\"162 \",\"pages\":\"Article 105980\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cities\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S026427512500280X\",\"RegionNum\":1,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"URBAN STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cities","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026427512500280X","RegionNum":1,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"URBAN STUDIES","Score":null,"Total":0}
Causal discovery from city data, where urban scaling meets information theory
Causal discovery from urban data offers an unprecedented opportunity for research and practice in urban science. Whether they implement traditional regression analyses or more sophisticated tools for time series analysis, existing approaches to causal discovery consider cities to be statistically equivalent – an assumption that is seldom met in real urban systems, where cities dramatically differ in population size, land area, etc. In this study, we embrace the heterogeneity between cities within a novel approach to causal discovery that integrates urban scaling and information theory, towards the discovery of associations between urban processes. Our approach takes as input time series of salient urban variables for causal discovery, along with a set of static features that capture the heterogeneities within the system. Using a Cobb-Douglas function, we extract features-adjusted metropolitan indicators (FsAMIs) that mitigate spurious dependencies among variables due to the underlying variations in spatial features. Through the application of transfer entropy on FsAMIs, we ultimately perform model-free causal inference between urban processes. We validate our approach on synthetic datasets representative of a pair of time-varying urban mock variables and on two real-world open urban datasets related to climate change and infectious diseases. Results demonstrate that our framework outperforms the state of the art across diverse scenarios, minimizing false (positive and negative) inferences. Our methodology offers a powerful tool that advances data-driven urban policy and planning, leveraging open urban datasets to gain robust insight into relationships among urban variables.
期刊介绍:
Cities offers a comprehensive range of articles on all aspects of urban policy. It provides an international and interdisciplinary platform for the exchange of ideas and information between urban planners and policy makers from national and local government, non-government organizations, academia and consultancy. The primary aims of the journal are to analyse and assess past and present urban development and management as a reflection of effective, ineffective and non-existent planning policies; and the promotion of the implementation of appropriate urban policies in both the developed and the developing world.