一步吉布斯抽样生成合成住户

IF 7.6 1区 工程技术 Q1 TRANSPORTATION SCIENCE & TECHNOLOGY
Marija Kukic, Xinling Li, Michel Bierlaire
{"title":"一步吉布斯抽样生成合成住户","authors":"Marija Kukic,&nbsp;Xinling Li,&nbsp;Michel Bierlaire","doi":"10.1016/j.trc.2024.104770","DOIUrl":null,"url":null,"abstract":"<div><p>The generation of synthetic households is challenging due to the necessity of maintaining consistency between the two layers of interest: the household itself, and the individuals composing it. Hence, the problem is typically tackled in two steps, first focusing on the individual layer and then on the household layer. The existing two-step simulation method proposes generating the households based on their roles which diminishes the generality of the approach and makes it difficult to reproduce despite its beneficial properties. In this paper, we propose an alternative extension of Gibbs sampling for generating hierarchical datasets such as synthetic households, in order to make simulation more general and reusable. We demonstrate the performance of our method in a case study based on the 2015 Swiss micro-census data and compare it against state-of-the-art approaches. We show the influence of modeling decisions on different performance metrics and how the analyst can easily enforce consistency while avoiding generating illogical households. We show that the algorithm maintains the conditional distributions while satisfying the marginals of all variables simultaneously, all while generating consistent synthetic households.</p></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":null,"pages":null},"PeriodicalIF":7.6000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0968090X24002912/pdfft?md5=4bc5446f0b68d223d680306ec50f8b43&pid=1-s2.0-S0968090X24002912-main.pdf","citationCount":"0","resultStr":"{\"title\":\"One-step Gibbs sampling for the generation of synthetic households\",\"authors\":\"Marija Kukic,&nbsp;Xinling Li,&nbsp;Michel Bierlaire\",\"doi\":\"10.1016/j.trc.2024.104770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The generation of synthetic households is challenging due to the necessity of maintaining consistency between the two layers of interest: the household itself, and the individuals composing it. Hence, the problem is typically tackled in two steps, first focusing on the individual layer and then on the household layer. The existing two-step simulation method proposes generating the households based on their roles which diminishes the generality of the approach and makes it difficult to reproduce despite its beneficial properties. In this paper, we propose an alternative extension of Gibbs sampling for generating hierarchical datasets such as synthetic households, in order to make simulation more general and reusable. We demonstrate the performance of our method in a case study based on the 2015 Swiss micro-census data and compare it against state-of-the-art approaches. We show the influence of modeling decisions on different performance metrics and how the analyst can easily enforce consistency while avoiding generating illogical households. We show that the algorithm maintains the conditional distributions while satisfying the marginals of all variables simultaneously, all while generating consistent synthetic households.</p></div>\",\"PeriodicalId\":54417,\"journal\":{\"name\":\"Transportation Research Part C-Emerging Technologies\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0968090X24002912/pdfft?md5=4bc5446f0b68d223d680306ec50f8b43&pid=1-s2.0-S0968090X24002912-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part C-Emerging Technologies\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968090X24002912\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X24002912","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

合成住户的生成具有挑战性,因为必须保持两个相关层之间的一致性:住户本身和组成住户的个人。因此,这个问题通常分两步解决,首先是个人层,然后是家庭层。现有的两步模拟法建议根据家庭的角色生成家庭,这削弱了该方法的通用性,使其难以复制,尽管它具有有益的特性。在本文中,我们提出了吉布斯抽样的另一种扩展方法,用于生成合成家庭等分层数据集,以使模拟更具通用性和可重用性。我们在基于 2015 年瑞士微观人口普查数据的案例研究中展示了我们方法的性能,并将其与最先进的方法进行了比较。我们展示了建模决策对不同性能指标的影响,以及分析师如何在避免生成不合逻辑住户的同时轻松实现一致性。我们表明,该算法既能保持条件分布,又能同时满足所有变量的边际值,同时还能生成一致的合成住户。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
One-step Gibbs sampling for the generation of synthetic households

The generation of synthetic households is challenging due to the necessity of maintaining consistency between the two layers of interest: the household itself, and the individuals composing it. Hence, the problem is typically tackled in two steps, first focusing on the individual layer and then on the household layer. The existing two-step simulation method proposes generating the households based on their roles which diminishes the generality of the approach and makes it difficult to reproduce despite its beneficial properties. In this paper, we propose an alternative extension of Gibbs sampling for generating hierarchical datasets such as synthetic households, in order to make simulation more general and reusable. We demonstrate the performance of our method in a case study based on the 2015 Swiss micro-census data and compare it against state-of-the-art approaches. We show the influence of modeling decisions on different performance metrics and how the analyst can easily enforce consistency while avoiding generating illogical households. We show that the algorithm maintains the conditional distributions while satisfying the marginals of all variables simultaneously, all while generating consistent synthetic households.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
15.80
自引率
12.00%
发文量
332
审稿时长
64 days
期刊介绍: Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信