机动车碰撞记录的语义嵌入方法:以纽约市曼哈顿区交通安全为例

IF 2.4 3区 工程技术 Q3 TRANSPORTATION
Yuxuan Wang, Ruoxin Xiong, Hao Yu, Jie Bao, Zhao Yang
{"title":"机动车碰撞记录的语义嵌入方法:以纽约市曼哈顿区交通安全为例","authors":"Yuxuan Wang, Ruoxin Xiong, Hao Yu, Jie Bao, Zhao Yang","doi":"10.1080/19439962.2021.1994681","DOIUrl":null,"url":null,"abstract":"Abstract This study introduces a hybrid Latent Dirichlet Allocation (LDA) model to excavate hidden crash patterns from the large-scale crash dataset. External semantic descriptions have been attached to raw GPS coordinates of crash events. The K-means clustering algorithm is first applied to determine land use characteristics of crash points by grouping surrounding Points of Interests (POIs). Then, each crash record is transformed into a formalized label consisting of land use, Annual Average Daily Traffic (AADT), and time stamps, allowing the analysis of massive traffic crash data as document corpora. Finally, a data-driven modeling approach based on the LDA is conducted to discover hidden crash patterns from traffic crash records combining the external semantic information. The approach is verified using motor vehicle crash data in Manhattan County of New York City. The novel semantic analysis of crash records provides an effective method to investigate the hidden information in traffic crashes. Identifying spatial-temporal patterns on motor vehicle crashes would provide insights into underlying traffic behaviors for intelligent policy-making and resource allocation.","PeriodicalId":46672,"journal":{"name":"Journal of Transportation Safety & Security","volume":"8 1","pages":"1913 - 1933"},"PeriodicalIF":2.4000,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A semantic embedding methodology for motor vehicle crash records: A case study of traffic safety in Manhattan Borough of New York City\",\"authors\":\"Yuxuan Wang, Ruoxin Xiong, Hao Yu, Jie Bao, Zhao Yang\",\"doi\":\"10.1080/19439962.2021.1994681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract This study introduces a hybrid Latent Dirichlet Allocation (LDA) model to excavate hidden crash patterns from the large-scale crash dataset. External semantic descriptions have been attached to raw GPS coordinates of crash events. The K-means clustering algorithm is first applied to determine land use characteristics of crash points by grouping surrounding Points of Interests (POIs). Then, each crash record is transformed into a formalized label consisting of land use, Annual Average Daily Traffic (AADT), and time stamps, allowing the analysis of massive traffic crash data as document corpora. Finally, a data-driven modeling approach based on the LDA is conducted to discover hidden crash patterns from traffic crash records combining the external semantic information. The approach is verified using motor vehicle crash data in Manhattan County of New York City. The novel semantic analysis of crash records provides an effective method to investigate the hidden information in traffic crashes. Identifying spatial-temporal patterns on motor vehicle crashes would provide insights into underlying traffic behaviors for intelligent policy-making and resource allocation.\",\"PeriodicalId\":46672,\"journal\":{\"name\":\"Journal of Transportation Safety & Security\",\"volume\":\"8 1\",\"pages\":\"1913 - 1933\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2021-10-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Transportation Safety & Security\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1080/19439962.2021.1994681\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Transportation Safety & Security","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/19439962.2021.1994681","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 3

摘要

摘要本文引入一种混合潜狄利克雷分配(Latent Dirichlet Allocation, LDA)模型,从大规模碰撞数据集中挖掘隐藏的碰撞模式。外部语义描述已附加到碰撞事件的原始GPS坐标。首先应用k均值聚类算法,通过对周边兴趣点(poi)进行分组,确定碰撞点的土地利用特征。然后,将每个碰撞记录转换为由土地使用、年平均每日交通量(AADT)和时间戳组成的正式标签,从而允许将大量交通碰撞数据作为文档语料库进行分析。最后,提出了一种基于LDA的数据驱动建模方法,结合外部语义信息从交通碰撞记录中发现隐藏的碰撞模式。该方法使用纽约市曼哈顿县的机动车碰撞数据进行了验证。新的碰撞记录语义分析方法为研究交通碰撞中隐藏的信息提供了一种有效的方法。识别机动车碰撞的时空模式将为智能决策和资源配置提供对潜在交通行为的洞察。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A semantic embedding methodology for motor vehicle crash records: A case study of traffic safety in Manhattan Borough of New York City
Abstract This study introduces a hybrid Latent Dirichlet Allocation (LDA) model to excavate hidden crash patterns from the large-scale crash dataset. External semantic descriptions have been attached to raw GPS coordinates of crash events. The K-means clustering algorithm is first applied to determine land use characteristics of crash points by grouping surrounding Points of Interests (POIs). Then, each crash record is transformed into a formalized label consisting of land use, Annual Average Daily Traffic (AADT), and time stamps, allowing the analysis of massive traffic crash data as document corpora. Finally, a data-driven modeling approach based on the LDA is conducted to discover hidden crash patterns from traffic crash records combining the external semantic information. The approach is verified using motor vehicle crash data in Manhattan County of New York City. The novel semantic analysis of crash records provides an effective method to investigate the hidden information in traffic crashes. Identifying spatial-temporal patterns on motor vehicle crashes would provide insights into underlying traffic behaviors for intelligent policy-making and resource allocation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.00
自引率
15.40%
发文量
38
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信