Unveiling Topics from Scientific Literature on the Subject of Self-driving Cars using Latent Dirichlet Allocation

W. Y. Ayele, Gustaf Juell-Skielse
{"title":"Unveiling Topics from Scientific Literature on the Subject of Self-driving Cars using Latent Dirichlet Allocation","authors":"W. Y. Ayele, Gustaf Juell-Skielse","doi":"10.1109/IEMCON.2018.8615056","DOIUrl":null,"url":null,"abstract":"Self-driving cars are becoming popular topics in academia. Consumers of self-driving cars and vehicles have different concerns, for example, safety and security, to name a few. Also, the public sector has interests in self-driving cars such as amending policies to enable the management of self-driving vehicles in cities, urban planning, traffic management and, etc. In this paper, more than 2700 corpus are extracted from literature from several subject areas to identify latent (hidden) topics of self-driving cars. Latent Dirichlet Allocation (LDA) is used for topic identification. The result of this study shows that topics identified are valid research areas such as urban planning, driver car (computer) interaction, self-driving control and system design, ethics in self-driving cars, safety and risk assessment, training dataset quality and machine learning in self-driving cars are among the topics identified. Furthermore, the network visualization of association graph of terms shows that the most frequently discussed concepts reveal that control of self-driving cars is based on algorithms, data, design, method, and model. The methods used in this study and the results can be used as decision tools, if carefully applied, in diverse disciplines that are disrupted by the introduction of self-driving cars. For future study, we plan to extend this study with a larger dataset and other data mining techniques.","PeriodicalId":368939,"journal":{"name":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"18 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON.2018.8615056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Self-driving cars are becoming popular topics in academia. Consumers of self-driving cars and vehicles have different concerns, for example, safety and security, to name a few. Also, the public sector has interests in self-driving cars such as amending policies to enable the management of self-driving vehicles in cities, urban planning, traffic management and, etc. In this paper, more than 2700 corpus are extracted from literature from several subject areas to identify latent (hidden) topics of self-driving cars. Latent Dirichlet Allocation (LDA) is used for topic identification. The result of this study shows that topics identified are valid research areas such as urban planning, driver car (computer) interaction, self-driving control and system design, ethics in self-driving cars, safety and risk assessment, training dataset quality and machine learning in self-driving cars are among the topics identified. Furthermore, the network visualization of association graph of terms shows that the most frequently discussed concepts reveal that control of self-driving cars is based on algorithms, data, design, method, and model. The methods used in this study and the results can be used as decision tools, if carefully applied, in diverse disciplines that are disrupted by the introduction of self-driving cars. For future study, we plan to extend this study with a larger dataset and other data mining techniques.
利用潜在狄利克雷分配从科学文献中揭示自动驾驶汽车的主题
自动驾驶汽车正在成为学术界的热门话题。自动驾驶汽车和车辆的消费者有不同的担忧,例如,安全和保障,等等。公共部门也对自动驾驶汽车产生了浓厚的兴趣,比如修改城市自动驾驶汽车管理政策、城市规划、交通管理等。本文从多个学科领域的文献中提取了2700多个语料库,以识别自动驾驶汽车的潜在(隐藏)主题。使用潜在狄利克雷分配(Latent Dirichlet Allocation, LDA)进行主题识别。本研究的结果表明,确定的主题是有效的研究领域,如城市规划、驾驶员(计算机)交互、自动驾驶控制和系统设计、自动驾驶汽车伦理、安全和风险评估、训练数据集质量和自动驾驶汽车的机器学习等。此外,术语关联图的网络可视化显示,最常讨论的概念揭示了自动驾驶汽车的控制是基于算法、数据、设计、方法和模型的。如果仔细应用,本研究中使用的方法和结果可以作为决策工具,用于被自动驾驶汽车引入所破坏的不同学科。对于未来的研究,我们计划用更大的数据集和其他数据挖掘技术来扩展这项研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信