A. Nica, Mihai Trăscău, Alexandru Andrei Rotaru, C. Andreescu, Alexandru Sorici, A. Florea, Vlad Bacue
{"title":"UPB校园自动驾驶数据集的采集与处理","authors":"A. Nica, Mihai Trăscău, Alexandru Andrei Rotaru, C. Andreescu, Alexandru Sorici, A. Florea, Vlad Bacue","doi":"10.1109/CSCS.2019.00041","DOIUrl":null,"url":null,"abstract":"Although there is a diversity of publicly available datasets for autonomous driving, from small-scale to larger collections with thousands of miles of driving, we consider that the process of collecting and processing them is often overlooked in the literature. From a data-driven perspective, quality of a dataset has proven as important as quantity especially when evaluating self-driving technologies where safety is crucial. In this paper, we provide a guideline going through all the steps from configuring the hardware setup to obtaining a clean dataset. We describe the data collection scenario design, the hardware and software employed in the process, the challenges that must be considered, data filtering and validation stage. This work stems from our experience in collecting the UPB campus driving dataset released together with this work. It is our belief that having a clean and efficient process of collecting a small but meaningful dataset has the potential to improve benchmarking autonomous driving solutions, capturing local environment particularities.","PeriodicalId":352411,"journal":{"name":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Collecting and Processing a Self-Driving Dataset in the UPB Campus\",\"authors\":\"A. Nica, Mihai Trăscău, Alexandru Andrei Rotaru, C. Andreescu, Alexandru Sorici, A. Florea, Vlad Bacue\",\"doi\":\"10.1109/CSCS.2019.00041\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although there is a diversity of publicly available datasets for autonomous driving, from small-scale to larger collections with thousands of miles of driving, we consider that the process of collecting and processing them is often overlooked in the literature. From a data-driven perspective, quality of a dataset has proven as important as quantity especially when evaluating self-driving technologies where safety is crucial. In this paper, we provide a guideline going through all the steps from configuring the hardware setup to obtaining a clean dataset. We describe the data collection scenario design, the hardware and software employed in the process, the challenges that must be considered, data filtering and validation stage. This work stems from our experience in collecting the UPB campus driving dataset released together with this work. It is our belief that having a clean and efficient process of collecting a small but meaningful dataset has the potential to improve benchmarking autonomous driving solutions, capturing local environment particularities.\",\"PeriodicalId\":352411,\"journal\":{\"name\":\"2019 22nd International Conference on Control Systems and Computer Science (CSCS)\",\"volume\":\"61 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 22nd International Conference on Control Systems and Computer Science (CSCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSCS.2019.00041\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 22nd International Conference on Control Systems and Computer Science (CSCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCS.2019.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Collecting and Processing a Self-Driving Dataset in the UPB Campus
Although there is a diversity of publicly available datasets for autonomous driving, from small-scale to larger collections with thousands of miles of driving, we consider that the process of collecting and processing them is often overlooked in the literature. From a data-driven perspective, quality of a dataset has proven as important as quantity especially when evaluating self-driving technologies where safety is crucial. In this paper, we provide a guideline going through all the steps from configuring the hardware setup to obtaining a clean dataset. We describe the data collection scenario design, the hardware and software employed in the process, the challenges that must be considered, data filtering and validation stage. This work stems from our experience in collecting the UPB campus driving dataset released together with this work. It is our belief that having a clean and efficient process of collecting a small but meaningful dataset has the potential to improve benchmarking autonomous driving solutions, capturing local environment particularities.