{"title":"视觉几何任务的无监督联合多任务学习","authors":"P. Jha, D. Tsanev, L. Lukic","doi":"10.1109/ivworkshops54471.2021.9669211","DOIUrl":null,"url":null,"abstract":"In this paper, we present a novel architecture and training methodology for learning monocular depth prediction, camera pose estimation, optical flow, and moving object segmentation using a common encoder in an unsupervised fashion. We demonstrate that the geometrical relationships between these tasks not only support joint unsupervised learning as shown in previous works but also allow them to share common features. We also show the advantage of using a two-stage learning approach to improve the performance of the base network.","PeriodicalId":256905,"journal":{"name":"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Unsupervised Joint Multi-Task Learning of Vision Geometry Tasks\",\"authors\":\"P. Jha, D. Tsanev, L. Lukic\",\"doi\":\"10.1109/ivworkshops54471.2021.9669211\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a novel architecture and training methodology for learning monocular depth prediction, camera pose estimation, optical flow, and moving object segmentation using a common encoder in an unsupervised fashion. We demonstrate that the geometrical relationships between these tasks not only support joint unsupervised learning as shown in previous works but also allow them to share common features. We also show the advantage of using a two-stage learning approach to improve the performance of the base network.\",\"PeriodicalId\":256905,\"journal\":{\"name\":\"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ivworkshops54471.2021.9669211\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ivworkshops54471.2021.9669211","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Unsupervised Joint Multi-Task Learning of Vision Geometry Tasks
In this paper, we present a novel architecture and training methodology for learning monocular depth prediction, camera pose estimation, optical flow, and moving object segmentation using a common encoder in an unsupervised fashion. We demonstrate that the geometrical relationships between these tasks not only support joint unsupervised learning as shown in previous works but also allow them to share common features. We also show the advantage of using a two-stage learning approach to improve the performance of the base network.