Stereo-Vision Based System For Object Detection And Recognition
P. Aswin, J. Chandana, Seethal Reghunath, Maya Menon
2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), published 2019-04-01
DOI: https://doi.org/10.1109/ICOEI.2019.8862588
Citations: 3
Abstract
This paper proposes a method for detecting and recognizing objects using stereo vision, the Scale-Invariant Feature Transform (SIFT), and the Fast Library for Approximate Nearest Neighbors (FLANN), together with its implementation on an embedded system. Running on a Raspberry Pi single-board computer, the implemented system takes the two images of a stereo pair as input and calculates the disparity map, which provides relative depth information. Using this map and SIFT, features are extracted and matched against a database containing a large collection of images. The implementation uses FLANN for matching, which, unlike the brute-force matching algorithm, can support large databases. When an object is recognized, the system gives a voice output through text-to-speech conversion.
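To illustrate how a disparity map carries relative depth, the following is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not say which stereo algorithm is used, so brute-force block matching with a sum of absolute differences (SAD) cost is assumed here. The function names, the synthetic test pair, and the focal length and baseline values are all hypothetical. For a rectified pair, depth follows the standard relation Z = f * B / d, so a larger disparity means a closer object.

```python
def sad(left, right, y, x_left, x_right, half):
    """Sum of absolute differences between two square windows."""
    total = 0
    for dy in range(-half, half + 1):
        for dx in range(-half, half + 1):
            total += abs(left[y + dy][x_left + dx] - right[y + dy][x_right + dx])
    return total


def disparity_map(left, right, max_disp=4, half=1):
    """Brute-force block matching on rectified grayscale images (lists of rows).

    Assumed stand-in for the stereo step; border pixels are left at 0.
    """
    height, width = len(left), len(left[0])
    disp = [[0] * width for _ in range(height)]
    for y in range(half, height - half):
        for x in range(half, width - half):
            best_cost, best_d = None, 0
            # A point at column x in the left image sits at x - d in the right image.
            for d in range(max_disp + 1):
                if x - d - half < 0:
                    break  # the right-image window would leave the image
                cost = sad(left, right, y, x, x - d, half)
                if best_cost is None or cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y][x] = best_d
    return disp


def depth_from_disparity(d, focal_px, baseline_m):
    """Depth in metres from disparity in pixels: Z = f * B / d."""
    if d <= 0:
        return float("inf")  # zero disparity corresponds to a point at infinity
    return focal_px * baseline_m / d


def make_shifted_pair(width=12, height=6, shift=2):
    """Synthetic rectified pair in which every point has the same disparity."""
    def scene(x, y):
        # Strictly convex in x, so a window matches exactly at one shift only.
        return x * x + 3 * y

    right = [[scene(x, y) for x in range(width)] for y in range(height)]
    left = [[scene(x - shift, y) for x in range(width)] for y in range(height)]
    return left, right
```

Calling `disparity_map(*make_shifted_pair())` recovers the 2-pixel shift at interior pixels, and `depth_from_disparity` converts a disparity to metric depth once a focal length and baseline are known from calibration. On real hardware, a library routine such as OpenCV's block matcher would normally replace this loop for speed.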