{"title":"机器人快速图像目标识别方法","authors":"A. Harvey, H. Cohen","doi":"10.1109/ETFA.1992.683330","DOIUrl":null,"url":null,"abstract":"In Vision guided robotic operations, the rapid location and identification of image objects for assembly is essential. The image object data taken with a ccd camera may be in the form of a grey level, binary or colour image. This paper discusses our work on new speed up methods for object recognition in binary and grey level images. These are a coarse/fine stepping method and a sparse template technique. An extension of the coarse fine technique to give larger speedups is discussed. INTRODUCTION Searches for objects within images are extremely computationally expensive when scale, position and orientation of objects within the image is unknown. This paper reports results of a comprehensive coarse-fine object location program in which the criteria for coarse to fine switching have been investigated. These methods can speed up the column and row position search by a factor of 64. A matching error function is used to switch between coarse and fine search modes. Also a sparse template technique is used to give a further 64 fold speed up even more a for larger templates. This paper reports position search speed up times obtained on a number plate reading project. This work is of importance in vision guided assembly operations where machine vision techniques are used for locating parts and was first applied to a vehicle number plate location system. COARSE FINE COLUMN SEARCH By applying a reduced column search technique to object location in an image, a large reduction in computation may . be made. A basic approach is to move the template over the raster scan image, computing the mismatch function at each pixel location. Clearly if the matching calculations could be made at a reduced number of locations, ie a coarse search at say every fourth or fifth. pixel then a large speed up in the matching operation could be made. If a check is made by comparing the current matching error with the error calculation at the previous pixel position, then a decision can be made to move along say four or more columns, if the error is not moving towards a match. Alternatively, if the matching error is decreasing rapidly, from A to C in Fig 5, then the template pixel array will be moved along column by column after each calculation. Investigations showed that the template map may be moved along six columns for the coarse search and still find the matching position. A five to one speed up was obtained in this way. It is important to note that speedup factors are image dependent. COARSE FINE ROW SEARCH Similarly, a coarse fine (reduced) row search using a change in the image matching criterion between rows as a switch will give a speed up factor of the order of four or five to one assuming similar spatial changes in the vertical and horizontal directions of the image. This will be the case for images of uniform texture. More structured images of an industrial nature will not be so predictable. However matching error information is not as readily available in the case of the reduced row search. The matching error change method used was to save the lowest value of the matching error for the previous two rows and make a coarse fine search decision on the basis of this difference. An important point to consider in coarse fine searches is the dimensions of the template. A template four columns wide will have a much narrower correlation region than a 40 column wide template. Thus in certain cases for instance in the case of whole of image movement, where the template size is not fixed, it may be better to make the template longer (more columns) but not as wide, ie less rows if a coarse column search only is to be made. SPARSE TEMPLATES Matching error calculations may be speeded up by the use of sparse partial templates in which only say every fourth column and row brightness value of the template is checked against the image to be searched. A 16 times reduction in error calculations can be made in this way. For the case of noise-free images and templates, there is no need to use the full template. For the more practical case of differences between the image object and it's template, the full template would be used when the matching error indicates that a match is close to the current search position. As the template becomes more sparse, the chance of falsely switching to a full template becomes quite large, setting an upper limit to the sparseness of the template. Another way the template calculation may be speeded up is by the use of a smaller template array. For textured or fractal type images, this is an important option. COARSE FINE SEARCH WITH BACKTRACKING. By allowing the column search to reverse in direction, the coarse-fine search step may be doubled, doubling the coarse fine speedup. Since the translation can reverse, we can move to the far side of the correlation region, point B in diagram 5 during the coarse search and then move backwards to the point of best match, ie point D. This gives a further factor of four speedup to the coarse-fine row, column search, APPLICATION TO OBJECTS OF VARYING SCALE If a plot is made of matching error versus image size or scale a finite correlation region typically of the order of 10 pixels will be found for all but completely random images. On the diagrams page, diagrams two and three 2 and 3 on the last page of the paper for the pebbles image. This result indicates that a finite set of templates may be used to find the object of unknown scale by making coarse steps in the template scale to form a finite template library. Again it is not necessary to perform a matching calculation over the entire template and sparse template techniques can also be used to reduce the number of matching calculations for each template. Sparse template techniques may also be used to reduce the calculation burden for each template. The number of competing template candidates in the image will a lso in f luence the template sca le s tep requi red to differentiate between the template objects. Also see reference 5.","PeriodicalId":154369,"journal":{"name":"IEEE International Workshop on Emerging Technologies and Factory Automation,","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Methods For Fast Image Object Recognition In Robotics\",\"authors\":\"A. Harvey, H. Cohen\",\"doi\":\"10.1109/ETFA.1992.683330\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Vision guided robotic operations, the rapid location and identification of image objects for assembly is essential. The image object data taken with a ccd camera may be in the form of a grey level, binary or colour image. This paper discusses our work on new speed up methods for object recognition in binary and grey level images. These are a coarse/fine stepping method and a sparse template technique. An extension of the coarse fine technique to give larger speedups is discussed. INTRODUCTION Searches for objects within images are extremely computationally expensive when scale, position and orientation of objects within the image is unknown. This paper reports results of a comprehensive coarse-fine object location program in which the criteria for coarse to fine switching have been investigated. These methods can speed up the column and row position search by a factor of 64. A matching error function is used to switch between coarse and fine search modes. Also a sparse template technique is used to give a further 64 fold speed up even more a for larger templates. This paper reports position search speed up times obtained on a number plate reading project. This work is of importance in vision guided assembly operations where machine vision techniques are used for locating parts and was first applied to a vehicle number plate location system. COARSE FINE COLUMN SEARCH By applying a reduced column search technique to object location in an image, a large reduction in computation may . be made. A basic approach is to move the template over the raster scan image, computing the mismatch function at each pixel location. Clearly if the matching calculations could be made at a reduced number of locations, ie a coarse search at say every fourth or fifth. pixel then a large speed up in the matching operation could be made. If a check is made by comparing the current matching error with the error calculation at the previous pixel position, then a decision can be made to move along say four or more columns, if the error is not moving towards a match. Alternatively, if the matching error is decreasing rapidly, from A to C in Fig 5, then the template pixel array will be moved along column by column after each calculation. Investigations showed that the template map may be moved along six columns for the coarse search and still find the matching position. A five to one speed up was obtained in this way. It is important to note that speedup factors are image dependent. COARSE FINE ROW SEARCH Similarly, a coarse fine (reduced) row search using a change in the image matching criterion between rows as a switch will give a speed up factor of the order of four or five to one assuming similar spatial changes in the vertical and horizontal directions of the image. This will be the case for images of uniform texture. More structured images of an industrial nature will not be so predictable. However matching error information is not as readily available in the case of the reduced row search. The matching error change method used was to save the lowest value of the matching error for the previous two rows and make a coarse fine search decision on the basis of this difference. An important point to consider in coarse fine searches is the dimensions of the template. A template four columns wide will have a much narrower correlation region than a 40 column wide template. Thus in certain cases for instance in the case of whole of image movement, where the template size is not fixed, it may be better to make the template longer (more columns) but not as wide, ie less rows if a coarse column search only is to be made. SPARSE TEMPLATES Matching error calculations may be speeded up by the use of sparse partial templates in which only say every fourth column and row brightness value of the template is checked against the image to be searched. A 16 times reduction in error calculations can be made in this way. For the case of noise-free images and templates, there is no need to use the full template. For the more practical case of differences between the image object and it's template, the full template would be used when the matching error indicates that a match is close to the current search position. As the template becomes more sparse, the chance of falsely switching to a full template becomes quite large, setting an upper limit to the sparseness of the template. Another way the template calculation may be speeded up is by the use of a smaller template array. For textured or fractal type images, this is an important option. COARSE FINE SEARCH WITH BACKTRACKING. By allowing the column search to reverse in direction, the coarse-fine search step may be doubled, doubling the coarse fine speedup. Since the translation can reverse, we can move to the far side of the correlation region, point B in diagram 5 during the coarse search and then move backwards to the point of best match, ie point D. This gives a further factor of four speedup to the coarse-fine row, column search, APPLICATION TO OBJECTS OF VARYING SCALE If a plot is made of matching error versus image size or scale a finite correlation region typically of the order of 10 pixels will be found for all but completely random images. On the diagrams page, diagrams two and three 2 and 3 on the last page of the paper for the pebbles image. This result indicates that a finite set of templates may be used to find the object of unknown scale by making coarse steps in the template scale to form a finite template library. Again it is not necessary to perform a matching calculation over the entire template and sparse template techniques can also be used to reduce the number of matching calculations for each template. Sparse template techniques may also be used to reduce the calculation burden for each template. The number of competing template candidates in the image will a lso in f luence the template sca le s tep requi red to differentiate between the template objects. Also see reference 5.\",\"PeriodicalId\":154369,\"journal\":{\"name\":\"IEEE International Workshop on Emerging Technologies and Factory Automation,\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE International Workshop on Emerging Technologies and Factory Automation,\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ETFA.1992.683330\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Workshop on Emerging Technologies and Factory Automation,","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETFA.1992.683330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Methods For Fast Image Object Recognition In Robotics
In Vision guided robotic operations, the rapid location and identification of image objects for assembly is essential. The image object data taken with a ccd camera may be in the form of a grey level, binary or colour image. This paper discusses our work on new speed up methods for object recognition in binary and grey level images. These are a coarse/fine stepping method and a sparse template technique. An extension of the coarse fine technique to give larger speedups is discussed. INTRODUCTION Searches for objects within images are extremely computationally expensive when scale, position and orientation of objects within the image is unknown. This paper reports results of a comprehensive coarse-fine object location program in which the criteria for coarse to fine switching have been investigated. These methods can speed up the column and row position search by a factor of 64. A matching error function is used to switch between coarse and fine search modes. Also a sparse template technique is used to give a further 64 fold speed up even more a for larger templates. This paper reports position search speed up times obtained on a number plate reading project. This work is of importance in vision guided assembly operations where machine vision techniques are used for locating parts and was first applied to a vehicle number plate location system. COARSE FINE COLUMN SEARCH By applying a reduced column search technique to object location in an image, a large reduction in computation may . be made. A basic approach is to move the template over the raster scan image, computing the mismatch function at each pixel location. Clearly if the matching calculations could be made at a reduced number of locations, ie a coarse search at say every fourth or fifth. pixel then a large speed up in the matching operation could be made. If a check is made by comparing the current matching error with the error calculation at the previous pixel position, then a decision can be made to move along say four or more columns, if the error is not moving towards a match. Alternatively, if the matching error is decreasing rapidly, from A to C in Fig 5, then the template pixel array will be moved along column by column after each calculation. Investigations showed that the template map may be moved along six columns for the coarse search and still find the matching position. A five to one speed up was obtained in this way. It is important to note that speedup factors are image dependent. COARSE FINE ROW SEARCH Similarly, a coarse fine (reduced) row search using a change in the image matching criterion between rows as a switch will give a speed up factor of the order of four or five to one assuming similar spatial changes in the vertical and horizontal directions of the image. This will be the case for images of uniform texture. More structured images of an industrial nature will not be so predictable. However matching error information is not as readily available in the case of the reduced row search. The matching error change method used was to save the lowest value of the matching error for the previous two rows and make a coarse fine search decision on the basis of this difference. An important point to consider in coarse fine searches is the dimensions of the template. A template four columns wide will have a much narrower correlation region than a 40 column wide template. Thus in certain cases for instance in the case of whole of image movement, where the template size is not fixed, it may be better to make the template longer (more columns) but not as wide, ie less rows if a coarse column search only is to be made. SPARSE TEMPLATES Matching error calculations may be speeded up by the use of sparse partial templates in which only say every fourth column and row brightness value of the template is checked against the image to be searched. A 16 times reduction in error calculations can be made in this way. For the case of noise-free images and templates, there is no need to use the full template. For the more practical case of differences between the image object and it's template, the full template would be used when the matching error indicates that a match is close to the current search position. As the template becomes more sparse, the chance of falsely switching to a full template becomes quite large, setting an upper limit to the sparseness of the template. Another way the template calculation may be speeded up is by the use of a smaller template array. For textured or fractal type images, this is an important option. COARSE FINE SEARCH WITH BACKTRACKING. By allowing the column search to reverse in direction, the coarse-fine search step may be doubled, doubling the coarse fine speedup. Since the translation can reverse, we can move to the far side of the correlation region, point B in diagram 5 during the coarse search and then move backwards to the point of best match, ie point D. This gives a further factor of four speedup to the coarse-fine row, column search, APPLICATION TO OBJECTS OF VARYING SCALE If a plot is made of matching error versus image size or scale a finite correlation region typically of the order of 10 pixels will be found for all but completely random images. On the diagrams page, diagrams two and three 2 and 3 on the last page of the paper for the pebbles image. This result indicates that a finite set of templates may be used to find the object of unknown scale by making coarse steps in the template scale to form a finite template library. Again it is not necessary to perform a matching calculation over the entire template and sparse template techniques can also be used to reduce the number of matching calculations for each template. Sparse template techniques may also be used to reduce the calculation burden for each template. The number of competing template candidates in the image will a lso in f luence the template sca le s tep requi red to differentiate between the template objects. Also see reference 5.