{"title":"A novel approach to computer-aided diagnosis of mammographic images","authors":"H. Sari-Sarraf, S. Gleason, K. Hudson, K. Hubner","doi":"10.1109/ACV.1996.572060","DOIUrl":"https://doi.org/10.1109/ACV.1996.572060","url":null,"abstract":"The article is a work-in-progress report of a research endeavor that deals with the design and development of a novel approach to computer aided diagnosis (CAD) of mammographic images. With the initial emphasis being on the analysis of microcalcifications, the proposed approach defines a synergistic paradigm that utilizes new methodologies together with previously developed techniques. The new paradigm is intended to promote a higher degree of accuracy in CAD of mammograms with an increased overall throughput. The process of accomplishing these goals is initiated by the fractal encoding of the input image, which gives rise to the generation of focus-of-attention regions (FARs), that is, regions that contain anomalies. The primary thrust of this work is to demonstrate that by considering FARs, rather than the entire input image, the performances of the ensuing processes (i.e., segmentation, feature extraction, and classification) are enhanced in terms of accuracy and speed. An experimental study is included that demonstrates the impact of FAR generation on the process of microcalcification segmentation.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125259564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Bolle, J. Connell, N. Haas, R. Mohan, G. Taubin
{"title":"VeggieVision: a produce recognition system","authors":"R. Bolle, J. Connell, N. Haas, R. Mohan, G. Taubin","doi":"10.1109/ACV.1996.572062","DOIUrl":"https://doi.org/10.1109/ACV.1996.572062","url":null,"abstract":"The authors present an automatic product 1D system (\"VeggieVision\"), intended to ease the produce checkout process. The system consists of an integrated scale and imaging system with a user-friendly interface. When a produce item is placed on the scale, an image is taken. A variety of features, color, texture (shape, density), are then extracted. These features are compared to stored \"signatures\" which were obtained by prior system training (either on-line or off-line). Depending on the certainty of the classification, the final decision is made either by the system or by a human from a number of choices selected by the system. Over 95% of the time, the correct produce classification is in the top four choices.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122498937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Histogram refinement for content-based image retrieval","authors":"Greg Pass, R. Zabih","doi":"10.1109/ACV.1996.572008","DOIUrl":"https://doi.org/10.1109/ACV.1996.572008","url":null,"abstract":"Color histograms are widely used for content-based image retrieval. Their advantages are efficiency, and insensitivity to small changes in camera viewpoint. However, a histogram is a coarse characterization of an image, and so images with very different appearances can have similar histograms. We describe a technique for comparing images called histogram refinement, which imposes additional constraints on histogram based matching. Histogram refinement splits the pixels in a given bucket into several classes, based upon some local property. Within a given bucket, only pixels in the same class are compared. We describe a split histogram called a color coherence vector (CCV), which partitions each histogram bucket based on spatial coherence. CCVs can be computed at over 5 images per second on a standard workstation. A database with 15,000 images can be queried using CCVs in under 2 seconds. We demonstrate that histogram refinement can be used to distinguish images whose color histograms are indistinguishable.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117211017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real-time recognition of activity using temporal templates","authors":"A. Bobick, James W. Davis","doi":"10.1109/ACV.1996.571995","DOIUrl":"https://doi.org/10.1109/ACV.1996.571995","url":null,"abstract":"A new view based approach to the representation and recognition of action is presented. The basis of the representation is a motion history image (MHI)-a static image where intensity is a function of the recency of motion in a sequence. We develop a recognition method which uses both binary and scalar valued versions of the MHI as temporal templates to match against stored instances of actions. The method automatically performs temporal segmentation, as invariant to linear changes in speed, and runs in real time on a standard platform. The applications we have begun to develop include simple room monitoring and an interactive game.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"120 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132384519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An evolutive OCR system based on continuous learning","authors":"Frank Lebourgeois, Jean-Luc Henry","doi":"10.1109/ACV.1996.572073","DOIUrl":"https://doi.org/10.1109/ACV.1996.572073","url":null,"abstract":"The paper presents an evolutive OCR system based on a cooperation between the recognition stage and the contextual stage which makes possible continuous training. The authors use the contextual correction in order to modify the behavior of the recognition stage by adjusting the internal representation of character models. They also introduce a specific classifier suitable for continuous training. The proposed classifier is based on the k-nearest neighbor rule modified by the introduction of weights. During the continuous training, the system selects models of pattern which contribute actively to a correct recognition.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132500470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Color and texture fusion: application to aerial image segmentation and GIS updating","authors":"M. Jolly, Alok Gupta","doi":"10.1109/ACV.1996.571985","DOIUrl":"https://doi.org/10.1109/ACV.1996.571985","url":null,"abstract":"The paper describes an algorithm for combining color and texture information for the segmentation of color images. The algorithm uses maximum likelihood classification combined with a certainty based fusion criterion. The algorithm was validated using mosaics of real color textures. It was also tested on real outdoor color scenes and aerial images. This algorithm is part of a more complex system which is currently being designed to assist an operator in updating an old map of an area using aerial images.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114823002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yukio Matsuyama, Toshifumi Honda, H. Yamamura, H. Sasazawa, M. Nomoto, Takanori Ninomiya, A. Schick, L. Listl, P. Köllensperger, D. Spriegel, P. Mengel, Richard Schneider
{"title":"Automated solder joint inspection system using optical 3-D image detection","authors":"Yukio Matsuyama, Toshifumi Honda, H. Yamamura, H. Sasazawa, M. Nomoto, Takanori Ninomiya, A. Schick, L. Listl, P. Köllensperger, D. Spriegel, P. Mengel, Richard Schneider","doi":"10.1109/ACV.1996.572014","DOIUrl":"https://doi.org/10.1109/ACV.1996.572014","url":null,"abstract":"An automated system has been developed for visually inspecting the solder joints of SMDs (Surface Mounted Devices). The system is capable of inspecting fine pitch components down to 0.3 mm pitch QFPs (Quad Flat Packages). A unique image detection method was also developed to obtain precise 3-D images of solder joints. The principle of a confocal microscope is employed but plural sensors are used to detect reflected light at different focusing positions simultaneously. The system is unaffected by secondary reflection and dead angles. The warp in a PC (Printed Circuit) board surface is calculated in real time using the detected 3-D images, and board height to be detected in successive areas is predicted based on this calculation. Real-time automatic focusing control is then performed using newly developed defect detection algorithms, the system can recognize leads, pads and solder fillets from the detected images. Because 3-D shape features are extracted and used for defect judgment, user-defined parameters have been made easy to understand and/or to modify. Operational evaluation of the system confirms a 100% defect detection rate and a very low false alarm rate (0.16%).","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127523027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Document layout structure extraction using bounding boxes of different entitles","authors":"Jisheng Liang, J. Ha, R. Haralick, I. T. Phillips","doi":"10.1109/ACV.1996.572074","DOIUrl":"https://doi.org/10.1109/ACV.1996.572074","url":null,"abstract":"The paper presents an efficient technique for document page layout structure extraction and classification by analyzing the spatial configuration of the bounding boxes of different entities on the given image. The algorithm segments an image into a list of homogeneous zones. The classification algorithm labels each zone as test, table, line-drawing, halftone, ruling, or noise. The text lines and words are extracted within text zones and neighboring text lines are merged to form text blocks. The tabular structure is further decomposed into row and column items. Finally, the document layout hierarchy is produced from these extracted entities.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130344545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Position estimation from outdoor visual landmarks for teleoperation of lunar rovers","authors":"Fabio Gagliardi Cozman, E. Krotkov","doi":"10.1109/ACV.1996.572046","DOIUrl":"https://doi.org/10.1109/ACV.1996.572046","url":null,"abstract":"The paper presents a new application of computer vision to space robotics: a teleoperation interface which analyzes images sent by a mobile robot in space missions and produces position estimates based an the images. The estimates are displayed to the robot operator as additional information to prevent loss of orientation. The current version of the interface detects mountain formations in images and automatically searches for mountain peaks in a given topographic map. A new algorithm for position estimation uses a statistical description of the various disturbances and signals in the measurement process to produce estimates. The authors have tested the system with real images obtained in the Pittsburgh East and Dromedary Peak USGS quadrangles; they report significant improvements in speed and accuracy compared to previous systems.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129546894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Handwritten Numeral Recognition Using Personal Handwriting Characteristics Based On Clustering Method","authors":"Y. Hotta, S. Naoi, M. Suwa","doi":"10.1109/ACV.1996.572078","DOIUrl":"https://doi.org/10.1109/ACV.1996.572078","url":null,"abstract":"To improve recognition rate, it is important not only to utilize one character feature but personal handwriting characteristics. This paper realizes above approach based on our investigation result that characters written by the same writer have similar shapes and that there are several shapes even in the same category. In our method, clustering method is used to absorb the variance of character shapes in the category. First, character recognition for each character is executed. Next, misrecognized character candidates are extracted as isolated cluster by within-category clustering. Then, recognition results of the extracted characters are amended by between-category clustering which evaluates the distance between the cluster composed of misrecognized characters and the cluster composed of correctly recognized characters in every categories. Finally, experimental results shows that recognition rate is remarkably improved by our method.","PeriodicalId":222106,"journal":{"name":"Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96","volume":"33 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114097180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}