Stylianos D. Tzikopoulos, H. Georgiou, M. Mavroforakis, N. Dimitropoulos, S. Theodoridis
{"title":"A fully automated complete segmentation scheme for mammograms","authors":"Stylianos D. Tzikopoulos, H. Georgiou, M. Mavroforakis, N. Dimitropoulos, S. Theodoridis","doi":"10.1109/ICDSP.2009.5201262","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201262","url":null,"abstract":"This paper presents a fully automated complete segmentation method for mammographic images. Image preprocessing techniques are first applied to mammograms to remove the noise and then a breast boundary extraction algorithm is implemented, in order to distinguish breast tissue from the background. Next, an improved version of an existing pectoral muscle scheme is performed and a new nipple segmentation technique is applied, detecting the nipple when it is in profile. This improves the estimated breast boundary and serves as a key-point for further processing of the image. This composite method has been implemented and applied to miniMIAS, one of the most well-known mammographic databases. This database consists of 322 mediolateral oblique (MLO) view mammograms, obtained via a digitization procedure. The results are evaluated by an expert radiologist and are very promising. Accordingly, it is expected that this procedure can produce improved results, when applied to high-quality digital mammograms.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127731011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combined de-noising and sharpening of color images in DCT domain","authors":"R. Bilcu, S. Alenius, Markku Vehviläinen","doi":"10.1109/ICDSP.2009.5201097","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201097","url":null,"abstract":"In this paper we present a new DCT-based method for combined de-noising and sharpening of color images. Our solution combines alpha-rooting and thresholding the DCT coefficients to achieve both sharpening and noise reduction. In our approach, sharpening and de-noising are done in the YUV color space with Y, U and V components being processed differently. Moreover, the size of the sliding DCT transform is variable and adaptive to the local characteristics of the input image thus increasing the visual quality of the processed image. We describe our method in detail and we present experimental results, performed on color images, to compare its performance with other existing solutions.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116884645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Khademi, Danoush Hosseinzadeh, A. Venetsanopoulos, A. Moody
{"title":"Nonparametric statistical tests for exploration of correlation and nonstationarity in images","authors":"A. Khademi, Danoush Hosseinzadeh, A. Venetsanopoulos, A. Moody","doi":"10.1109/ICDSP.2009.5201186","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201186","url":null,"abstract":"This work proposes two statistical-based techniques to quantify (with confidence) whether random 2D data (images) are correlated or nonstationary. Traditionally, such exploratory data analysis techniques have been developed for 1D signals, such as EEG. This paper presents a new application of Mantel's test for clustering to examine spatial dependence and a novel 2D extension of the traditional 1D version of the reverse arrangements test to examine data nonstationary. Simulated data (correlated and nonstationary) were generated and subject to several rotations, scales and translations, in order to test the robustness of the techniques. Mantel's test for clustering correctly classified the images as correlated for 100% of the cases (including those with rotations, scales and translations (RSTs)). For the 2D extension of the reverse arrangements test, the linear trend analysis correctly found 15/16 regions to have pixel-wise nonstationarity, and the nonlinear trend analysis correctly classified nonstationarity in all but two cases (14/16) (for all RSTs). As a result of the high classification rates, the techniques are relatively invariant to changes in RST. These two statistical tests have a variety of applications in medical imaging (i.e. modeling), and are discussed in this work. An additional application of the work is presented in the end, demonstrating the possibility that such test statistics may be used as features to classify different textures.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131007789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital watermarking in peer to peer networks","authors":"D. Tsolis, S. Sioutas, T. Papatheodorou","doi":"10.1109/ICDSP.2009.5201108","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201108","url":null,"abstract":"As a general and effective protection measure for copyright violations which occur with the use of digital technologies including peer to peer (P2P) networks, copyright owners often use digital watermarking techniques so as to encrypt copyright information to the content or otherwise restrict or even block access to the digital content through the Internet and the P2P infrastructure. This paper claims that DRM and P2P can be quite complementary. Specifically, a P2P infrastructure is presented which allows broad digital content exchange while on the same time supports copyright protection and management through watermarking technologies for digital images.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132912816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An improved CT to fluoroscopy registration algorithm for the kinematic analysis of knee joints","authors":"M. Pickering, J. Scarvell, Paul N. Smith","doi":"10.1109/ICDSP.2009.5201122","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201122","url":null,"abstract":"Understanding the kinematics of the knee joint as it undergoes normal physical activities is an important goal of the orthopaedic research community. The limitations of previous approaches to capturing knee kinematics include the requirement for tantalum beads to be implanted prior to imaging and the use of biplanar X-ray imaging which is not commonly available in hospital imaging departments. A recently proposed alternative approach is to register 3D CT data with individual 2D video frames of the knee captured using single-plane X-ray fluoroscopy. The main limitation of this approach has been the inaccuracy of the registration algorithm for out-of-plane translation. In this paper we propose an improved registration algorithm which uses the relatively new Cross-Cumulative Residual Entropy similarity measure. Experimental results show that the accuracy of our proposed technique is superior to previous approaches particularly for out-of-plane translation.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134583764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"No-reference video quality measurement using neural networks","authors":"J. Choe, Kwon Lee, Chulhee Lee","doi":"10.1109/ICDSP.2009.5201054","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201054","url":null,"abstract":"Objective video quality measurements emerge as an important issue as multimedia data is increasingly transmitted over the channels where bandwidth may not be guaranteed. Among various objective models for video quality measurement, no-reference models have the largest application areas. In this paper, we propose a no-reference video quality assessment method for H.264 using artificial neural networks. Various features are extracted from H.264 bit-stream data and these features are inputted to a neural network. The neural network is trained to predict subjective video quality scores obtained by a number of evaluators. Experimental results show promising results, though a larger database would be required to train neural networks to provide robust performance.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121371894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Theodore Petsatodis, Aristodemos Pnevmatikakis, F. Talantzis, U. Díaz-Orueta
{"title":"Interactive surfaces for enhanced cognitive care","authors":"Theodore Petsatodis, Aristodemos Pnevmatikakis, F. Talantzis, U. Díaz-Orueta","doi":"10.1109/ICDSP.2009.5201158","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201158","url":null,"abstract":"Cognitive care for the elderly can be provided by training their cognitive skills in order to reduce age-related decline of cognitive capabilities. This is typically achieved with cognitive games that can benefit from computer interfaces. In this paper we present the design and use of a multi-touch surface that serves as a front-end for games designed to specifically support the decline in declarative and prospective memory. The selected games are a result of both user studies that identified a set of requirements for the design of the surface and previous research on this issue. The system can be potentially embedded on the surface of a table. It comprises of a modified Thin Film Transistor panel along with an acrylic surface. A video camera is used to capture the Frustrated Total Internal Reflection of fingers when these are subjected to infrared illuminators. Multi-touch functionality is achieved by a Kalman-based multiple target tracking algorithm that runs upon the feed of the camera and detects any number of fingers and their movement. Results indicate that users perceive the experience as a positive and functional process.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128844904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Effects of time lapse on Speaker Recognition results","authors":"H. Beigi","doi":"10.1109/ICDSP.2009.5201239","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201239","url":null,"abstract":"The effect of time lapse has not been studied well in most biometrics. Here, this effect is studied for Speaker Recognition, namely, Speaker Identification and Speaker Verification. The RecoMadeEasyTM speaker recognition engine has been used to obtain baseline results for 22 speakers who have been involved in a long-term study. The speakers have given data in three seatings with 1 to 2 months delay between consecutive collections. The speakers were real proficiency test candidates who were asked to speak in response to prompts. At each seating, several recordings were made in response to different prompts. The error rates are discussed, going from one seating to the next, for Identification and Verification. Large degradations are seen across different seatings. Two different adaptation techniques have been studied for reducing this discrepancy with very promising results.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128518056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Ntalampiras, D. Arsic, A. Störmer, T. Ganchev, I. Potamitis, N. Fakotakis
{"title":"PROMETHEUS database: A multimodal corpus for research on modeling and interpreting human behavior","authors":"S. Ntalampiras, D. Arsic, A. Störmer, T. Ganchev, I. Potamitis, N. Fakotakis","doi":"10.1109/ICDSP.2009.5201142","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201142","url":null,"abstract":"The present paper describes the construction of a multimodal database, referred to as the PROMETHEUS database, which contains recordings from heterogeneous sensors. The main purpose of this database is the development of a framework for monitoring and interpretation of human behavior in unrestricted environments of both indoor and outdoor type. It contains single-person and multi-person scenarios, but also covers scenarios with interactions between groups of people. It is devoted to detection of typical and atypical events, while care has been to taken for the recordings to be as close to real-world conditions as possible. The uniqueness of the PROMETHEUS database comes not only from the unique sensor sets but is due primarily to its generic design, which allows for embracing a wide range of real-world applications (including smart-home and human-robot interaction interfaces, indoors/outdoors public areas surveillance etc).","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129117524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Arsic, Björn Schuller, Benedikt Hörnler, G. Rigoll
{"title":"Resolving partial occlusions in crowded environments utilizing range data and video cameras","authors":"D. Arsic, Björn Schuller, Benedikt Hörnler, G. Rigoll","doi":"10.1109/ICDSP.2009.5201162","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201162","url":null,"abstract":"Video surveillance systems are omnipresent in our daily life, but still suffer from some drawbacks, which hardens the integration of fully automated systems. Currently standards CCD sensors are used to monitor public and private spaces. These are not yet able to resolve revere occlusions in narrow environments. Therefore we suggest the integration of 3D sensors, in particular a photonic mixture device, into current frameworks, in order to support the reliable detection and segmentation in dense situations. We propose the use of basic techniques to segment persons in range data, to guarantee real-time processing capabilities. With a reliable foreground segmentation and the computation of depth gradients the segmentation performance will drastically rise.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115612589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}