Rong-Chi Chang, Yun-Long Sie, Su-Mei Chou, T. Shih
{"title":"Photo defect detection for image inpainting","authors":"Rong-Chi Chang, Yun-Long Sie, Su-Mei Chou, T. Shih","doi":"10.1109/ISM.2005.91","DOIUrl":"https://doi.org/10.1109/ISM.2005.91","url":null,"abstract":"Image inpainting (or image completion) techniques use textural or structural information to repair or fill damaged portions of a picture. However, most techniques require a human to identify the portion to be inpainted. We developed a new mechanism which can automatically detect defective portions of a photo, including damage from color ink spray and scratch drawing. The mechanism is based on several filters and structural information about the damage. Old photos from the author's family are used for testing. Preliminary results show that most damage can be automatically detected without human involvement. The mechanism is integrated with our inpainting algorithms to complete a fully automatic photo defect repair system.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120920051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient and fair multi-level packet scheduling for differentiated services","authors":"Chin-Chi Wu, H. Wu, Woei Lin","doi":"10.1109/ISM.2005.53","DOIUrl":"https://doi.org/10.1109/ISM.2005.53","url":null,"abstract":"As the demands on quality of service (QoS) of real-time applications over the Internet increase, many research efforts have developed various packet scheduling schemes to support differentiated services. In this paper, we propose a new multi-level packet scheduling algorithm, MLDDRR, enhanced from the existing dynamic deficit round-robin (DDRR) for the support of delay-sensitive applications. The network operator can simply change the level of service differentiation by adjusting parameters. The MLDDRR can achieve high throughput efficiently and simultaneously provide smaller delay for short packets of each service class. The feature of small delay for short packets is of great importance for improving the playback quality of real-time applications such as VoIP or scalable media stream delivery. Simulation results showing the high effectiveness and small overhead of MLDDRR are also presented.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"315 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120948708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SIOX: simple interactive object extraction in still images","authors":"G. Friedland, Kristian Jantz, R. Rojas","doi":"10.1109/ISM.2005.106","DOIUrl":"https://doi.org/10.1109/ISM.2005.106","url":null,"abstract":"The following article presents an approach for interactive foreground extraction in still images that is currently being integrated into the GIMP. The presented approach has been derived from color signatures, a technique originating from image retrieval. The article explains the algorithm and presents some benchmark results to show the improvements in speed and accuracy compared to state of the art solutions. The article also describes how the algorithm can easily be adapted for video segmentation tasks.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123836867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nonlinear dynamical analysis of normal voices","authors":"M. E. Dajer, J. Pereira, Carlos Dias Maciel","doi":"10.1109/ISM.2005.84","DOIUrl":"https://doi.org/10.1109/ISM.2005.84","url":null,"abstract":"The human voice has been a focus of study in different areas of science. Research over the last two decades has established the existence of chaos in human voice production. The purpose of this paper is to use nonlinear dynamics methods in the analysis of normal voices from healthy subjects and correlate them to traditional acoustic parameters as well as perceptual analysis. Twelve human voice signals from healthy subjects, 6 males and 6 females, ranging in age from 19 to 39 years old, were used. Sustained vowel sounds /a/, /e/ and /i/ from Brazilian Portuguese were recorded at a sampling rate of 22,050 Hz and analyzed in order to obtain acoustic perturbation measures (jitter, shimmer, coefficient of excess - EX, and pitch amplitude - PA). The phase space reconstruction method was used to describe the nonlinear dynamic characteristics of voice signal samples. This paper shows that nonlinear dynamical methods such as phase space reconstruction seem to be a suitable technique for voice signal analysis, due to the chaotic component of the human voice. The results suggest that nonlinear dynamic analysis does not replace existing techniques; instead, it may improve and complement the voice analysis methods currently available to health professionals, speech therapists and clinicians.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128151702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Study on rounding errors of IntMDCT in perceptual audio coding","authors":"Te Li, R. Yu, S. Koh","doi":"10.1109/ISM.2005.111","DOIUrl":"https://doi.org/10.1109/ISM.2005.111","url":null,"abstract":"With the proliferation of broadband access and the continuous decline of storage price per gigabyte, there has been an increasing demand for audio solutions that provide high sampling rates and high resolution. Lossless audio is undoubtedly the ultimate solution. In response to this demand, MPEG issued a call for proposals soliciting technology contributions that provide a state-of-the-art solution. On the technology side, lossless compression requires the use of an integer transform. The integer modified discrete cosine transform (IntMDCT) has been adopted in MPEG-4 scalable to lossless (SLS) coding to enable this efficient lossless operation. Because of rounding operations, rounding errors introduced by IntMDCT exist during the whole coding process. Since SLS can operate across the bitrate spectrum, from lossy to lossless, it is of interest to study the effect of rounding errors in IntMDCT when SLS operates in lossy mode. This paper analyzes the contributions of noise due to these errors. It is found that the noise introduced by rounding operations of IntMDCT does not affect the perceptual quality of the coded audio under any circumstances. As such, it concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrates. Given that SLS uses both MDCT and IntMDCT, the finding in this paper suggests the possibility of using only the IntMDCT filterbank.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"296 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132832665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Application layer error correction scheme for video header protection on wireless network","authors":"Chia-Ho Pan, I-Hsien Lee, Sheng-Chieh Huang, Chih-Chi Cheng, Chung-Jr Lian, Liang-Gee Chen","doi":"10.1109/ISM.2005.34","DOIUrl":"https://doi.org/10.1109/ISM.2005.34","url":null,"abstract":"In wireless video streaming applications, video information may be corrupted by a noisy channel. By introducing error resilience and error concealment techniques, many researchers have tried to eliminate quality degradation of the reconstructed picture when decoding corrupted data. By contrast, relatively few works discuss ways to diminish the corruption itself. Hence, system designers need to use different methods to restrict the errors caused by the channel to a tolerable extent. In other words, the system is difficult to implement in a practical design. In this paper, we propose a way to protect the video header information in the application layer without modifying the standardized syntax. Besides, we also consider the channel condition of wireless transmission and propose a way to reduce redundant bits used in channel coding. By doing this, the bitstream can be simply transmitted over a practical wireless network and the reconstructed picture quality outperforms the original one.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134456174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sketch-based retrieval on flash movies via primary scenes","authors":"Yu Yang, Qing Li, Minhao Yu, Yueting Zhuang","doi":"10.1109/ISM.2005.107","DOIUrl":"https://doi.org/10.1109/ISM.2005.107","url":null,"abstract":"As a multimedia format, Flash is becoming more and more popular over the Web. The typical structure of Flash can benefit from both image retrieval and video retrieval methods. In this paper, we present an approach to sketch-based retrieval on Flash movies with analysis of directional and motional relations. Via the selection of primary scenes, query results can be displayed to users in an ideal way. The proposed approach is evaluated on a test set with different genres of Flash movies, and the experiment shows the usefulness of the approach.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134112175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SP-frame selection for video streaming over burst-loss networks","authors":"Wai-tian Tan, Gene Cheung","doi":"10.1109/ISM.2005.108","DOIUrl":"https://doi.org/10.1109/ISM.2005.108","url":null,"abstract":"SP-frame is a new picture type supported by H.264. The traditional usage of SP-frames is for switching between different compressed bit-streams. In this paper, we proposed and evaluated a scheme that uses SP frames as a mechanism to switch within a single compressed stream for the purpose of achieving error resilience and rate scalability. We have only considered the restricted but practical case in which only one secondary SP frame is allowed for every primary SP frame. Nevertheless, simulation results show that the technique can significantly increase the chance of video frames meeting their deadlines, and also improve overall PSNR.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115025632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"BIOGLYPH: biometric identification in pervasive environments","authors":"D. Popel, Elena I. Popel","doi":"10.1109/ISM.2005.41","DOIUrl":"https://doi.org/10.1109/ISM.2005.41","url":null,"abstract":"Despite the wide appreciation of biometric principles in security applications, biometric solutions are far from being affordable and available \"on demand\" anytime and anywhere. Many security biometric solutions require dedicated devices for data acquisition delaying their deployment and limiting the scope. This paper introduces a system developed to identify and authenticate individuals based on their signatures and/or handwriting. The issues of pervasive services are addressed (i) by integrating unique data acquisition and processing techniques which are capable of communicating with a variety of off-the-shelf devices such as pressure sensitive pens, mice, and touch pads, and (ii) by using the self-learning database solutions for achieving accurate results. Aiming to show the architecture of a pervasive biometric system, this paper does not go into technical details of methods and structures.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"198 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114400164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Variable thresholding based multiple description video coding","authors":"S. Pavan, Sridhar Gangadharpalli, V. Sridhar","doi":"10.1109/ISM.2005.118","DOIUrl":"https://doi.org/10.1109/ISM.2005.118","url":null,"abstract":"Video streaming applications have become increasingly popular in the recent years, and their use will continue to grow in the future. However, variations in channel conditions cause delays and packet erasures resulting in degradation of quality of service (QoS). Multiple description (MD) coding is one method to reduce the detrimental effects caused by these channel variations. In this paper we have presented a novel pre-processing MD approach, which makes use of the redundancies already present in the original sequence to create multiple descriptions. The proposed approach is compatible with the commonly used video coding standards such as the H.26x and MPEG. The proposed scheme improves performance in terms of the coding efficiency and error resiliency compared to the approaches present in the literature.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117157087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}