{"title":"Hybrid fitness assignment strategy in IGA","authors":"F. Sugimoto, M. Yoneyama","doi":"10.1109/MMSP.2002.1203301","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203301","url":null,"abstract":"We have been developing a hybrid fitness assignment strategy to realize a natural interaction in IGA. The strategy allows a user to select some individuals and evaluate a grade that shows how the selected individual resembles a target image. In this paper, we will show a method to compose fitness when a user selects two individuals in the hybrid fitness assignment strategy. It is known that better performance is obtained when two individuals are selected in the generations limited with a condition. The condition is equivalent to the actual situation in which it is difficult for a user to select only one individual. The hybrid strategy is useful to realize a more natural interaction in the actual situation.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121879903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Energy based collaborative source localization using acoustic micro-sensor array","authors":"Y. Hu, Dan Li","doi":"10.1109/MMSP.2002.1203323","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203323","url":null,"abstract":"A novel sensor network source localization method based on acoustic energy measurements is presented. This method makes use of the characteristics that the acoustic energy decays exponentially with respect to the distance from an omni-directional acoustic source. By comparing energy readings measured at surrounding acoustic sensors during the same time interval can be accurately estimated. We show that the potential target location is restricted to a hyper-sphere in the sensor field given the acoustic energy reading at a pair of sensors. Given multiple sensor acoustic energy readings, the target location is solved as the location that is closest (in the least square sense) to all the corresponding hyper-spheres. We further simplified this nonlinear least square problem to an unconstrained linear least square problem that yields a closed form solution. Experiment results using military vehicle acoustic data show great promise of this novel approach.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124503020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beat-ID: identifying music via beat analysis","authors":"D. Kirovski, H. Attias","doi":"10.1109/MMSP.2002.1203279","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203279","url":null,"abstract":"Music identification is an effective tool that enables multimedia players to extract a distinct statistical digest of the played content, look up into a music database using the extracted unique identifier, and then take advantage of the services available for that particular content. In this paper, we introduce beat-IDs, the first music identification system that creates the digest of the music clip by understanding the basic structure of every musical piece: its beat. A beat-ID is created in two steps: first, the system detects the average beat period of a given music clip using a modified EM algorithm and then, it analyzes the statistical properties of the clip with respect to the detected beats. The extracted 32-byte beat-ID contains two components: the length of the average beat period and a compressed statistical digest of signal's energy distribution in an average beat period. Finally, we introduce an algorithm for matching beat-IDs that quantifies the matching accuracy between two music identifiers using an error analysis. In this paper, the properties of beat-IDs are demonstrated using a relatively small database of audio clips.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127653276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Perkis, Jijun Zhang, Torgunn Halvorsen, J. O. Kjøde, Francisco Rivas
{"title":"A streaming media engine using digital item adaptation","authors":"A. Perkis, Jijun Zhang, Torgunn Halvorsen, J. O. Kjøde, Francisco Rivas","doi":"10.1109/MMSP.2002.1203251","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203251","url":null,"abstract":"This paper describes the use of capability descriptors for MPEG-21 digital item adaptation. The capability descriptors describe the terminal and network for an application involving media resource delivery. The capability descriptors are used in a streaming media environment through a test bed demonstrating possible functionalities of a universal multimedia access (UMA) enabled system. The test bed involves viewer configuration through capability negotiations. The configuration triggers a media resource adaptation for accessing streaming media using a variety of access schemes on a diverse range of terminals. The application focus is towards future generation mobile and wireless multimedia communications, ensuring the user gets the same reliability and seamlessness as in a fixed environment.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"316 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116599574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Streaming agent for wired network/wireless link rate-mismatch environment","authors":"Gene Cheung, Wai-tian Tan, T. Yoshimura","doi":"10.1109/MMSP.2002.1203327","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203327","url":null,"abstract":"It has been shown that an agent located at the junction of wired and wireless links can help streaming media systems identify where packet losses occur and therefore maintain proper end-to-end congestion control. In this paper, we further expand the functionality of such agents in two ways. First, they allow streaming servers to identify the allowed transmission rate in both the wired and wireless parts of the path. Second, they serve as a relay to exploit the difference in transmission rate of the two parts. Simulation results show that PSNR improvement of over 2 dB can be achieved without extra bandwidth usage in the bottleneck links.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116986393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detecting corrupted intra macroblocks in H.263 video","authors":"O. Lehtoranta, T. Hämäläinen, V. Lappalainen","doi":"10.1109/MMSP.2002.1203241","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203241","url":null,"abstract":"Corrupted low frequency data of intra coded macroblocks can significantly degrade quality of video in error prone wireless networks. Therefore, a new method for detecting the corrupted blocks is presented. The method exploits temporal smoothness of video by computing the absolute difference between subsequent video frames. A threshold function is used to highlight the block differences, and a heuristic is developed to detect the corrupted blocks. The proposed method is evaluated with our wireless video simulator, which shows that the method substantially improves image quality of video conferencing sequences in presence of transmission errors. In addition, the method is compared to average intersample difference across the block boundaries (AIDB) algorithm whose performance is shown to be more sensitive to selection of correct threshold values than the proposed method.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134083149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data masking: a secure-covert channel paradigm","authors":"R. Radhakrishnan, K. Shanmugasundaram, N. Memon","doi":"10.1109/MMSP.2002.1203315","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203315","url":null,"abstract":"It is well known that encryption provides secure channels for communicating entities. However, due to lack of covertness on these channels, an eavesdropper can identify encrypted streams through statistical test and capture them for further cryptanalysis. Hence, the communicating entities can use steganography to achieve covertness. In this paper, we propose a new form of multimedia steganography called data masking. Instead of embedding a secret message into a multimedia object, as in traditional multimedia steganography, we process the entire secret message using an inverse Wiener filter to make it look like a multimedia object itself. Thereby we foil an eavesdropper who is primarily applying statistical tests to detect encrypted communication channels. We show that our approach can potentially give a covert channel capacity, which is an order of magnitude higher than traditional steganography.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"221 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134521466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Realtime object extraction and tracking with an active camera using image mosaics","authors":"Chia-Wen Lin, Chih-Ming Wang, Yao-Jen Chang, Yung-Chang Chen","doi":"10.1109/MMSP.2002.1203269","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203269","url":null,"abstract":"Moving object extraction plays a key role in applications such as object-based videoconference, surveillance, and so on. The difficulties of moving object segmentation lie in the fact that physical objects are normally not homogeneous with respect to low-level features and it's usually tough to segment them accurately and efficiently. Object segmentation based on prestored background information has proved to be effective and efficient in several applications such as videophone, video conferencing, and surveillance, etc. The previous works, however, were mainly concentrated on object segmentation with a static camera and in a stationary background. In this paper, we propose a robust and fast segmentation algorithm and a reliable tracking strategy without knowing the shape of the object in advance. The proposed system can real-time extract the foreground from the background and track the moving object with an active (pan-tilt) camera such that the moving object always stays around the center of images.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"30 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114019050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unequal error protection of embedded multimedia objects for packet-erasure channels","authors":"M. Madhavi, J. Fowler","doi":"10.1109/MMSP.2002.1203248","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203248","url":null,"abstract":"The application of forward-error-correcting codes to data organized as multiple, independent multimedia objects and encoded with modern embedded coders is investigated. Capitalizing on the strict importance-ordering characteristics of embedded encodings, the strength of the error protection is optimized such that is more important to the reconstructed quality of the dataset is assigned stronger protection. The focus of the investigation is on providing this optimization while maintaining the ability ton independently access the individual multimedia objects. Experimental results are presented for still-image objects that illustrate that the desired independent-access ability comes at a cost in reconstruction quality, and that this cost increases as the channel-loss conditions actually experienced degrade from those for which the optimal protection arrangement was designed.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114187297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rate-distortion optimized streaming from the edge of the network","authors":"Jacob Chakareski, P. Chou, B. Girod","doi":"10.1109/MMSP.2002.1203245","DOIUrl":"https://doi.org/10.1109/MMSP.2002.1203245","url":null,"abstract":"This paper addresses the problem of streaming packetized media over a lossy packet network through an intermediate proxy server to a client, in a rate-distortion optimized way. The proxy, located at the junction of the backbone network and the last hop to the client, coordinates the communication between the media server and the client using hybrid receiver/sender-driven streaming in a rate-distortion optimization framework. The framework enables the proxy to determine at every instant which packets, if any, it should either request from the media server or retransmit directly to the client, in order to meet a constraint on the average transmission rate while minimizing the average end-to-end distortion. Performance gains of up to 1.5 dB and up to 4 dB are observed over rate-distortion optimized sender-driven systems for the case when the last hop is wireline and wireless, respectively.","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126322916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}