{"title":"Rhythm complexity measures for music pattern recognition","authors":"I. Shmulevich, D. Povel","doi":"10.1109/MMSP.1998.738930","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738930","url":null,"abstract":"Three measures of rhythm complexity are considered. It is suggested that these measures be used in a system for machine recognition of music patterns as determinants of relative weights assigned to pitch and rhythm errors. The three measures are characterized and a procedure for determining parameters of one of the measures is described.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134048981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A non-linear model transformation for ML stochastic matching in additive noise","authors":"S. Wong, Bertram E. Shi","doi":"10.1109/MMSP.1998.738926","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738926","url":null,"abstract":"We present a non-linear model transformation for adapting Gaussian mixture HMMs using both static and dynamic MFCC observation vectors to the presence of additive noise. This transformation depends upon a few compensation coefficients which can be estimated from a short training token of noise. Alternatively, one can also apply maximum-likelihood stochastic matching to estimate the compensation coefficients from speech embedded in noise. This can eliminate the need for segmentation of pure noise from speech for the estimation and can also compensate for inaccuracies in the estimation of the compensation coefficients as well as those due to the approximations used in deriving the transformation.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132114816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Constrained optimization of filter banks in subband image coding","authors":"Tao Wang, B. Wah","doi":"10.1109/MMSP.1998.738980","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738980","url":null,"abstract":"The design of filter banks in subband image coding is critical for achieving high image quality. In this paper, we study the design from both the signal processing domain and the theory of wavelets. We formulate the design of filter banks as a two-stage nonlinear constrained optimization problem, each of which is solved by sequential quadratic programming (SQP). Using a wavelet image coding prototype, we show improved quality of the designed filter banks in terms of image compression and peak signal-to-noise ratios (PSNRs).","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134183579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An extension of Macromedia DIRECTOR for multimedia virtual visits","authors":"R. Barneva, G. Cortelazzo","doi":"10.1109/MMSP.1998.738931","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738931","url":null,"abstract":"An extension of the well-known authoring tool Macromedia DIRECTOR is presented. It is designed to facilitate the development of multimedia virtual visits to sites such as museums, art expositions, industrial and architectural exhibitions. The extension is written in the internal DIRECTOR language Lingo. It has been used to develop the virtual visits \"Da Padovanino a Tiepolo\" and \"EPOC\", and has given excellent results.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114839898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unified framework of source-channel-modulation coding in low power multimedia wireless communications","authors":"T. Lan, A. Tewfik","doi":"10.1109/MMSP.1998.739046","DOIUrl":"https://doi.org/10.1109/MMSP.1998.739046","url":null,"abstract":"We study the problem of jointly optimizing source/channel/modulation coding to achieve a given quality of service while minimizing the total power consumption of mobiles. This paper emphasizes the design of a robust transmission framework in the context of a total power minimization scheme. We design a simple but efficient transmission scheme which is called multistage coded modulation that combines source/channel (S/C) coding and multirate modulation (MM). We then integrate into this scheme a complexity-scalable video coder to achieve the best trade-offs in multimedia wireless communications between total power, quality of service, and capacity. Preliminary results show superior video frame quality (around 3 dB better than the simple S/C coding scheme) of the proposed system in the severe channel condition, with total power consumption kept to a minimum.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117269143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real-time spontaneous interaction system with narratives","authors":"R. Nakatsu, N. Tosa","doi":"10.1109/MMSP.1998.738942","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738942","url":null,"abstract":"Real-time spontaneous interaction is the basis of our active behavior such as communication. On the other hand, narratives in movies or novels give us the opportunity to experience dramatic events which are not encountered in our daily lives. The integration of these factors is expected to provide us with a new type of experience. In this paper, by integrating movies and interaction technologies, we propose the concept of \"interactive movies\". We first explain the concept of interactive movies and describe a prototype system we have developed. We then describe the construction of a second system, which we are currently developing, as well as several improvements in the system.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124520659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"System modeling and implementation of a generic video codec","authors":"Jong-Il Kim, B. Evans","doi":"10.1109/MMSP.1998.738952","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738952","url":null,"abstract":"The rapidly emerging and increasing complexity of video coding standards require a new design paradigm. This paper describes a modular, scalable, extensible simulation and design methodology for system-level design of video codecs. Video codec dataflow is modeled by synchronous dataflow, and implemented in a heterogeneous CAD framework. As a result, generic video codec is decomposed into basic modules. Each module is easy to interface and extend. Any video codec standard (e.g., H.263+ or MPEG-4) can be mapped on that basis and be retargeted for various architectures including DSPs and ASIPs. We develop module libraries for video codecs which can be dynamically linked to an extensible framework for simulation and algorithm development. Some parts of the basic modules can be mixed with other domain modules which may have different interaction semantics especially for hardware design and retargeting purposes. We based our framework on the Ptolemy software environment.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124913034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mouth motion learning and generating from observation","authors":"P. Hong, Thomas S. Huang, X. Lin","doi":"10.1109/MMSP.1998.738971","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738971","url":null,"abstract":"This paper presents a system for analyzing and generating human mouth motion. We apply model-based tracking to a set of typical mouth image sequences and obtain model motion sequences, which are used to build the mouth motion space by applying principal component analysis (PCA). Given an abstract description of the mouth motion in the mouth motion space, our system can generate a new mouth motion image sequence.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123574961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image coding ringing artifact reduction using morphological post-filtering","authors":"S. H. Oguz, Y. Hu, Truong Q. Nguyen","doi":"10.1109/MMSP.1998.739051","DOIUrl":"https://doi.org/10.1109/MMSP.1998.739051","url":null,"abstract":"Ringing is an annoying artifact frequently encountered in low bit-rate transform and subband decomposition based compression of different media such as image, intra frame video and graphics. A mathematical morphology based post-processing algorithm is presented in this paper for image ringing artifact suppression. First, we use binary morphological operators to isolate the regions of an image where the ringing artifact is most prominent to the human visual system (HVS) while preserving genuine edges and other (high-frequency) fine details present in the image. Then, a gray-level morphological nonlinear smoothing filter is applied to the unmasked regions of the image under the filtering mask to eliminate ringing within this constraint region. To gauge the effectiveness of this approach, we propose an HVS compatible objective measure of the ringing artifact. Preliminary simulations indicate that the proposed method is capable of significantly reducing the ringing artifact on both subjective and objective basis.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129213720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
I. Moccagatta, S. Regunathan, O. Al-Shaykh, Homer H. Chen
{"title":"Robust image compression with packetization: the MPEG-4 still texture case","authors":"I. Moccagatta, S. Regunathan, O. Al-Shaykh, Homer H. Chen","doi":"10.1109/MMSP.1998.738995","DOIUrl":"https://doi.org/10.1109/MMSP.1998.738995","url":null,"abstract":"In this paper, we propose a bit-stream packetization approach to make the MPEG-4 still texture bit-stream robust to channel degradation. Our approach does not affect the spatial/quality scalability features of the still texture algorithm, and it has limited effect on coding efficiency. Also, this packetization approach requires minimal changes to the syntax and a small overhead. Finally, the proposed technique is shown to provide good error robustness over a wide range of error conditions.","PeriodicalId":180426,"journal":{"name":"1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124587272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}