{"title":"A study on content-based music classification","authors":"Yibin Zhang, Jie Zhou","doi":"10.1109/ISSPA.2003.1224828","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224828","url":null,"abstract":"Content-based music recognition can play an important role in human cognition research and multimedia applications. In this paper, we present a study on content-based music classification using short-time analysis techniques together with pattern recognition techniques to distinguish between five music styles. A database of 1027 audio signals in total (99 piano, 204 symphony, 304 popular song, 242 Beijing opera, and 178 Chinese comic dialogues) is used for the experiments, which is much larger than the databases used in previous works. A comparative evaluation of different short-time features in terms of their classification ability, as well as of different classifiers, is carried out on the database. The results show that harmonious degree is the most effective feature and the BPNNC is the best classifier. Some interesting results about different music styles are also reported.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125663215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wideband space-time communication with implicit channel feedback","authors":"G. Barriac, Upamanyu Madhow","doi":"10.1109/ISSPA.2003.1224681","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224681","url":null,"abstract":"We consider wideband communication (e.g., using OFDM) over a typical cellular \"downlink,\" in which the base station may have several antenna elements, while the mobile has 1 or 2 antenna elements. The following are our major findings: (a) Implicit channel feedback regarding the covariance matrix for the downlink space-time channel can be obtained, without any power or bandwidth overhead, by suitably averaging uplink channel measurements across frequency. This approach applies to both time division duplex (TDD) and frequency division duplex (FDD) systems. The covariance feedback can be used to obtain better performance on the downlink, at lower encoding and decoding complexity, compared to standard space-time coding. (b) The conventional design without channel feedback is to space the transmit antennas far enough apart so as to ensure uncorrelated responses. However, when implicit feedback is available, much better performance is obtained by significantly smaller antenna spacing, optimized such that the number of dominant eigenmodes of the channel matches the (small) number of receive antenna elements.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125685068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new hybrid long-term and short-term prediction algorithm for packet loss erasure over IP-networks","authors":"M. Elsabrouty, M. Bouchard, T. Aboulnasr","doi":"10.1109/ISSPA.2003.1224715","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224715","url":null,"abstract":"Packet loss is a common problem in Internet protocol (IP) networks. Delayed, misrouted or corrupted packets all introduce a gap in the information stream being transmitted. This gap is even more critical in the case of real time voice transmission that does not tolerate delay. The receiver in this case is obliged to generate a signal to play instead of the missing speech segment. This paper introduces a high performance speech concealment algorithm for PCM coded speech. The proposed algorithm implements a combination of linear prediction model and reverse order replicated pitch period (RORPP) implemented as in the ITU-T G.711. The new algorithm produced better objective MOS scores when compared to both the commercial tool of packet repetition and to the above mentioned ITU-T long term prediction standard.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121980914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sentence boundary detection in Czech TTS system using neural networks","authors":"J. Romportl, D. Tihelka, J. Matoušek","doi":"10.1109/ISSPA.2003.1224860","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224860","url":null,"abstract":"This paper presents results of applying a neural network to the problem of deciding whether a certain punctuation mark in Czech text is or is not the end of a sentence. It also discusses the possibility of using methods for extracting relevant parameters and compares a neural network based method with a Bayes classifier and a heuristic classifier.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121738407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural network signal analysis in immunology","authors":"Fabian J Theis, D. Hartl, S. Krauss‐Etschmann, E. Lang","doi":"10.1109/ISSPA.2003.1224857","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224857","url":null,"abstract":"This paper aims to investigate whether both supervised and unsupervised signal analysis contribute to the interpretation of immunological data. For this purpose a database was set up containing measured data from bronchoalveolar lavage fluid which was obtained from 37 children with pulmonary diseases. The children were dichotomized into two groups: 20 children suffered from chronic bronchitis whereas 17 children had an interstitial lung disease. A self-organizing map (SOM) was utilized to test higher-order correlations between cellular subsets and the patient groups. Furthermore, a supervised approach with a perceptron trained to the patients' diagnosis was applied. The SOM confirmed the results that were expected from previous statistical analyses and shed light on formerly not considered relationships. The supervised perceptron learning after principal component analysis for dimension reduction turned out to be highly successful by linearly separating the patients into two groups with different diagnoses. The simplicity of the perceptron made it easy to extract diagnosis rules, which were partly known already and can now readily be tested on larger data sets.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131993595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Billboard advertising detection in sport TV","authors":"G. Cai, Liming Chen, Junchang Li","doi":"10.1109/ISSPA.2003.1224759","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224759","url":null,"abstract":"Precise measurement of billboard advertising visibility is a key element for organizers and broadcasters in making their live sports broadcasts cost-effective. However, this activity is currently very labor- and time-intensive because it is done manually. In this paper we describe a technique for detection of commercial advertisement in sport TV. Based on some a priori knowledge of the sport field and commercial advertisement, our technique makes use of the fast Hough transform and the geometric features of text in order to extract advertisements from sport TV images. Our experiments show that our technique achieves an accuracy rate of more than 90%.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132031407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Linear fractional shift invariant (LFSI) systems","authors":"O. Akay","doi":"10.1109/ISSPA.2003.1224771","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224771","url":null,"abstract":"In this paper, we formulate continuous time linear fractional shift invariant (LFSI) systems that generalize the well-known linear time invariant (LTI) systems by means of an angle parameter φ. LTI systems are obtained as a special case of LFSI systems for φ = 0. LFSI systems belong to the large class of time-varying systems. Whereas LTI systems commute with time shifts, LFSI systems commute with fractional shifts defined on the time-frequency plane. Just as the conventional Fourier transform (FT) diagonalizes LTI systems, an LFSI system associated with angle φ is diagonalized by the fractional Fourier transform (FrFT) defined at the perpendicular angle φ + (π/2). We show that the eigenfunctions of an LFSI system at angle φ are linear FM (chirp) signals with a sweep rate of tan φ. Finally, we demonstrate via a simulation example that, in certain cases, LFSI systems can outperform LTI systems.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130170902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Single-carrier frequency-domain design of space-time block coding transmission over frequency-selective fading channels","authors":"Changjiang Xu, T. Le-Ngoc","doi":"10.1109/ISSPA.2003.1224691","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224691","url":null,"abstract":"A unified frequency-domain design scheme for single-carrier space-time block coding (STBC) transmission systems over frequency-selective fading channels is presented. Block transmission is used to mitigate the intersymbol interference (ISI) induced by multipath fading. The diversity performance of the presented scheme for zero-padding and cyclic prefix block transmissions is analyzed. It is shown that the zero-padding STBC transmission can achieve a diversity of order N_a (L+1), where N_a is the number of both transmit and receive antennas and L is the order of the channel transfer function, while the cyclic prefix STBC transmission only achieves a diversity of order N_g.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130458491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital IF decimation filters for 3G systems using pipeline/interleaving architecture","authors":"J. L. Tecpanecatl-Xihuitl, R. Aguilar-Ponce, M. Bayoumi, B. Zavidovique","doi":"10.1109/ISSPA.2003.1224880","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224880","url":null,"abstract":"This paper presents an efficient IF decimation filter architecture using the pipeline/interleaving (PI) technique, in which the number of multiplications is reduced by 50%. Decimation filters are important blocks in software radio terminals for processing different communications standards such as GSM, IS-95, and UMTS. These blocks are needed to process the I and Q components in the digital down-converter. The proposed architecture is evaluated in MATLAB. This evaluation shows that the proposed structures can be utilized in a multimode fashion. The frequency response of the decimation filter for each standard is analyzed, and the frequency response of the decimation filter using PI architectures is also evaluated. The new architecture saves 50% of the multiplications compared to the traditional implementation.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130465814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Non-Wiener effects in recursive least squares adaptation","authors":"A. Beex, J. Zeidler","doi":"10.1109/ISSPA.2003.1224947","DOIUrl":"https://doi.org/10.1109/ISSPA.2003.1224947","url":null,"abstract":"In a number of adaptive filtering applications, non-Wiener effects have been observed for the (normalized) least-mean-square algorithm. These effects can lead to performance improvements over the fixed Wiener filter with the same model structure, and are characterized by dynamic behavior of the adaptive filter weights. Here we investigate whether such non-Wiener effects can also occur in the recursive least squares algorithm, and under which circumstances. Examples show that non-Wiener effects can also occur with the recursive least squares algorithm, in particular when the exponential forgetting factor is small. The latter corresponds to a short memory depth, the need for which one generally associates with tracking of time-varying phenomena.","PeriodicalId":264814,"journal":{"name":"Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings.","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134252192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}