2013 IEEE International Conference on Acoustics, Speech and Signal Processing: Latest Papers, Page 5

A dynamic system model of time-varying subjective quality of video streams over HTTP

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6638329

Chao Chen, L. Choi, G. Veciana, C. Caramanis, R. Heath, A. Bovik

Citations: 31

N-gram analysis for sleeping cell detection in LTE networks

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6638499

Fedor Chernogorov, T. Ristaniemi, Kimmo Brigatti, Sergey Chernov

Citations: 21

Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6639239

Cheng-Tao Chung, Chun-an Chan, Lin-Shan Lee

Abstract: Techniques for unsupervised discovery of acoustic patterns are increasingly attractive, because huge quantities of speech data are becoming available while manual annotations remain hard to acquire. In this paper, we propose an approach for unsupervised discovery of linguistic structure for the target spoken language given raw speech data. This linguistic structure includes two-level (subword-like and word-like) acoustic patterns, the lexicon of word-like patterns in terms of subword-like patterns, and the N-gram language model based on word-like patterns. All patterns, models, and parameters can be automatically learned from the unlabelled speech corpus. This is achieved by an initialization step followed by three cascaded stages for acoustic, linguistic, and lexical iterative optimization. The lexicon of word-like patterns defines the allowed consecutive sequences of HMMs for subword-like patterns. In each iteration, model training and decoding produce updated labels from which the lexicon and HMMs can be further updated. In this way, model parameters and decoded labels are respectively optimized in each iteration, and the knowledge about the linguistic structure is learned gradually, layer after layer. The proposed approach was tested in preliminary experiments on a corpus of Mandarin broadcast news, including a task of spoken term detection with performance compared to a parallel test using models trained in a supervised way. Results show that the proposed system not only yields reasonable performance on its own, but is also complementary to existing large vocabulary ASR systems.

Citations: 28

SSIM-based adaptive quantization in HEVC

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6637940

Chuohao Yeo, Hui Li Tan, Y. H. Tan

Citations: 24

Prediction of creaky voice from contextual factors

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6639216

Thomas Drugman, John Kane, T. Raitio, C. Gobl

Citations: 10

UCS-NT: An unbiased compressive sensing framework for Network Tomography

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6638518

H. Mahyar, H. Rabiee, Z. S. Hashemifar

Citations: 15

Transient modeling for overlap-add sinusoidal model of speech

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6639261

Slava Shechtman

Citations: 2

Distributed multi-hypothesis coding of depth maps using texture motion information and optical flow

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6637939

Matteo Salmistraro, M. Zamarin, L. L. Rakêt, Søren Forchhammer

Citations: 13

Weighted sum rate maximization for cognitive MISO broadcast channel: Large system analysis

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6638587

Y. He, S. Dey

Abstract: This paper considers the ergodic weighted sum rate (WSR) maximization problem for an underlay cognitive radio MISO broadcast channel, where a secondary network, consisting of a base-station with M transmit antennas and K single-antenna secondary users (SUs), is allowed to share the same spectrum with a primary user (PU), under an average transmit sum power (ATTP) constraint Pav and an average interference power (AIP) constraint on the PU. We show that the ATTP constraint is always active, and that as Pav → ∞, the ergodic WSR approaches infinity, as in the conventional non-CR network case. A low-complexity suboptimal beamforming scheme with a closed-form beamformer, called partially-projected regularized zero-forcing beamforming (PP-RZFBF), is proposed. Due to the non-convexity of the PP-RZFBF scheme, a large system analysis is conducted in the limit as M and K approach infinity with a fixed finite ratio r = K/M. We derive deterministic limiting approximations for the PP-RZFBF problem, which enable us to determine asymptotically optimal beamformers for PP-RZFBF. Numerical simulations illustrate that the asymptotically optimal beamformers turn out to be quite effective even for small M and K.

Citations: 4

Low-complexity and high-performance non-coherent cell identification detection schemes for OFDM-based systems

2013 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2013-05-26 DOI: 10.1109/ICASSP.2013.6638596

Ying-Tsung Lin, Yi-Hsiang Wang, Sau-Gee Chen, Chih-Liang Chen

Citations: 0