Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific: Latest Papers, Page 2

An event-related brain potential study on the impact of speech recognition errors

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041620

S. Sakti, Y. Odagaki, Takafumi Sasakura, Graham Neubig, T. Toda, Satoshi Nakamura

Abstract: Most automatic speech recognition (ASR) systems, which aim for perfect transcription of utterances, are trained and tuned by minimizing the word error rate (WER). In this framework, even though the impact of all errors is not the same, all errors (substitutions, deletions, insertions) from any words are treated in a uniform manner. The size of the impact and exactly what the differences are remain unknown. Several studies have proposed possible alternatives to the WER metric. But no analysis has investigated how the human brain processes language and perceives the effect of mistaken output by ASR systems. In this research we utilize event-related brain potential (ERP) studies and directly analyze the brain activities on the impact of ASR errors. Our results reveal that the peak amplitudes of the positive shift after the substitution and deletion violations are much bigger than the insertion violations. This finding indicates that humans perceived each error differently based on its impact of the whole sentence. To investigate the effect of this study, we formulated a new weighted word error rate metric based on the ERP results: ERP-WWER. We re-evaluated the ASR performance using the new ERP-WWER metric and compared and discussed the results with the standard WER.

Citations: 2
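The abstract above contrasts the standard WER, which counts substitutions, deletions, and insertions equally, with a metric that weights each error type. The following is a minimal sketch of that idea, not the paper's ERP-WWER: the listing does not give the actual formula or weights, so the function names, the weighted form `(w_sub*S + w_del*D + w_ins*I) / N`, and the example weights are all assumptions made for illustration.

```python
def align_counts(ref, hyp):
    """Count (substitutions, deletions, insertions) on one minimum-edit alignment."""
    n, m = len(ref), len(hyp)
    # cell[i][j] = (total_edits, S, D, I) for ref[:i] aligned against hyp[:j]
    cell = [[None] * (m + 1) for _ in range(n + 1)]
    cell[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cell[i][0] = (i, 0, i, 0)          # only deletions
    for j in range(1, m + 1):
        cell[0][j] = (j, 0, 0, j)          # only insertions
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            e, s, d, ins = cell[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diag = (e, s, d, ins)      # match, no cost
            else:
                diag = (e + 1, s + 1, d, ins)
            e, s, d, ins = cell[i - 1][j]
            up = (e + 1, s, d + 1, ins)    # reference word deleted
            e, s, d, ins = cell[i][j - 1]
            left = (e + 1, s, d, ins + 1)  # extra word inserted
            cell[i][j] = min(diag, up, left, key=lambda t: t[0])
    _, s, d, ins = cell[n][m]
    return s, d, ins


def wer(ref, hyp):
    """Standard word error rate: every error type costs 1."""
    return weighted_wer(ref, hyp, 1.0, 1.0, 1.0)


def weighted_wer(ref, hyp, w_sub, w_del, w_ins):
    """Assumed weighted form; the weights are hypothetical, not the paper's."""
    if not ref:
        raise ValueError("reference must contain at least one word")
    s, d, ins = align_counts(ref, hyp)
    return (w_sub * s + w_del * d + w_ins * ins) / len(ref)


if __name__ == "__main__":
    ref = "a b c".split()
    hyp = "a b x c".split()                # one inserted word
    print(wer(ref, hyp))
    print(weighted_wer(ref, hyp, 1.0, 1.0, 0.5))
```

With an insertion weight below 1, a hypothesis whose only error is an extra word scores better than under plain WER, which matches the abstract's observation that insertions produced a smaller response than substitutions and deletions.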

A pilot user's prospective in mobile robotic telepresence system

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041635

Muhammad Sikandar Lal Khan, S. Réhman, P. L. Hera, Feng Liu, Haibo Li

Abstract: In this work we present an interactive video conferencing system specifically designed for enhancing the experience of video teleconferencing for a pilot user. We have used an Embodied Telepresence System (ETS) which was previously designed to enhance the experience of video teleconferencing for the collaborators. In this work we have deployed an ETS in a novel scenario to improve the experience of pilot user during distance communication. The ETS is used to adjust the view of the pilot user at the distance location (e.g. distance located conference/meeting). The velocity profile control for the ETS is developed which is implicitly controlled by the head of the pilot user. The experiment was conducted to test whether the view adjustment capability of an ETS increases the collaboration experience of video conferencing for the pilot user or not. The user study was conducted in which participants (pilot users) performed interaction using ETS and with traditional computer based video conferencing tool. Overall, the user study suggests the effectiveness of our approach and hence results in enhancing the experience of video conferencing for the pilot user.

Citations: 7

Iterative depth recovery for multi-view video synthesis from stereo videos

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041695

Chen-Hao Wei, Chen-Kuo Chiang, S. Lai

Abstract: We propose a novel depth maps refinement algorithm and generate multi-view video sequences from two-view video sequences for modern autostereoscopic display. In order to generate realistic contents for virtual views, high-quality depth maps are very critical to the view synthesis results. We propose an iterative depth refinement approach of a joint error detection and correction algorithm to refine the depth maps that can be estimated by an existing stereo matching method or provided by a depth capturing device. Error detection aims at two types of error: across-view color-depth-inconsistency error and local color-depth-inconsistency error. Subsequently, the detected error pixels are corrected by searching appropriate candidates under several constraints to amend the depth errors. A trilateral filter is included in the refining process that considers intensity, spatial and temporal terms into the filter weighting to enhance the consistency across frames. In the proposed view synthesis framework, it features a disparity-based view interpolation method to alleviate the translucent artifacts and a directional filter to reduce the aliasing around the object boundaries. Experimental results show that the proposed algorithm effectively fixes errors in the depth maps. In addition, we also show the refined depth maps along with the proposed view synthesis framework significantly improve the novel view synthesis on several benchmark datasets.

Citations: 5
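The abstract above mentions a trilateral filter whose weighting combines intensity, spatial, and temporal terms. A rough sketch of such a weight follows, assuming a product of three Gaussian kernels, which is a common construction but is not confirmed by the listing as the authors' choice. The function names, the sample layout, and the sigma defaults are hypothetical.

```python
import math


def trilateral_weight(d_space, d_intensity, d_time,
                      sigma_s=3.0, sigma_i=10.0, sigma_t=1.0):
    """Product of Gaussian kernels over spatial, intensity, and temporal
    differences (assumed form; sigma values are illustrative only)."""
    w_s = math.exp(-(d_space ** 2) / (2.0 * sigma_s ** 2))
    w_i = math.exp(-(d_intensity ** 2) / (2.0 * sigma_i ** 2))
    w_t = math.exp(-(d_time ** 2) / (2.0 * sigma_t ** 2))
    return w_s * w_i * w_t


def filter_depth(samples, **sigmas):
    """Weighted average of neighbouring depth values.

    samples: iterable of (depth, d_space, d_intensity, d_time) tuples, where
    the three differences are measured relative to the pixel being refined.
    """
    num = 0.0
    den = 0.0
    for depth, d_space, d_intensity, d_time in samples:
        w = trilateral_weight(d_space, d_intensity, d_time, **sigmas)
        num += w * depth
        den += w
    if den == 0.0:
        raise ValueError("all weights underflowed to zero")
    return num / den


if __name__ == "__main__":
    neighbours = [
        (10.0, 0.0, 0.0, 0.0),    # the pixel itself in the current frame
        (10.5, 1.0, 2.0, 0.0),    # adjacent pixel, similar intensity
        (30.0, 1.0, 80.0, 0.0),   # adjacent pixel across an intensity edge
        (10.2, 0.0, 1.0, 1.0),    # same pixel in the previous frame
    ]
    print(filter_depth(neighbours))
```

In this sketch the neighbour across the intensity edge receives a weight close to zero, so the filtered depth stays near the values on the pixel's own side of the edge, while the previous-frame sample still contributes through the temporal term.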

Analytical prediction formula of random variation in high frequency performance of weak inversion scaled MOSFET

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041810

R. Banchuin, R. Chaisricharoen

Citations: 0

Compressed video quality assessment with modified MSE

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041643

Sudeng Hu, Lina Jin, C.-C. Jay Kuo

Citations: 2

3D object modeling with a Kinect camera

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041821

Mayoore S. Jaiswal, Jun Xie, Ming-Ting Sun

Citations: 9

Detecting contrast agents in ultrasound image sequences for tumor diagnosis

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041598

K. Noro, Koichi Ito, Yukari Yanagisawa, M. Sakamoto, S. Mori, K. Shiga, T. Kodama, T. Aoki

Citations: 0

Block-based multiscale error concealment using low-rank completion

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041587

Mading Li, Jiaying Liu, Chong Ruan, Lu Liu, Zongming Guo

Citations: 0

Progressive audio scrambling via complete binary tree's traversal and wavelet transform

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041525

Twe Ta Oo, T. Onoye

Citations: 2

Improved channel estimation for ISDB-T using Modified Orthogonal Matching Pursuit over fractional delay TU6 channel

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041655

Ryan Paderna, T. Higashino, M. Okada

Citations: 4