2004 International Symposium on Chinese Spoken Language Processing最新文献_第5页

Chinese large-vocabulary name recognition system using character description and syllable spelling recognition 基于汉字描述和音节拼写识别的汉语大词汇名识别系统

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409575

N. J. Wang, Ching-Ho Tsai, Patrick Huang, Jia-Lin Shen

引用次数: 6

High quality harmonic excitation linear predictive speech coding at 2 kb/s 高质量谐波激励线性预测语音编码在2kb /s

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409608

C. Bao, J. Lukasiak, C. Ritz

引用次数: 0

A system for Mandarin short phrase recognition on portable devices 基于便携式设备的汉语短短语识别系统

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409628

Chao Xu, Yi Y. Liu, Yongsheng Yang, Pascale Fung, Z. Cao

引用次数: 3

An improved 4 kbit/s CELP speech coding algorithm 一种改进的4kbit /s CELP语音编码算法

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409609

Yanning Bai, C. Bao

引用次数: 0

Grapheme-to-phoneme conversion in Chinese TTS system 汉语TTS系统中的字素-音素转换

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409612

Honghui Dong, J. Tao, Bo Xu

引用次数: 11

On analysis of eigenpitch in Mandarin Chinese 汉语普通话特征音分析

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409593

Jilei Tian, J. Nurminen

引用次数: 12

The disambiguation strategies of semantic analysis in Chinese spoken dialogue system 汉语口语对话系统语义分析中的消歧策略

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409618

Bei Liu, Limin Du

引用次数: 0

A superposed prosodic model for Chinese text-to-speech synthesis 中文文本-语音合成的叠加韵律模型

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409615

G. Chen, G. Bailly, Qingfeng Liu, Ren-Hua Wang

引用次数: 25

Integrating tonal information into Mandarin name recognition with different strategies 用不同的策略将声调信息整合到汉语人名识别中

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409637

Dongsheng Luo, Xiang Xie, Jingming Kuang

引用次数: 0

An initial prototype system for Chinese spoken document understanding and organization for indexing/browsing and retrieval applications 一个用于中文口语文档理解和组织索引/浏览和检索应用程序的初始原型系统

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI: 10.1109/CHINSL.2004.1409653

Lin-Shan Lee, Shun-Chuan Chen, Yuan Ho, Jia-fu Chen, Ming Li, T. Li

{"title":"An initial prototype system for Chinese spoken document understanding and organization for indexing/browsing and retrieval applications","authors":"Lin-Shan Lee, Shun-Chuan Chen, Yuan Ho, Jia-fu Chen, Ming Li, T. Li","doi":"10.1109/CHINSL.2004.1409653","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409653","url":null,"abstract":"The most attractive form of future network content will be multimedia. When voice information is included, it usually carries core concepts for the content. Thus, a spoken document associated with multimedia content can very possibly serve as the key for indexing/browsing and retrieval. However, unlike written documents, multimedia or voice information is very often just audio/video signals. They are very difficult to index, browse or retrieve, since users cannot go through each of them from the beginning to the end during browsing. A possible approach may be to segment the audio/video signals automatically into short paragraphs, each with a central concept or topic, and then automatically generate a title and/or a summary for each of these, in either speech or text form. The topics and central concepts described in the segmented short paragraphs may then be further analyzed and organized into graphic structures describing the relationships among these topics and central concepts. Hence, the multimedia content can be automatically indexed much more efficiently and browsed and retrieved by the user based on the title, summary and graphic structure. We refer to this as the understanding and organization of spoken documents. An initial prototype system for such functions, with broadcast news taken as the example multimedia content, is presented. The graphic structure used to describe the relationships among the topics and central concepts are 2-dimensional tree structures developed based on probabilistic latent semantic analysis.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115905254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2