Int. J. Comput. Linguistics Chin. Lang. Process.最新文献_第4页

An HNM Based Scheme for Synthesizing Mandarin Syllable Signal 一种基于HNM的汉语音节信号合成方案

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-09-01 DOI: 10.30019/IJCLCLP.200809.0004

H. Gu, Yan-Zuo Zhou

引用次数: 7

Question Analysis and Answer Passage Retrieval for Opinion Question Answering Systems 意见问答系统的问题分析和答案段落检索

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-09-01 DOI: 10.30019/IJCLCLP.200809.0003

Lun-Wei Ku, Yu-Ting Liang, Hsin-Hsi Chen

{"title":"Question Analysis and Answer Passage Retrieval for Opinion Question Answering Systems","authors":"Lun-Wei Ku, Yu-Ting Liang, Hsin-Hsi Chen","doi":"10.30019/IJCLCLP.200809.0003","DOIUrl":"https://doi.org/10.30019/IJCLCLP.200809.0003","url":null,"abstract":"Question answering systems provide an elegant way for people to access an underlying knowledge base. However, people are interested in not only factual questions, but also opinions. This paper deals with question analysis and answer passage retrieval in opinion QA systems. For question analysis, six opinion question types are defined. A two-layered framework utilizing two question type classifiers is proposed. Algorithms for these two classifiers are described. The performance achieves 87.8% in general question classification and 92.5% in opinion question classification. The question focus is detected to form a query for the information retrieval system and the question polarity is detected to retain relevant sentences which have the same polarity as the question. For answer passage retrieval, three components are introduced. Relevant sentences retrieved are further identified as to whether the focus (Focus Detection) is in a scope of opinion (Opinion Scope Identification) or not, and, if yes, whether the polarity of the scope and the polarity of the question (Polarity Detection) match with each other. The best model achieves an F-measure of 40.59% by adopting partial match for relevance detection at the level of meaningful unit. With relevance issues removed, the F-measure of the best model boosts up to 84.96%.","PeriodicalId":436300,"journal":{"name":"Int. J. Comput. Linguistics Chin. Lang. Process.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125104025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 23

A Language Information Retrieval Approach to Writing Assistance 一种辅助写作的语言信息检索方法

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-09-01 DOI: 10.30019/IJCLCLP.200809.0002

Jyishane Liu, Pei-Chun Hung, Ching-Ying Lee

引用次数: 2

A Thesaurus-Based Semantic Classification of English Collocations 基于同义词典的英语搭配语义分类

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-09-01 DOI: 10.30019/IJCLCLP.200909.0002

Chung-Chi Huang, Kate H. Kao, Chiung-Hui Tseng, Jason J. S. Chang

{"title":"A Thesaurus-Based Semantic Classification of English Collocations","authors":"Chung-Chi Huang, Kate H. Kao, Chiung-Hui Tseng, Jason J. S. Chang","doi":"10.30019/IJCLCLP.200909.0002","DOIUrl":"https://doi.org/10.30019/IJCLCLP.200909.0002","url":null,"abstract":"Researchers have developed many computational tools aimed at extracting collocations for both second language learners and lexicographers. Unfortunately, the tremendously large number of collocates returned by these tools usually overwhelms language learners. In this paper, we introduce a thesaurus-based semantic classification model that automatically learns semantic relations for classifying adjective-noun (A-N) and verb-noun (V-N) collocations into different thesaurus categories. Our model is based on iterative random walking over a weighted graph derived from an integrated knowledge source of word senses in WordNet and semantic categories of a thesaurus for collocation classification. We conduct an experiment on a set of collocations whose collocates involve varying levels of abstractness in the collocation usage box of Macmillan English Dictionary. Experimental evaluation with a collection of 150 multiple-choice questions commonly used as a similarity benchmark in the TOEFL synonym test shows that a thesaurus structure is successfully imposed to help enhance collocation production for L2 learners. As a result, our methodology may improve the effectiveness of state-of-the-art collocation reference tools concerning the aspects of language understanding and learning, as well as lexicography.","PeriodicalId":436300,"journal":{"name":"Int. J. Comput. Linguistics Chin. Lang. Process.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131163525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

A Cross-Linguistic Study of Voice Onset Time in Stop Consonant Productions 停顿辅音发声时间的跨语言研究

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-06-01 DOI: 10.30019/IJCLCLP.200806.0005

K. Chao, Li-Mei Chen

引用次数: 56

Multiple Document Summarization Using Principal Component Analysis Incorporating Semantic Vector Space Model 结合语义向量空间模型的主成分分析多文档摘要

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-06-01 DOI: 10.30019/IJCLCLP.200806.0001

O. Vikas, A. Meshram, Girraj Meena, Amit Gupta

引用次数: 15

The Effects of Formal Schema on Reading Comprehension¡XAn Experiment with Chinese EFL Readers 形式图式对阅读理解的影响——对中国英语读者的实验

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-06-01 DOI: 10.30019/IJCLCLP.200806.0004

Xiaoyan Zhang

引用次数: 42

A Study on Consistency Checking Method of Part-Of-Speech Tagging for Chinese Corpora 汉语语料库词性标注一致性检验方法研究

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-06-01 DOI: 10.30019/IJCLCLP.200806.0002

Hu Zhang, Jia-heng Zheng

引用次数: 1

Exploring Shallow Answer Ranking Features in Cross-Lingual and Monolingual Factoid Question Answering 探讨跨语言和单语言伪题回答的浅答案排序特征

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2008-03-01 DOI: 10.30019/IJCLCLP.200803.0001

Cheng-Wei Lee, Yi-Hsun Lee, W. Hsu

{"title":"Exploring Shallow Answer Ranking Features in Cross-Lingual and Monolingual Factoid Question Answering","authors":"Cheng-Wei Lee, Yi-Hsun Lee, W. Hsu","doi":"10.30019/IJCLCLP.200803.0001","DOIUrl":"https://doi.org/10.30019/IJCLCLP.200803.0001","url":null,"abstract":"Answer ranking is critical to a QA (Question Answering) system because it determines the final system performance. In this paper, we explore the behavior of shallow ranking features under different conditions. The features are easy to implement and are also suitable when complex NLP techniques or resources are not available for monolingual or cross-lingual tasks. We analyze six shallow ranking features, namely, SCO-QAT, keyword overlap, density, IR score, mutual information score, and answer frequency. SCO-QAT (Sum of Co-occurrence of Question and Answer Terms) is a new feature proposed by us that performed well in NTCIR CLQA. It is a co-occurrence based feature that does not need extra knowledge, word-ignoring heuristic rules, or special tools. Instead, for the whole corpus, SCO-QAT calculates co-occurrence scores based solely on the passage retrieval results. Our experiments show that there is no perfect shallow ranking feature for every condition. SCO-QAT performs the best in C-C (Chinese-Chinese) QA, but it is not a good choice in E-C (English-Chinese) QA. Overall, Frequency is the best choice for E-C QA, but its performance is impaired when translation noise is present. We also found that passage depth has little impact on shallow ranking features, and that a proper answer filter with fined-grained answer types is important for E-C QA. We measured the performance of answer ranking in terms of a newly proposed metric EAA (Expected Answer Accuracy) to cope with cases of answers that have the same score after ranking.","PeriodicalId":436300,"journal":{"name":"Int. J. Comput. Linguistics Chin. Lang. Process.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122382394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification 基于声源和声道互补特征的说话人识别

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 2007-09-01 DOI: 10.30019/IJCLCLP.200709.0004

Nengheng Zheng, Tan Lee, Ning Wang, P. Ching

{"title":"Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification","authors":"Nengheng Zheng, Tan Lee, Ning Wang, P. Ching","doi":"10.30019/IJCLCLP.200709.0004","DOIUrl":"https://doi.org/10.30019/IJCLCLP.200709.0004","url":null,"abstract":"This paper describes a speaker identification system that uses complementary acoustic features derived from the vocal source excitation and the vocal tract system. Conventional speaker recognition systems typically adopt the cepstral coefficients, e.g., Mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC), as the representative features. The cepstral features aim at characterizing the formant structure of the vocal tract system. This study proposes a new feature set, named the wavelet octave coefficients of residues (WOCOR), to characterize the vocal source excitation signal. WOCOR is derived by wavelet transformation of the linear predictive (LP) residual signal and is capable of capturing the spectro-temporal properties of vocal source excitation. WOCOR and MFCC contain complementary information for speaker recognition since they characterize two physiologically distinct components of speech production. The complementary contributions of MFCC and WOCOR in speaker identification are investigated. A confidence measure based score-level fusion technique is proposed to take full advantage of these two complementary features for speaker identification. Experiments show that an identification system using both MFCC and WOCOR significantly outperforms one using MFCC only. In comparison with the identification error rate of 6.8% obtained with MFCC-based system, an error rate of 4.1% is obtained with the proposed confidence measure based integrating system.","PeriodicalId":436300,"journal":{"name":"Int. J. Comput. Linguistics Chin. Lang. Process.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130093205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2