{"title":"Proofreader of Mongolian Vocabulary Based on Language Model of Syllabic Statistics","authors":"Ochir, Menghjiya, Gong Zheng","doi":"10.1109/IALP.2009.52","DOIUrl":"https://doi.org/10.1109/IALP.2009.52","url":null,"abstract":"Mongolian script proofreading is delicate but monotonous work, which is for long done by hand. This paper is attempted to build up a Mongolian text proofreading system based on language model of syllabic statistics by decomposing a Mongolian word into syllables before putting them into the model. In the light of lexical features of Mongolian, this paper ushers some exploratory work in automatic proofreading of Mongolian vocabulary","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116147025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Kappa Measurement of Query Consistency and Its Application","authors":"Chao Song, Muyun Yang, Haoliang Qi, Sheng Li","doi":"10.1109/IALP.2009.70","DOIUrl":"https://doi.org/10.1109/IALP.2009.70","url":null,"abstract":"To measure the diversity of the user interests over the same query, this paper applies kappa coefficient as the indicator of the consistency of users’ clicks for a given query. It compares three different settings of the Kappa parameters and shows the Kappa formula can be well adapted to the Query log analysis. Based on the further analysis of the Kappa results over Sogou Query LOG, it is revealed that the user interests over the same query are rather diversified, which indicates that the personalized information retrieval is a promising solution to improve the current information retrieval performance.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122051523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Wavelet-Based Denoising System Using Time-Frequency Adaptation for Speech Enhancement","authors":"Kun-Ching Wang, Chuin-Li Chin, Y. Tsai","doi":"10.1109/IALP.2009.32","DOIUrl":"https://doi.org/10.1109/IALP.2009.32","url":null,"abstract":"In this paper, we propose a novel wavelet denoising system using time-frequency adaptation for providing speech enhancement robustness to non-stationary and colored noise. Different from the conventional methods in threshold choosing, e.g. invariant threshold and time-variant threshold, the proposed wavelet coefficient threshold (WCT) is adapted by both time and frequency information. In order to further improve the intelligibility of the processed speech signal, we apply appropriate wavelet thresholding according to voiced/unvoiced decision. Simulation results showed that the proposed system is capable of reducing noise with little speech degradation and the overall performance is superior to several competitive methods in both objective and subjective evaluations.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125875217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Automatic Analysis of Chunk-Extension Sentences","authors":"Xiangfeng Wei, Quan Zhang","doi":"10.1109/IALP.2009.45","DOIUrl":"https://doi.org/10.1109/IALP.2009.45","url":null,"abstract":"The chunk-extension sentence is a kind of sentence, which is defined with specific conceptual knowledge in the theory of Hierarchical Network of Concepts. This paper introduces some semantic categories of chunk-extension sentences and related typical verbs. By these typical verbs, the eigen semantic chunk and semantic category of a chunk-extension sentence can be activated. The semantic structure of a sentence can be revealed after testing with the transcendent conceptual knowledge of chunk-extension semantic categories. An experiment was done for automatically analyzing chunk-extension sentences based on a system of analyzing the semantic categories of sentences. The precision of the experiment is 71.29%. The errors are mainly caused when recognizing the head verb and selecting the right semantic category from the multi-semantic-category of a verb. The well analyzing result of chunk-extension sentences will be helpful to tackle the multi-verb puzzle in processing Chinese sentences.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127954419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on the Recognition of fKs Based on HNC Theory","authors":"HanFen Zang, Xiangfeng Wei, Quan Zhang","doi":"10.1109/IALP.2009.43","DOIUrl":"https://doi.org/10.1109/IALP.2009.43","url":null,"abstract":"Full-automatic semantic analysis is at all times one of the main targets in natural language processing. Many researchers made great efforts in semantic analyzing or tagging to reach such target. There have been some good results in shallow semantic analysis such as recognizing syntax phrase chunks and labeling semantic roles. This paper introduces the supplementary semantic chunk (in short, fK) in the theory of Hierarchical Network of Concepts (HNC). Based on the characteristic boundary symbols and concepts of fKs, we sum up some operational rules for computer to recognize the fKs in Chinese sentences. These rules are quite effective when recognizing the fKs in an experiment.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131595431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Acoustic Comparison of Vowel Length Contrasts in Standard Arabic, Japanese and Thai","authors":"K. Tsukada","doi":"10.1109/IALP.2009.25","DOIUrl":"https://doi.org/10.1109/IALP.2009.25","url":null,"abstract":"In our earlier perception study, we observed that familiarity with first language (L1) phonemic length contrasts in Japanese does not transfer optimally into an unknown language, Arabic. We hypothesized that this finding is related to cross-language differences in how vowel length contrasts are phonetically realized. The present study compares vowel length contrasts that are phonemic in three typologically unrelated languages, i.e., Standard Arabic, Japanese and Thai, in an attempt to understand the extent to which vowel length contrasts are similar or dissimilar in these languages. Acoustic measurements showed short and long categories were clearly differentiated in all three languages and the short-to-long ratio did not substantially differ across languages. This suggests that listeners attend to more than just acoustic vowel duration in making perceptual judgments on short vs. long vowels.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126910959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring Syntactic Features for Pronoun Resolution Using Context-Sensitive Convolution Tree Kernel","authors":"Fang Kong, Yancui Li, Guodong Zhou, Qiaoming Zhu","doi":"10.1109/IALP.2009.49","DOIUrl":"https://doi.org/10.1109/IALP.2009.49","url":null,"abstract":"This paper proposes to use a convolution kernel over parse tree to model syntactic structure information for pronoun resolution. Our study reveals that the syntactic structure features embedded in a parse tree are very effective for pronoun resolution and these features can be well captured by the context-sensitive convolution tree kernel. Evaluation on the ACE 2003 corpus shows that among all structured syntactic feature space, Shortest Path Tree achieves the best performance. Then we incorporate more features into SPT, result shows that SPT can use successfully with normal features. Finally, we compare our system with other pronoun resolution systems, our results are outstanding in success rate than normal features and tree kernel-based method of Yang.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124901420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Feature Extraction from Chinese Text for Opinion Mining","authors":"S. Zhu, Yuanchao Liu, Ming Liu, Peiliang Tian","doi":"10.1109/IALP.2009.11","DOIUrl":"https://doi.org/10.1109/IALP.2009.11","url":null,"abstract":"more and more users and manufacturers concern about product reviews on the web, but it's difficult to quickly find interesting content from massive information. In order to mine sentiment polarity from review sentences, two approaches for product feature extraction and sentence opinion mining are proposed in this paper. Because of the characteristics of Chinese language, lexical analyzing tools are used to process review text, and association rule model is used to mine frequent items as candidate feature. In order to get better result, several filtering algorithms are proposed. Experiment results demonstrate that relation between the precision and recall rate of feature extraction task with different minimum support thresholds in association rules mining, and the promising performance of our approach has also been shown.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125366594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Author Profiling for Vietnamese Blogs","authors":"Dang Duc Pham, G. Tran, S. Pham","doi":"10.1109/IALP.2009.47","DOIUrl":"https://doi.org/10.1109/IALP.2009.47","url":null,"abstract":"This paper presents the first work in the task of author profiling for Vietnamese blogs. This task is important in threat identification and marketing intelligence. We have developed a Vietnamese Blog Profiling framework to automatically predict age, gender, geographic origin and occupation of weblogs’ authors purely based on language use. The experiments on the blogs corpus we collected show very promising results with accuracy of around 80% across all traits.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116356085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ze-ya Ding, HanFen Zang, Quan Zhang, Jianming Miao, Yu-huan Chi
{"title":"Automatic Machine Translation Evaluation Based on Sentence Structure Information","authors":"Ze-ya Ding, HanFen Zang, Quan Zhang, Jianming Miao, Yu-huan Chi","doi":"10.1109/IALP.2009.42","DOIUrl":"https://doi.org/10.1109/IALP.2009.42","url":null,"abstract":"Automatic evaluation of machine translation plays an important role in improving the performance of machine translation systems. In this paper, we firstly introduce three traditional methods of automatic evaluation, including BLEU, NIST and WER. All these methods are based on surface layer information of translations like vocabularies, so we do some studies on the evaluation method using the information of sentence structure. Because the Hierarchical Network of Concepts (HNC) theory thinks that sentence category and format transformations are two most important links in machine translation, we do some researches about sentence category and format transformations, and get the sentence structure information which is composed of sentence category information and format information of every sentence in the bilingual (Chinese and English) translation corpora. Then, considering the traditional methods above, we propose the method of automatic evaluation based on the information of sentence structure and have proved it effective by experiment.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132565809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}