Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering (NLPKE-2010): Latest Articles

Are we waves or are we particles? A new insight into deep semantics in natural language processing
Svetlana Machova, J. Klecková
DOI: https://doi.org/10.1109/NLPKE.2010.5587805 (published 2010-09-30)
Abstract: This paper brings a conceptually new, empirically based scientific approach to a deeper understanding of human cognition, language acquisition, the modularity of language, and the origin of language itself. The research presented provides an interactive multilingual associative experiment as an attempt to map the Cognitive Semantic Space (CSSES) and its basic frames of the Essential Self in the Czech language, and collects and compares it with the CSSES of the conceptual language view in Czech, Russian, English, and potentially other languages. We attempt to merge cognitive metaphor theory with psycholinguistics and psychoanalysis by applying associative-experiment methodology to Essential Self metaphors. The research has two main goals: the first is to build an Essential Self multilingual WordNet, which serves as a basic lexical resource for artificial intelligence and describes the core of human nature; the second is to create a multilingual 3D semantic network.
Citations: 2
Shui nationality characters stroke shape input method
Hanyue Yang, Xiaorong Chen
DOI: https://doi.org/10.1109/NLPKE.2010.5587840 (published 2010-09-30)
Abstract: The shape of Shui nationality characters is similar to that of oracle bone and Jinwen scripts. To solve the problem of encoding these hieroglyphic characters, a coding method based on stroke shape for Shui nationality characters is proposed. The shapes of the 467 Shui nationality characters in the Common Shui Script Dictionary are analyzed, and seven basic strokes that compose most Shui characters are extracted. Through statistical comparison, 21 stroke shapes are obtained by subdividing the seven basic strokes. A Shui nationality character is coded as an ordered sequence of three strokes taken from the corners of the character according to the coding rules. With this method, even users who cannot read Shui characters can input them easily and quickly.
Citations: 0
Chinese patent retrieval based on the pragmatic information
Liping Wu, Song Liu, F. Ren
DOI: https://doi.org/10.1109/NLPKE.2010.5587776 (published 2010-09-30)
Abstract: In this paper, we propose a novel information retrieval approach for Chinese patents based on pragmatic information. Patent retrieval is becoming more and more important: patents are an important resource in many fields, and effective patent retrieval saves corporations and researchers a great deal of time and money. However, with available methods the precision of patent retrieval results is not very high. Moreover, by analyzing patent documents we found that, beyond their literal meanings, patents carry deeper meanings that can be inferred from them; we call this deeper meaning pragmatic information. We therefore built a patent retrieval system that integrates pragmatic information with classical information retrieval techniques to improve retrieval accuracy. Experiments using the proposed method have been carried out, and the results show that the precision of patent retrieval based on pragmatic information is higher than that of retrieval without it.
Citations: 1
Part-of-speech tagging for Chinese unknown words in a domain-specific small corpus using morphological and contextual rules
Tao-Hsing Chang, Fu-Yuan Hsu, Chia-Hoang Lee, Hahn-Ming Lee
DOI: https://doi.org/10.1109/NLPKE.2010.5587771 (published 2010-09-30)
Abstract: Many studies have tried to search for useful information on the Internet using meaningful terms or words. The performance of these approaches is often affected by the accuracy of unknown-word extraction and POS tagging, which in turn is affected by the size of training corpora and the characteristics of the language. This work proposes and develops a method that concentrates on tagging the POS of Chinese unknown words in the domain of interest, based on the integration of morphological rules, contextual rules, and a statistics-based method. Experimental results indicate that the proposed method can overcome the difficulties resulting from small corpora in oriental languages, and can accurately tag unknown words with POS in domain-specific small corpora.
Citations: 1
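The combination of morphological and contextual rules can be sketched as a small rule cascade. The cue characters and rules below are invented illustrations, not the paper's actual rule set:

```python
# Minimal sketch of combining morphological and contextual rules to tag an
# unknown Chinese word. The rules and cue characters are invented examples.

MORPH_RULES = {          # final-character cue -> candidate POS
    "器": "N",           # instrument-like suffix often marks a noun
    "化": "V",           # "-ize"-like suffix often marks a verb
}
CONTEXT_RULES = {        # tag of preceding word -> candidate POS
    "DET": "N",          # a determiner is usually followed by a noun
    "ADV": "V",          # an adverb is usually followed by a verb
}

def tag_unknown(word, prev_tag, default="N"):
    """Apply the morphological rule first, then the contextual rule."""
    if word and word[-1] in MORPH_RULES:
        return MORPH_RULES[word[-1]]
    if prev_tag in CONTEXT_RULES:
        return CONTEXT_RULES[prev_tag]
    return default
```

In the paper a statistics-based method supplements these rules; here the `default` tag stands in for that fallback.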
Statistical parsing based on Maximal Noun Phrase pre-processing
Qiaoli Zhou, Yue Gu, Xin Liu, Wenjing Lang, Dongfeng Cai
DOI: https://doi.org/10.1109/NLPKE.2010.5587850 (published 2010-09-30)
Abstract: According to the characteristics of the Chinese language, this paper proposes a statistical parsing method based on Maximal Noun Phrase (MNP) pre-processing, in which MNP parsing is separated from parsing the full sentence. First, the MNPs in a sentence are identified; next, each MNP is represented by its head word, and the sentence is parsed with the heads in place of the MNPs. The original sentence is thus divided into two parts that can be parsed separately: parsing of the MNPs themselves, and parsing of the sentence in which the MNPs are replaced by their head words. Finally, the paper uses Conditional Random Fields (CRFs) as the statistical recognition model at each level of the syntactic parsing process.
Citations: 3
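The divide step of the method can be sketched directly: given MNP spans and their head words (assumed here to come from a CRF identifier, which is not implemented), split the sentence into MNP subproblems plus a reduced sentence:

```python
# Sketch of the MNP pre-processing split: parse each MNP separately, and
# parse the reduced sentence with each MNP replaced by its head word.
# MNP spans and heads are assumed given by an upstream identifier.

def reduce_sentence(tokens, mnp_spans):
    """mnp_spans: list of (start, end, head) with end exclusive.
    Returns the reduced token list and the extracted MNP token lists."""
    reduced, mnps, i = [], [], 0
    for start, end, head in sorted(mnp_spans):
        reduced.extend(tokens[i:start])
        reduced.append(head)              # MNP represented by its head word
        mnps.append(tokens[start:end])
        i = end
    reduced.extend(tokens[i:])
    return reduced, mnps
```

For example, `["the", "big", "red", "dog", "chased", "a", "cat"]` with spans `[(0, 4, "dog"), (5, 7, "cat")]` reduces to `["dog", "chased", "cat"]`, and the two MNPs are parsed on their own.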
Bagging to find better expansion words
Bingqing Wang, Yaqian Zhou, Xipeng Qiu, Qi Zhang, Xuanjing Huang
DOI: https://doi.org/10.1109/NLPKE.2010.5587826 (published 2010-09-30)
Abstract: Supervised learning has been applied to query expansion, training a model to predict the "goodness" or "utility" of an expanded term to the retrieval system. Many features measure the relatedness between an expanded word and the query, and these can be incorporated into supervised learning to select expanded terms. The training data set is generated automatically, but this procedure is affected by many factors. A severe problem, not discussed in previous work, is that the distribution of the features is query-dependent. With a different distribution over the features, it is questionable to merge these training instances together and use the whole data set to train one single model. In this paper, we first investigate the statistical distribution of the auto-generated training data and show the problems in the training data set. Based on our analysis, we propose to use bagging to ensemble several regression models in order to obtain a better supervised model for predicting expanded terms. We conducted experiments on the TREC benchmark test collections. Our analysis of the training data reveals some interesting phenomena about query expansion techniques, and the experimental results show that the bagging approach can achieve state-of-the-art retrieval performance on the standard TREC data set.
Citations: 1
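The bagging idea itself is simple to sketch: train several regressors on bootstrap resamples of the training data and average their predictions of term goodness. The base learner below is a toy one-nearest-neighbour regressor over a single feature; the real system uses many relatedness features:

```python
import random

# Sketch of bagging for expansion-term scoring: fit base regressors on
# bootstrap resamples and average the ensemble's predictions. The 1-NN base
# learner and single feature here are illustrative toys.

def knn1_fit(xs, ys):
    data = list(zip(xs, ys))
    def predict(x):
        return min(data, key=lambda p: abs(p[0] - x))[1]
    return predict

def bagged_fit(xs, ys, n_models=25, seed=0):
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(len(xs)) for _ in xs]       # bootstrap sample
        models.append(knn1_fit([xs[i] for i in idx], [ys[i] for i in idx]))
    def predict(x):                                      # average the ensemble
        return sum(m(x) for m in models) / len(models)
    return predict

# Toy "goodness" scores for candidate expansion terms, keyed by one feature.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [0.1, 0.3, 0.5, 0.7, 0.9]
model = bagged_fit(xs, ys)
```

Because each base model sees a different resample, the ensemble's average is less sensitive to the query-dependent quirks of any single training slice, which is the motivation the abstract gives for bagging.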
Affix-augmented stem-based language model for Persian
Heshaam Faili, H. Ravanbakhsh
DOI: https://doi.org/10.1109/NLPKE.2010.5587823 (published 2010-09-30)
Abstract: Language modeling is used in many NLP applications such as machine translation, POS tagging, speech recognition, and information retrieval. It assigns a probability to a sequence of words, a task that becomes challenging for highly inflectional languages. In this paper we investigate standard statistical language models on Persian as an inflectional language. We propose two variations of morphological language models that rely on a morphological analyzer to manipulate the dataset before modeling. We then discuss the shortcomings of these models and introduce a novel approach that exploits the structure of the language and produces more accurate models. Experimental results are encouraging, especially for n-gram models with a small training dataset.
Citations: 2
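The core of a morphological language model is decomposing each word into stem and affix tokens before n-gram counting, which shrinks the vocabulary of an inflectional language. A minimal sketch, with a toy suffix-stripper standing in for a real Persian morphological analyzer (the suffix list is an invented example):

```python
from collections import Counter

# Sketch of a stem+affix bigram model: segment words morphologically, then
# count n-grams over the resulting token stream. The segmenter is a toy
# suffix-stripper; SUFFIXES is an invented example list.

SUFFIXES = ["ha", "am", "ad"]

def segment(word):
    for suf in SUFFIXES:
        if word.endswith(suf) and len(word) > len(suf) + 1:
            return [word[:-len(suf)], "+" + suf]   # stem, marked affix
    return [word]

def train_bigram(sentences):
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        toks = ["<s>"] + [t for w in sent for t in segment(w)] + ["</s>"]
        unigrams.update(toks)
        bigrams.update(zip(toks, toks[1:]))
    return unigrams, bigrams

def prob(w1, w2, unigrams, bigrams):
    """Add-one smoothed bigram probability P(w2 | w1)."""
    v = len(unigrams)
    return (bigrams[(w1, w2)] + 1) / (unigrams[w1] + v)
```

Because inflected forms share a stem token, counts pool across inflections, which is the benefit the paper targets for small training datasets.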
Sentiment word identification using the maximum entropy model
Xiaoxu Fei, Huizhen Wang, Jingbo Zhu
DOI: https://doi.org/10.1109/NLPKE.2010.5587811 (published 2010-09-30)
Abstract: This paper addresses the issue of identifying sentiment words in an opinionated sentence, which is very important in sentiment analysis tasks. The most common way to tackle this problem is to use a readily available sentiment lexicon such as HowNet or SentiWordNet to determine whether a word is a sentiment word. In practice, however, words in the lexicon sometimes cannot express sentiment in a given context, while other words outside the lexicon can. To address this challenge, this paper presents an approach based on a maximum-entropy classification model that identifies sentiment words given an opinionated sentence. Experimental results show that our approach outperforms baseline lexicon-based methods.
Citations: 16
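A binary maximum-entropy model is equivalent to logistic regression over the chosen features. A self-contained sketch with invented features and toy training data (the paper's actual feature set is richer and context-dependent):

```python
import math

# Sketch of the maxent formulation: classify whether a word acts as a
# sentiment word from features of the word and its context. The features
# ([in sentiment lexicon, intensifier nearby]) and training data are toys.

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(X, y, epochs=2000, lr=0.5):
    w = [0.0] * (len(X[0]) + 1)                        # weights + bias
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi + [1.0])))
            g = yi - p                                 # log-likelihood gradient
            w = [wj + lr * g * xj for wj, xj in zip(w, xi + [1.0])]
    return w

def predict(w, x):
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x + [1.0]))) >= 0.5

X = [[1, 1], [1, 0], [0, 1], [0, 0]]   # toy feature vectors
y = [1, 1, 0, 0]                       # toy labels: lexicon membership wins
w = train(X, y)
```

The point of the model over a bare lexicon lookup is that contextual features can overrule membership; here the toy data is too small to show that, but the decision is already a weighted combination rather than a hard lookup.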
A reranking method for syntactic parsing with heterogeneous treebanks
Haibo Ding, Muhua Zhu, Jingbo Zhu
DOI: https://doi.org/10.1109/NLPKE.2010.5587842 (published 2010-09-30)
Abstract: In the field of natural language processing (NLP), there often exist multiple corpora with different annotation standards for the same task. In this paper, we take syntactic parsing as a case study and propose a reranking method that can make direct use of disparate treebanks simultaneously without techniques such as treebank conversion. The method proceeds in three steps: 1) build parsers on individual treebanks; 2) use the parsers independently to generate n-best lists for each sentence in the test set; 3) rerank the individual n-best lists corresponding to the same sentence using consensus information exchanged among these lists. Experimental results on two open Chinese treebanks show that our method significantly outperforms the baseline system by 0.84% and 0.53% respectively.
Citations: 0
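Step 3 of the method can be sketched concretely. Here a parse is represented as a set of labelled spans, and a candidate's consensus score is its average overlap (F1) with every candidate in the other parsers' lists; both the representation and the scoring are illustrative simplifications:

```python
# Sketch of consensus reranking: score each candidate in one parser's
# n-best list by its average bracket overlap with the candidates produced
# by the other parsers, then sort by that score.

def f1(a, b):
    """Span-overlap F1 between two parses given as sets of labelled spans."""
    if not a or not b:
        return 0.0
    inter = len(a & b)
    if inter == 0:
        return 0.0
    p, r = inter / len(b), inter / len(a)
    return 2 * p * r / (p + r)

def rerank(own_list, other_lists):
    """Return own_list sorted by consensus with the other n-best lists."""
    others = [c for lst in other_lists for c in lst]
    def consensus(parse):
        return sum(f1(parse, c) for c in others) / len(others)
    return sorted(own_list, key=consensus, reverse=True)
```

Candidates that many independently trained parsers agree on float to the top, which is how the method exploits heterogeneous treebanks without converting their annotations.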
Flexible English writing support based on negative-positive conversion method
Yasushi Katsura, Kazuyuki Matsumoto, F. Ren
DOI: https://doi.org/10.1109/NLPKE.2010.5587778 (published 2010-09-30)
Abstract: With recent globalization, opportunities to communicate in English have increased in the business field; in particular, theses and formal documents often must be written in English. Because many Japanese are not used to composing English sentences, writing appropriate English without support is a great burden. In this study we developed an English composition support system that searches a database for interlinear translation examples to refer to and generates a new sentence by replacing a noun in the example sentence. In this paper, based on the Super-Function technique, we propose a method to convert an affirmative sentence into a negative one and vice versa, to realize more flexible and extensive text conversion.
Citations: 1
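The surface operation of affirmative-negative conversion can be sketched on a toy English fragment by toggling "not" after a copula or modal auxiliary. Super-Function-based conversion handles far more structure; this shows only the basic toggle:

```python
# Toy sketch of negative-positive conversion: insert or remove "not" after
# the first auxiliary. Real conversion must handle do-support, contractions,
# and agreement, none of which are modelled here.

AUXILIARIES = {"is", "are", "was", "were", "can", "will", "should"}

def toggle_negation(sentence):
    tokens = sentence.split()
    for i, tok in enumerate(tokens):
        if tok.lower() in AUXILIARIES:
            if i + 1 < len(tokens) and tokens[i + 1] == "not":
                del tokens[i + 1]               # negative -> affirmative
            else:
                tokens.insert(i + 1, "not")     # affirmative -> negative
            break
    return " ".join(tokens)
```

Applied to a retrieved example sentence, the same transformation doubles the coverage of the example database: one stored sentence yields both its affirmative and negative forms.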