2014 International Conference on Asian Language Processing (IALP): Latest Papers

Logical operative processes of semantic grammar for machine interpretation
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973487
Sivakumar Ramakrishnan, Pradeep Isawasan, V. Mohanan
{"title":"Logical operative processes of semantic grammar for machine interpretation","authors":"Sivakumar Ramakrishnan, Pradeep Isawasan, V. Mohanan","doi":"10.1109/IALP.2014.6973487","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973487","url":null,"abstract":"The purpose of this paper is to identify and reveal the significance of primary logical operative processes of semantic grammar of any languages for the establishment of machine interpretation. This neo generative mechanism for logical semantic representation for machine interpretation has been systematically analyzed by logical linguistic and mathematical postulations. These logical operative processes structurally provide a way in which grammatical properties of language can be treated within a framework of speech acts to accommodate and to ease the machine interpretation for ontological representation and cognitive act. This treatment also allows the sentences to be semantically interpreted and hermeneutically analyzed within the temporal movement of speech act for machine interpretation. The logical postulation of operative processes of grammar enables to provide an explanation of the grammatical intuitions of a native speaker of a language in terms of both a variety of cognitive operations and knowledge of distinct object categories to be applied in the machine interpretation.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129436498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Sentiment classification using Enhanced Contextual Valence Shifters
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973485
V. Phu, Phan Thi Tuoi
{"title":"Sentiment classification using Enhanced Contextual Valence Shifters","authors":"V. Phu, Phan Thi Tuoi","doi":"10.1109/IALP.2014.6973485","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973485","url":null,"abstract":"We have explored different methods of improving the accuracy of sentiment classification. The sentiment orientation of a document can be positive (+), negative (-), or neutral (0). We combine five dictionaries from [2, 3, 4, 5, 6] into the new one with 21137 entries. The new dictionary has many verbs, adverbs, phrases and idioms, that are not in five ones before. The paper shows that our proposed method based on the combination of Term-Counting method and Enhanced Contextual Valence Shifters method has improved the accuracy of sentiment classification. The combined method has accuracy 68.984% on the testing dataset, and 69.224% on the training dataset. All of these methods are implemented to classify the reviews based on our new dictionary and the Internet Movie data set.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121989443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 40
Designing an Indonesian part of speech tagset and manually tagged Indonesian corpus
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973519
A. Dinakaramani, Rashel Fam, A. Luthfi, R. Manurung
{"title":"Designing an Indonesian part of speech tagset and manually tagged Indonesian corpus","authors":"A. Dinakaramani, Rashel Fam, A. Luthfi, R. Manurung","doi":"10.1109/IALP.2014.6973519","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973519","url":null,"abstract":"We describe our work on designing a linguistically principled part of speech (POS) tagset for the Indonesian language. The process involves a detailed study and analysis of existing tagsets and the manual tagging of an Indonesian corpus. The results of this work are an Indonesian POS tagset consisting of 23 tags and an Indonesian corpus of over 250.000 lexical tokens that have been manually tagged using this tagset.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129033432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 80
Category-associated collocative concept primitives extraction
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973475
Zhejie Chi, Quan Zhang
{"title":"Category-associated collocative concept primitives extraction","authors":"Zhejie Chi, Quan Zhang","doi":"10.1109/IALP.2014.6973475","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973475","url":null,"abstract":"Collocation is studied as an essential linguistic phenomenon in traditional natural language processing. Similarity, collocative concept primitives are introduced in HNC Concept Primitive Space to present the concept primitive pair co-occurring frequently. Collocative concept primitives can be studied with categories together as concept primitives usually contain category information. To explore the collocation phenomenon in the field of HNC and apply collocative information to language processing, this paper presents a two-stage approach to extract category-associated collocative concept primitives from a classification corpus. By conducting collocative concept primitives extraction in each sub-category corpus and carrying out category-associated collocative concept primitives extraction in the summarized corpus, we generate a category-associated collocative concept primitives list for each category. Our experiments show the items we extract are consistent with the reality and are of significance.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116617926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Which performs better for new word detection, character based or Chinese Word Segmentation based?
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973474
Haijun Zhang, Shumin Shi
{"title":"Which performs better for new word detection, character based or Chinese Word Segmentation based?","authors":"Haijun Zhang, Shumin Shi","doi":"10.1109/IALP.2014.6973474","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973474","url":null,"abstract":"This paper proposed a novel method to evaluate the performance of New Word Detection (NWD) based on repeats extraction. For small-scale corpus, we put forward employing Conditional Random Field (CRF) as statistical framework to estimate the effects of different strategies of NWD. For the situations of large-scale corpus, as there is no infinity of annotated corpus, comparative experiments are unable to carry out evaluation. Accordingly, this paper proposed a pragmatic quantitative model to analyze and estimate the performance of NWD for all kinds of cases, especially for large-scale corpus situation. Studies have shown there is a good mutual authentication between experimental results and conclusion from the quantitative model. On the basis of analysis for experimental data and quantitative model, a reliable conclusion for effects of Chinese NWD basing the two strategies is reached, which can give a certain instruction for follow-up studies in Chinese new word detection.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115327803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Research on recognition of semantic chunk boundary in Tibetan
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973476
Tianhang Wang, Shumin Shi, Heyan Huang, Congjun Long, Ruijing Li
{"title":"Research on recognition of semantic chunk boundary in Tibetan","authors":"Tianhang Wang, Shumin Shi, Heyan Huang, Congjun Long, Ruijing Li","doi":"10.1109/IALP.2014.6973476","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973476","url":null,"abstract":"Semantic chunk is able to well describe the sentence semantic framework. It plays a very important role in Natural Language Processing applications, such as machine translation, QA system and so on. At present, the Tibetan chunk researches are mainly based on rule-methods. In this paper, according to the distinctive language characteristics of Tibetan, we firstly put forward the descriptive definition of the Tibetan semantic chunk and its labeling scheme and then we propose a feature selection algorithm to select the suitable ones automatically from the candidate feature-templates. Through the experiment conducted on the two different kinds of Tibetan corpus, namely corpus-sentence and corpus-discourse, the F-Measure achieves 95.84%, 94.95% and 91.97%, 88.82% by using of Conditional Random Fields (CRF) model and Maximum Entropy (ME) model respectively. The positive results show that the definition of Tibetan semantic chunk in this paper is reasonable and operable. Furthermore, its boundary recognition is feasible and effective via statistical techniques in small scale corpus.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124190077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Information decompression of Xinjiang travel materials
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973479
Kaihong Yang, Shuzhen Shi
{"title":"Information decompression of Xinjiang travel materials","authors":"Kaihong Yang, Shuzhen Shi","doi":"10.1109/IALP.2014.6973479","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973479","url":null,"abstract":"Previous discussions on the translation of travel materials are mainly confined to functional and semeiotic perspectives. Authors of this paper hold that Xinjiang travel materials involve implicit information related to distinguished ethnical, geographical and historical cultures which cannot be absorbed comprehensively by English-speakers who do not share the same cultural backgrounds. They try to settle the problem with application of information decompression which means to amplify information redundancy to reduce unpredictability during message transmission. Meanwhile, they take translation plus comment, translation plus supplementation and translation plus explanation as measures in decompression. To be exact, in Chinese-English translation of Xinjiang travel materials, authors of the paper decompress the original texts and release the cultural connotations by means of translation plus comment, translation plus supplementation and translation plus explanation so as to convey correct and adequate information to receivers, shorten the cultural gap and achieve effective communication.\nThis paper tries to propose a new prospective for the translation of Xinjiang travel materials.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128290762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Semantic type disambiguation for Japanese verbs
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973471
Shohei Okada, Kazuhide Yamamoto
{"title":"Semantic type disambiguation for Japanese verbs","authors":"Shohei Okada, Kazuhide Yamamoto","doi":"10.1109/IALP.2014.6973471","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973471","url":null,"abstract":"The interest has been increasing in recent years in extracting and analyzing evaluations and opinions of service or products from large bodies of text. It is important to classify predicates according to sense because whether or not a statement includes the speaker's opinion depends strongly on its predicate. It is generally assumed that Japanese part-of-speech (POS) for predicates is classified according to sense; however, the POS classifications differ from their semantic classification. On this subject, semantic types, which aim to classify predicates, have been proposed. In this paper, we describe semantic types and present our construction of a disambiguator for Japanese verbs. Specifically, we constructed this disambiguator using a support vector machine by building feature vectors. We used semantic categories of noun and results of morphological analysis for the feature vectors. We then achieved 69.9% accuracy of disambiguation for newspaper articles using 10-fold cross-validation.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"10 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120822909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
NormAPI: An API for normalizing Filipino shortcut texts
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973494
N. Nocon, G. Cuevas, Darwin Magat, Peter Suministrado, C. Cheng
{"title":"NormAPI: An API for normalizing Filipino shortcut texts","authors":"N. Nocon, G. Cuevas, Darwin Magat, Peter Suministrado, C. Cheng","doi":"10.1109/IALP.2014.6973494","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973494","url":null,"abstract":"As the number of Internet and mobile phone users grow, texting and chatting have become popular means of communication. Reaching new heights, the extensive use of cellphones and Internet led into the creation of a new language, where words are transformed and made shorter using various styles. Shortcut texting is used in informal venues such as SMS, online, chat rooms, forums and posts in social networks. Huge amounts of data originating from these informal sources can be utilized for various tasks in machine learning and data analytics. As these data may be written in shortcut forms, text normalization is necessary before NLP actions such as information extraction, data mining, text summarization, opinion classification, and even bilingual translations can be fully achieved, by acting as a preprocessing stage that transforms all informal texts back to their original and more understandable forms. This paper is about NormAPI, an API for normalizing Filipino shortcut texts.\nNormAPI primarily intends to be used as a preprocessing system that corrects informalities in shortcut texts before they are handed for complete data processing.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126680942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 7
Influence of various asymmetrical contextual factors for TTS in a low resource language
2014 International Conference on Asian Language Processing (IALP) Pub Date : 2014-12-04 DOI: 10.1109/IALP.2014.6973509
Nirmesh J. Shah, Mohammadi Zaki, H. Patil
{"title":"Influence of various asymmetrical contextual factors for TTS in a low resource language","authors":"Nirmesh J. Shah, Mohammadi Zaki, H. Patil","doi":"10.1109/IALP.2014.6973509","DOIUrl":"https://doi.org/10.1109/IALP.2014.6973509","url":null,"abstract":"The generalized statistical framework of Hidden Markov Model (HMM) has been successfully applied from the field of speech recognition to speech synthesis. In this paper, we have applied HMM-based Speech Synthesis (HTS) method to Gujarati (one of the official languages of India). Adaption and evaluation of HTS for Gujarati language has been done here. In addition, to understand the influence of asymmetrical contextual factors on quality of synthesized speech, we have conducted series of experiments. Evaluation of different HTS built for Gujarati speech using various asymmetrical contextual factors is done in terms of naturalness and speech intelligibility. From the experimental results, it is evident that when more weightage is given to left phoneme in asymmetrical contextual factor, HTS performance improves compared to conventional symmetrical contextual factors for both triphone and pentaphone case. Furthermore, we achieved best performance for Gujarati HTS with left-left-left-centre-right (i.e., LLLCR) contextual factors.","PeriodicalId":117334,"journal":{"name":"2014 International Conference on Asian Language Processing (IALP)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126214289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2