Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval最新文献_第3页

Topic Modeling on Indonesian Online Shop Chat 印尼语网上商店聊天的主题建模

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342831

A. Hidayatullah, Wisnu Kurniawan, Chanifah Indah Ratnasari

引用次数: 6

Evaluation of Pseudo-Relevance Feedback using Wikipedia 使用维基百科评估伪相关反馈

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342845

Murtadha Aljubran

引用次数: 0

Deep Speaker Embedding for Speaker-Targeted Automatic Speech Recognition 针对说话人自动语音识别的深度说话人嵌入

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342847

Guan-Lin Chao, John Paul Shen, Ian Lane

引用次数: 0

A Task-oriented Chatbot Based on LSTM and Reinforcement Learning 基于LSTM和强化学习的面向任务的聊天机器人

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342844

Tai-Liang Chou, Yu-Ling Hsueh

引用次数: 7

Building the Language Resource for a Cebuano-Filipino Neural Machine Translation System 基于神经机器翻译系统的语言资源构建

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342833

Kristine Mae M. Adlaon, N. Marcos

{"title":"Building the Language Resource for a Cebuano-Filipino Neural Machine Translation System","authors":"Kristine Mae M. Adlaon, N. Marcos","doi":"10.1145/3342827.3342833","DOIUrl":"https://doi.org/10.1145/3342827.3342833","url":null,"abstract":"Parallel corpus is a critical resource in machine learning based translation. The task of collecting, extracting, and aligning texts in order to build an acceptable corpus for doing translation is very tedious most especially for low-resource languages. In this paper, we present the efforts made to build a parallel corpus for Cebuano and Filipino from two different domains: biblical texts and the web. For the biblical resource, subword unit translation for verbs and copy-able approach for nouns were applied to correct inconsistencies in translation. This correction mechanism was applied as a preprocessing technique. On the other hand, for Wikipedia being the main web resource, commonly occurring topic segments were extracted from both the source and the target languages. These observed topic segments are unique in 4 different categories. The identification of these topic segments may be used for automatic extraction of sentences. A Recurrent Neural Network was used to implement the translation using OpenNMT sequence modeling tool in TensorFlow. The two different corpora were then evaluated by using them as two separate inputs in the neural network. Results have shown a difference in BLEU score in both corpora.","PeriodicalId":254461,"journal":{"name":"Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126710273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Text Compression for Myanmar Information Retrieval 缅甸信息检索的文本压缩

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342830

N. Lin, A. KudinovVitaly, Y. Soe

引用次数: 1

Applicability of Text-representing Centroids for Thai Language Documents 文本表示质心在泰语文档中的适用性

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342853

Sureeporn Nualnim, Nirach Romyen, M. Sodanil

引用次数: 1

Text Classification of Network Pyramid Scheme based on Topic Model 基于主题模型的网络传销文本分类

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342835

Pengyu Mu, Jingsha He, Nafei Zhu

引用次数: 0

Using Sentiment Analysis for Comparing Attitudes between Computer Professionals and Laypersons on the Topic of Artificial Intelligence 用情感分析比较计算机专业人员和非专业人员对人工智能话题的态度

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-06-28 DOI: 10.1145/3342827.3342829

Xueying Wang

引用次数: 1

A Novel Task-Oriented Text Corpus in Silent Speech Recognition and its Natural Language Generation Construction Method 一种面向任务的无声语音识别文本语料库及其自然语言生成构建方法

Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval Pub Date : 2019-04-19 DOI: 10.1145/3342827.3342838

Dong Cao, Dongdong Zhang, Haibo Chen

{"title":"A Novel Task-Oriented Text Corpus in Silent Speech Recognition and its Natural Language Generation Construction Method","authors":"Dong Cao, Dongdong Zhang, Haibo Chen","doi":"10.1145/3342827.3342838","DOIUrl":"https://doi.org/10.1145/3342827.3342838","url":null,"abstract":"Millions of people with severe speech disorders around the world may regain their communication capabilities through techniques of silent speech recognition (SSR). Using electroencephalography (EEG) as a biomarker for speech decoding has been popular for SSR. However, the lack of SSR text corpus has impeded the development of this technique. Here, we construct a novel task-oriented text corpus, which is utilized in the field of SSR. In the process of construction, we propose a task-oriented hybrid construction method based on natural language generation (NLG) algorithm. The algorithm focuses on the strategy of data-to-text generation, and has two advantages including linguistic quality and high diversity. These two advantages use template-based method and deep neural networks respectively. In an SSR experiment with the generated text corpus, analysis results show that the performance of our hybrid construction method outperforms the pure method such as template-based natural language generation or neural natural language generation models.","PeriodicalId":254461,"journal":{"name":"Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133856133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3