2019 International Conference on Asian Language Processing (IALP)最新文献_第3页

A Comparative Analysis of Acoustic Characteristics between Kazak & Uyghur Mandarin Learners and Standard Mandarin Speakers 哈萨克族、维吾尔族普通话学习者与标准普通话使用者的声学特征比较分析

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037703

Gulnur Arkin, Gvljan Alijan, A. Hamdulla, Mijit Ablimit

引用次数: 0

On the Etymology of he ‘river’ in Chinese 论汉语“河”的词源

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037654

Huibin Zhuang, Zhanting Bu

引用次数: 0

Diachronic Synonymy and Polysemy: Exploring Dynamic Relation Between Forms and Meanings of Words Based on Word Embeddings 历时同义与多义:基于词嵌入的词形与词义动态关系研究

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037663

Shichen Liang, Jianyu Zheng, Xuemei Tang, Renfen Hu, Zhiying Liu

引用次数: 0

Developing a machine learning-based grade level classifier for Filipino children’s literature 为菲律宾儿童文学开发一个基于机器学习的年级分类器

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037694

Joseph Marvin Imperial, R. Roxas, Erica Mae Campos, Jemelee Oandasan, Reyniel Caraballo, Ferry Winsley Sabdani, Ani Rosa Almaroi

{"title":"Developing a machine learning-based grade level classifier for Filipino children’s literature","authors":"Joseph Marvin Imperial, R. Roxas, Erica Mae Campos, Jemelee Oandasan, Reyniel Caraballo, Ferry Winsley Sabdani, Ani Rosa Almaroi","doi":"10.1109/IALP48816.2019.9037694","DOIUrl":"https://doi.org/10.1109/IALP48816.2019.9037694","url":null,"abstract":"Reading is an essential part of children’s learning. Identifying the proper readability level of reading materials will ensure effective comprehension. We present our efforts to develop a baseline model for automatically identifying the readability of children’s and young adult’s books written in Filipino using machine learning algorithms. For this study, we processed 258 picture books published by Adarna House Inc. In contrast to old readability formulas relying on static attributes like number of words, sentences, syllables, etc., other textual features were explored. Count vectors, Term FrequencyInverse Document Frequency (TF-IDF), n-grams, and character-level n-grams were extracted to train models using three major machine learning algorithms–Multinomial Naïve-Bayes, Random Forest, and K-Nearest Neighbors. A combination of K-Nearest Neighbors and Random Forest via voting-based classification mechanism resulted with the best performing model with a high average training accuracy and validation accuracy of 0.822 and 0.74 respectively. Analysis of the top 10 most useful features for each algorithm show that they share common similarity in identifying readability levels–the use of Filipino stop words. Performance of other classifiers and features were also explored.","PeriodicalId":208066,"journal":{"name":"2019 International Conference on Asian Language Processing (IALP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121464491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Employing Gated Attention and Multi-similarities to Resolve Document-level Chinese Event Coreference 用门控注意和多重相似度解决文档级汉语事件共指

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037674

Haoyi Cheng, Peifeng Li, Qiaoming Zhu

引用次数: 0

An End-to-End Model Based on TDNN-BiGRU for Keyword Spotting 基于TDNN-BiGRU的端到端关键字识别模型

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037714

Shuzhou Chai, Zhenye Yang, Changsheng Lv, Weiqiang Zhang

引用次数: 2

Improving Japanese-English Bilingual Mapping of Word Embeddings based on Language Specificity 基于语言特异性的日英双语词嵌入映射改进

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037649

Yuting Song, Biligsaikhan Batjargal, Akira Maeda

引用次数: 1

Extremely Low Resource Text simplification with Pre-trained Transformer Language Model 极低的资源文本简化与预训练的转换语言模型

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037650

T. Maruyama, Kazuhide Yamamoto

引用次数: 10

Neural Machine Translation Strategies for Generating Honorific-style Korean 敬语式韩语的神经机器翻译策略研究

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037681

Lijie Wang, Mei Tu, Mengxia Zhai, Huadong Wang, Song Liu, Sang Ha Kim

引用次数: 1

A Study on Syntactic Complexity and Text Readability of ASEAN English News 东盟英语新闻的句法复杂性与篇章可读性研究

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI: 10.1109/IALP48816.2019.9037695

Yusha Zhang, Nankai Lin, Sheng-yi Jiang

{"title":"A Study on Syntactic Complexity and Text Readability of ASEAN English News","authors":"Yusha Zhang, Nankai Lin, Sheng-yi Jiang","doi":"10.1109/IALP48816.2019.9037695","DOIUrl":"https://doi.org/10.1109/IALP48816.2019.9037695","url":null,"abstract":"English is the most widely used language in the world. With the spread and evolution of language, there are differences in the English text expression and reading difficulty in different regions. Due to the difference in the content and wording, English news in some countries is easier to understand than in others. Using an accurate and effective method to calculate the difficulty of text is not only beneficial for news writers to write easy-to-understand articles, but also for readers to choose articles that they can understand. In this paper, we study the differences in the text readability between most ASEAN countries, England and America. We compare the textual readability and syntactic complexity of English news texts among England, America and eight ASEAN countries (Indonesia, Malaysia, Philippines, Singapore, Brunei, Thailand, Vietnam, Cambodia). This paper selected the authoritative news media of each country as the research object. We used different indicators including Flesch-Kincaid Grade Level (FKG), Flesch Reading Ease Index (FRE), Gunning Fog Index (GF), Automated Readability Index (AR), Coleman-Liau Index (CL) and Linsear Write Index (LW) to measure the textual readability, and then applied L2SCA to analyze the syntactic complexity of news text. According to the analysis results, we used the hierarchical clustering method to classify the English texts of different countries into six different levels. Moreover, we elucidated the reasons for such readability differences in these countries.","PeriodicalId":208066,"journal":{"name":"2019 International Conference on Asian Language Processing (IALP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125219339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2