2017 International Conference on Asian Language Processing (IALP)最新文献_第4页

Filipino and english clickbait detection using a long short term memory recurrent neural network 菲律宾和英语标题党检测使用长短期记忆递归神经网络

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300597

Philogene Kyle Dimpas, Royce Vincent Po, M. J. Sabellano

引用次数: 12

Adapting monolingual resources for code-mixed hindi-english speech recognition 适应单语言资源的代码混合印地语-英语语音识别

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300583

Ayushi Pandey, B. M. L. Srivastava, S. Gangashetty

引用次数: 13

Semantic-frame representation for event detection on Twitter Twitter上事件检测的语义框架表示

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300594

Yanxia Qin, Yue Zhang, Min Zhang, De-Kui Zheng

引用次数: 3

Joint bi-affine parsing and semantic role labeling 联合双仿射解析和语义角色标注

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300612

Peng Shi, Yue Zhang

引用次数: 5

Joint learning of contextal and global features for named entity disambiguation 上下文特征和全局特征的联合学习用于命名实体消歧

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300533

Bo Ma, Tonghai Jiang, Yating Yang, Xi Zhou, Lei Wang

引用次数: 0

Correcting misuse of Japanese visually similar characters 纠正日语视觉相似字符的误用

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300545

Youichiro Ogawa, Kazuhide Yamamoto

引用次数: 1

Isolated digit filipino speech recognition through spectrogram image classification: Towards application in a disaster preparedness participatory toolkit 通过光谱图图像分类的孤立数字菲律宾语音识别:在备灾参与式工具包中的应用

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300539

Julie Ann A. Salido, Nathaniel Oco, R. Roxas, Emmanuel Malaay, Michael Simora, R. J. Cabatic

{"title":"Isolated digit filipino speech recognition through spectrogram image classification: Towards application in a disaster preparedness participatory toolkit","authors":"Julie Ann A. Salido, Nathaniel Oco, R. Roxas, Emmanuel Malaay, Michael Simora, R. J. Cabatic","doi":"10.1109/IALP.2017.8300539","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300539","url":null,"abstract":"In this paper, we present our work on isolated digit speech recognition: by classifying spectrogram images and for use in a disaster preparedness participatory toolkit. To achieve higher inclusivity, we included a voice component for a wider coverage of respondents especially those who have low literacy and those vision impaired individuals. Our methodology is through speech recognition which is a deviation from usual approaches which normally work on acoustic coefficients and features. As our initial test bed, we focused on the Filipino language — a member of the Malayo-Polynesian language family and is the national language in the Philippines. Our data covers 4,297 utterances of the Filipino digits 0 to 9 collected from 262 speakers, and divided the data into 3 parts: 70% for training, 20% for testing, and 10% for validation. We applied short-time Fourier transform on our training data and we used convolution neural networks in MatLab to classify the spectrogram images. The lowest accuracy rate during our tests is 93.02%. Analyses of the results show that background noises are the cause of the misclassified utterances which will further discussed on this paper. While the results are promising, the work can be extended to include closely related languages.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121076077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Named entity transliteration with sequence-to-sequence neural network 序列到序列神经网络的命名实体音译

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300621

Zhongwei Li, Chng Eng Siong, Haizhou Li

引用次数: 7

Analyzing word embeddings and improving POS tagger of tigrinya tigrinya的词嵌入分析及POS标注器改进

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300559

Yemane Tedla, Kazuhide Yamamoto

引用次数: 7

Qualitative data analysis of disaster risk reduction suggestions assisted by topic modeling and word2vec 借助主题建模和word2vec对减灾建议进行定性数据分析

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI: 10.1109/IALP.2017.8300601

Ken Gorro, J. Ancheta, Kris Capao, Nathaniel Oco, R. Roxas, M. J. Sabellano, Brandie Nonnecke, Shrestha Mohanty, Camille Crittenden, Ken Goldberg

{"title":"Qualitative data analysis of disaster risk reduction suggestions assisted by topic modeling and word2vec","authors":"Ken Gorro, J. Ancheta, Kris Capao, Nathaniel Oco, R. Roxas, M. J. Sabellano, Brandie Nonnecke, Shrestha Mohanty, Camille Crittenden, Ken Goldberg","doi":"10.1109/IALP.2017.8300601","DOIUrl":"https://doi.org/10.1109/IALP.2017.8300601","url":null,"abstract":"In this study, we examine suggestions for disaster risk reduction strategies provided by residents in selected disaster-prone areas in the Philippines. The study utilizes 976 suggestions on how their barangay can help them better prepare for a disaster. These were collected through Malasakit, an e-participation platform designed by University of California, Berkeley and National University (Philippines) to engage community participation in gathering qualitative and quantitative data. Analyses were conducted through biterm topic modeling (BTM) and word embedding using gensim. For better accuracy, data preprocessing was performed to remove irrelevant or noisy data. Based on the BTM result, we identified the following important codes: preparedness, disaster, awareness, community, help, seminars, kanal (canal), linisin (clean), drainage, garbage, basura (garbage). Analyses of the topic models show that disaster preparedness is an integral part in disaster risk reduction by improving solid waste management, providing seminars for public awareness and evacuation preparation. A word intrusion test was conducted where BTM scored 55.71% which implies strong cohesion of the words with their topics. For word embedding, we drilled down on the following words: community, preparedness, emergency, barangay (village), help, kanal (drainage), basura (garbage), awareness, seminars, information. The word2vec results has a cosine similarity score of 0.902 which implies strong relatedness of each word. The result shows that the participants give importance to community preparedness for emergency, helping the barangay in clean-up drive, and awareness through seminars and information dissemination.","PeriodicalId":183586,"journal":{"name":"2017 International Conference on Asian Language Processing (IALP)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134143855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 18