缅甸语冠词分类

Myat Sapal Phyu, K. Nwet
{"title":"缅甸语冠词分类","authors":"Myat Sapal Phyu, K. Nwet","doi":"10.1109/AITC.2019.8920927","DOIUrl":null,"url":null,"abstract":"Article classification is a problem of text classification to assign the articles to their corresponding class or topic. According to this work, there are two main barriers to classify Myanmar text in deep learning model, to find the proper way of determining the word boundaries and to build the datasets for Myanmar text classification. This paper shows the empirical evidence on article classification in Myanmar language for both syllable-level and word-level by fine-tuning Convolutional Neural Networks. They are denoted as Syllable-Level Convolutional Neural Networks (SL-CNN) and Word-Level Convolutional Neural Networks (WL-CNN). Although there are few publicly available general-purpose pre-trained vectors for Myanmar language that can be further applied to transfer learning, it is still needed to construct large-scale datasets for classifying Myanmar articles. We construct six datasets to classify Myanmar articles and evaluation is measured by the comparative analysis of these vectors on SL-CNN and WL-CNN with Recurrent Neural Networks for both syllable and word level, SL-RNN and WL-RNN.","PeriodicalId":388642,"journal":{"name":"2019 International Conference on Advanced Information Technologies (ICAIT)","volume":"283 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Articles Classification in Myanmar Language\",\"authors\":\"Myat Sapal Phyu, K. Nwet\",\"doi\":\"10.1109/AITC.2019.8920927\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Article classification is a problem of text classification to assign the articles to their corresponding class or topic. According to this work, there are two main barriers to classify Myanmar text in deep learning model, to find the proper way of determining the word boundaries and to build the datasets for Myanmar text classification. This paper shows the empirical evidence on article classification in Myanmar language for both syllable-level and word-level by fine-tuning Convolutional Neural Networks. They are denoted as Syllable-Level Convolutional Neural Networks (SL-CNN) and Word-Level Convolutional Neural Networks (WL-CNN). Although there are few publicly available general-purpose pre-trained vectors for Myanmar language that can be further applied to transfer learning, it is still needed to construct large-scale datasets for classifying Myanmar articles. We construct six datasets to classify Myanmar articles and evaluation is measured by the comparative analysis of these vectors on SL-CNN and WL-CNN with Recurrent Neural Networks for both syllable and word level, SL-RNN and WL-RNN.\",\"PeriodicalId\":388642,\"journal\":{\"name\":\"2019 International Conference on Advanced Information Technologies (ICAIT)\",\"volume\":\"283 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advanced Information Technologies (ICAIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AITC.2019.8920927\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advanced Information Technologies (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AITC.2019.8920927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

文章分类是将文章分配到相应的类或主题的文本分类问题。根据这项工作,在深度学习模型中对缅甸语文本进行分类存在两个主要障碍,即找到合适的词边界确定方法和构建缅甸语文本分类数据集。本文展示了用微调卷积神经网络对缅甸语在音节级和词级两个层次上的冠词分类的经验证据。分别表示为音节级卷积神经网络(SL-CNN)和词级卷积神经网络(WL-CNN)。虽然很少有公开可用的通用预训练缅甸语向量,可以进一步应用于迁移学习,但仍然需要构建大规模的数据集来对缅甸语文章进行分类。我们构建了6个数据集对缅甸文章进行分类,并通过使用递归神经网络在SL-CNN和WL-CNN上对这些向量进行音节和单词水平的比较分析,以及SL-RNN和WL-RNN的评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Articles Classification in Myanmar Language
Article classification is a problem of text classification to assign the articles to their corresponding class or topic. According to this work, there are two main barriers to classify Myanmar text in deep learning model, to find the proper way of determining the word boundaries and to build the datasets for Myanmar text classification. This paper shows the empirical evidence on article classification in Myanmar language for both syllable-level and word-level by fine-tuning Convolutional Neural Networks. They are denoted as Syllable-Level Convolutional Neural Networks (SL-CNN) and Word-Level Convolutional Neural Networks (WL-CNN). Although there are few publicly available general-purpose pre-trained vectors for Myanmar language that can be further applied to transfer learning, it is still needed to construct large-scale datasets for classifying Myanmar articles. We construct six datasets to classify Myanmar articles and evaluation is measured by the comparative analysis of these vectors on SL-CNN and WL-CNN with Recurrent Neural Networks for both syllable and word level, SL-RNN and WL-RNN.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信