Thai Dependency Parsing with Character Embedding
Sattaya Singkul, K. Woraratpanya
2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE), pp. 1-5. Published 2019-10-01.
DOI: 10.1109/ICITEED.2019.8930002 (https://doi.org/10.1109/ICITEED.2019.8930002)
Dependency parsing (DP) has become an important part of natural language processing (NLP) applications. However, most DP methods have been developed for English rather than for Thai. In addition, existing DP methods have not yet solved the problems posed by long and complex sentences. Therefore, this paper proposes seven Thai DP algorithms: five were developed from transition-based parsing and the other two from graph-based parsing. On the Thai-PUD and English-PUD datasets, both of which contain long and complex sentences, the experimental results showed that all Thai DP algorithms combined with character embedding outperformed the baselines.