Plan Optimization for Creating Bilingual Dictionaries of Low-Resource Languages

Arbi Haza Nasution, Yohei Murakami, T. Ishida
{"title":"Plan Optimization for Creating Bilingual Dictionaries of Low-Resource Languages","authors":"Arbi Haza Nasution, Yohei Murakami, T. Ishida","doi":"10.1109/Culture.and.Computing.2017.21","DOIUrl":null,"url":null,"abstract":"The constraint-based approach has been proven useful for inducing bilingual lexicons for closely-related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by bilingual language experts if there are no available machine-readable dictionaries are available as input. To overcome the difficulty in planning the creation of bilingual dictionaries, the consideration of various methods and costs, plan optimization is essential. We adopt the Markov Decision Process (MDP) in formalizing plan optimization for creating bilingual dictionaries; the goal is to better predict the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual dictionary induction framework. We define heuristics based on input language characteristics to devise a baseline plan for evaluating our MDP-based approach with total cost as an evaluation metric. The MDP-based proposal outperformed heuristic planning on total cost for all datasets examined.","PeriodicalId":244911,"journal":{"name":"2017 International Conference on Culture and Computing (Culture and Computing)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Culture and Computing (Culture and Computing)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Culture.and.Computing.2017.21","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

The constraint-based approach has been proven useful for inducing bilingual lexicons for closely-related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by bilingual language experts if there are no available machine-readable dictionaries are available as input. To overcome the difficulty in planning the creation of bilingual dictionaries, the consideration of various methods and costs, plan optimization is essential. We adopt the Markov Decision Process (MDP) in formalizing plan optimization for creating bilingual dictionaries; the goal is to better predict the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual dictionary induction framework. We define heuristics based on input language characteristics to devise a baseline plan for evaluating our MDP-based approach with total cost as an evaluation metric. The MDP-based proposal outperformed heuristic planning on total cost for all datasets examined.
低资源语言双语词典创建方案优化
基于约束的方法已被证明可用于为密切相关的低资源语言生成双语词典。当我们想要创建多个连接多种语言的双语字典时,如果没有可用的机器可读字典作为输入,我们需要考虑由双语语言专家手动创建。要克服规划创建双语词典的困难,综合考虑各种方法和成本,优化规划是必不可少的。我们采用马尔可夫决策过程(MDP)来形式化双语词典创建的计划优化;目标是在完全实现基于约束的双语词典归纳框架之前,以最小的总成本更好地预测最可行的最优方案。我们定义了基于输入语言特征的启发式方法,以设计一个基线计划,以总成本作为评估指标来评估我们基于mdp的方法。基于mdp的建议在所有被检查的数据集的总成本上优于启发式规划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信