用英语作为语际语言来匹配不同来源的泰国职业描述

Vorapon Luantangsrisuk, Rattapoom Kedtiwerasak, Kanchana Saengthongpattana, T. Ruangrajitpakorn
{"title":"用英语作为语际语言来匹配不同来源的泰国职业描述","authors":"Vorapon Luantangsrisuk, Rattapoom Kedtiwerasak, Kanchana Saengthongpattana, T. Ruangrajitpakorn","doi":"10.1109/ICCI57424.2023.10112230","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to improve text matching for occupation descriptions from different standards in Thailand. To solve ambiguity from technical compound words from standards, we apply machine translation, and similarity calculation for matching occupation and profession from sources that focused on different aspects. Thus, despite mentioning on same occupation, the descriptions from different standards share very little terms for matching the job across standards. By translating terms from Thai to English, we solve the issue of synonym technical terms varily used in different sources to improve similarity scores. Furthermore, tree structure of the standards is considered to assist on limiting search space. From experiments, the results of the baseline is very low, and the proposed method averagely exceed the result from the baseline for more than 6 percents for top-3 matching and 5 percents for top-5 matching in terms of accuracy. The use of both Thai and English translated job descriptions for matching using similar terms can noticeably increase a capability of matching jobs that cannot be found using only original Thai job description by adding more features of English-translated terms.","PeriodicalId":112409,"journal":{"name":"2023 IEEE International Conference on Cybernetics and Innovations (ICCI)","volume":"12366 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Matching Thai Profession Descriptions from Different Sources Using English as Interlingual Language\",\"authors\":\"Vorapon Luantangsrisuk, Rattapoom Kedtiwerasak, Kanchana Saengthongpattana, T. Ruangrajitpakorn\",\"doi\":\"10.1109/ICCI57424.2023.10112230\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a method to improve text matching for occupation descriptions from different standards in Thailand. To solve ambiguity from technical compound words from standards, we apply machine translation, and similarity calculation for matching occupation and profession from sources that focused on different aspects. Thus, despite mentioning on same occupation, the descriptions from different standards share very little terms for matching the job across standards. By translating terms from Thai to English, we solve the issue of synonym technical terms varily used in different sources to improve similarity scores. Furthermore, tree structure of the standards is considered to assist on limiting search space. From experiments, the results of the baseline is very low, and the proposed method averagely exceed the result from the baseline for more than 6 percents for top-3 matching and 5 percents for top-5 matching in terms of accuracy. The use of both Thai and English translated job descriptions for matching using similar terms can noticeably increase a capability of matching jobs that cannot be found using only original Thai job description by adding more features of English-translated terms.\",\"PeriodicalId\":112409,\"journal\":{\"name\":\"2023 IEEE International Conference on Cybernetics and Innovations (ICCI)\",\"volume\":\"12366 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Cybernetics and Innovations (ICCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCI57424.2023.10112230\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Cybernetics and Innovations (ICCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCI57424.2023.10112230","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种改进泰国不同标准职业描述文本匹配的方法。为了解决标准中技术复合词的歧义,我们采用机器翻译和相似度计算来匹配来自不同方面的来源的职业和专业。因此,尽管提到了相同的职业,但来自不同标准的描述在跨标准匹配工作方面共享的术语很少。通过将术语从泰语翻译成英语,我们解决了同义词技术术语在不同来源中使用不同的问题,以提高相似度得分。此外,考虑到标准的树状结构有助于限制搜索空间。从实验结果来看,基线结果很低,所提方法的准确率平均超过基线结果,top-3匹配的准确率超过6%以上,top-5匹配的准确率平均超过5%。使用泰语和英语翻译的职位描述来匹配使用相似术语的职位,可以通过添加更多英语翻译术语的特征,显著提高仅使用原始泰语职位描述无法找到的职位匹配能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Matching Thai Profession Descriptions from Different Sources Using English as Interlingual Language
This paper proposes a method to improve text matching for occupation descriptions from different standards in Thailand. To solve ambiguity from technical compound words from standards, we apply machine translation, and similarity calculation for matching occupation and profession from sources that focused on different aspects. Thus, despite mentioning on same occupation, the descriptions from different standards share very little terms for matching the job across standards. By translating terms from Thai to English, we solve the issue of synonym technical terms varily used in different sources to improve similarity scores. Furthermore, tree structure of the standards is considered to assist on limiting search space. From experiments, the results of the baseline is very low, and the proposed method averagely exceed the result from the baseline for more than 6 percents for top-3 matching and 5 percents for top-5 matching in terms of accuracy. The use of both Thai and English translated job descriptions for matching using similar terms can noticeably increase a capability of matching jobs that cannot be found using only original Thai job description by adding more features of English-translated terms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信