Comparing Machine Learning and Human Judge in SATU Indonesia Awarding Processes

Onno W. Purbo
{"title":"Comparing Machine Learning and Human Judge in SATU Indonesia Awarding Processes","authors":"Onno W. Purbo","doi":"10.26555/jiteki.v7i3.22201","DOIUrl":null,"url":null,"abstract":"Received November 09, 2021 Revised November 25, 2021 Accepted December 23, 2021 For more than ten years, SATU Indonesia Awards, with PT. Astra International Tbk's support is given to inspiring young Indonesians. Every year, more than 10,000 nominations must be short-listed to 90 nominations within one week with five (5) assessment parameters. The research contributions are (1) creating a machine learning mechanism for the awarding process from ten years of the SATU Indonesia Awards nomination archive, (2) creating two (2) models of training data for the five (5) assessed parameters, namely motivation, obstacle, outcome, outreach, and sustainability, and (3) compare machine learning prediction with 2021 judge's assessment. TEMPO Data and Analysis Center (PDAT) extracts the corpus training data from ten years' SATU Indonesia Awards data in six months. The corpus training data contains nomination texts with Judges' scores on motivation, obstacle, outcome, outreach, and sustainability. Two (2) corpus training data and two models were generated with, namely, (1) the average Judges' parameter value per instance and (2) the Judges' smallest value and stored in two (2) corpus of 1220 instances each. The classification model was generated by Random Forest, which has the slightest error among the classification algorithms tested. The first model aims to predict the nomination assessment parameters. The second model is to detect the outlier in the incoming nominees for extraordinary nominees. The machine learning predictions were compared and found to be similar to the 2021 judge's assessment in the awarding processes at SATU Indonesia Awards. The average Judges' pre-final 2021 nominees' scores are compared to the Random Forest's predictions and found to be reasonably similar, with a small RMSE error around 1.1 to 1.6 for all assessment parameters. The smallest RMSE was obtained in the Sustainability parameter. The Obstacle parameter was found to have the largest RMSE.","PeriodicalId":244902,"journal":{"name":"Jurnal Ilmiah Teknik Elektro Komputer dan Informatika","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Ilmiah Teknik Elektro Komputer dan Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26555/jiteki.v7i3.22201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Received November 09, 2021 Revised November 25, 2021 Accepted December 23, 2021 For more than ten years, SATU Indonesia Awards, with PT. Astra International Tbk's support is given to inspiring young Indonesians. Every year, more than 10,000 nominations must be short-listed to 90 nominations within one week with five (5) assessment parameters. The research contributions are (1) creating a machine learning mechanism for the awarding process from ten years of the SATU Indonesia Awards nomination archive, (2) creating two (2) models of training data for the five (5) assessed parameters, namely motivation, obstacle, outcome, outreach, and sustainability, and (3) compare machine learning prediction with 2021 judge's assessment. TEMPO Data and Analysis Center (PDAT) extracts the corpus training data from ten years' SATU Indonesia Awards data in six months. The corpus training data contains nomination texts with Judges' scores on motivation, obstacle, outcome, outreach, and sustainability. Two (2) corpus training data and two models were generated with, namely, (1) the average Judges' parameter value per instance and (2) the Judges' smallest value and stored in two (2) corpus of 1220 instances each. The classification model was generated by Random Forest, which has the slightest error among the classification algorithms tested. The first model aims to predict the nomination assessment parameters. The second model is to detect the outlier in the incoming nominees for extraordinary nominees. The machine learning predictions were compared and found to be similar to the 2021 judge's assessment in the awarding processes at SATU Indonesia Awards. The average Judges' pre-final 2021 nominees' scores are compared to the Random Forest's predictions and found to be reasonably similar, with a small RMSE error around 1.1 to 1.6 for all assessment parameters. The smallest RMSE was obtained in the Sustainability parameter. The Obstacle parameter was found to have the largest RMSE.
机器学习与人类裁判在SATU印尼评奖过程中的比较
十多年来,在PT. Astra International Tbk的支持下,SATU印度尼西亚奖一直给予鼓舞人心的印度尼西亚年轻人。每年,超过10,000个提名必须在一周内通过五(5)个评估参数入围90个提名。研究贡献是:(1)从十年的SATU印度尼西亚奖提名档案中为颁奖过程创建机器学习机制,(2)为五(5)个评估参数(即动机,障碍,结果,外展和可持续性)创建两(2)个训练数据模型,以及(3)将机器学习预测与2021年评委的评估进行比较。TEMPO数据与分析中心(PDAT)在6个月内从10年的SATU印尼奖数据中提取了语料库训练数据。语料库训练数据包含提名文本,评委在动机、障碍、结果、推广和可持续性方面的得分。生成两(2)个语料库训练数据和两个模型,分别为(1)每实例judge参数的平均值和(2)judge参数的最小值,存储在两(2)个语料库中,每个语料库1220个实例。该分类模型由随机森林算法生成,在所测试的分类算法中误差最小。第一个模型旨在预测提名评估参数。第二个模型是对即将被提名者进行异常值检测。对机器学习预测进行了比较,发现与2021年SATU印度尼西亚奖颁奖过程中评委的评估相似。评委们在2021年提名者决赛前的平均得分与随机森林的预测相比较,发现两者相当相似,所有评估参数的均方根误差在1.1到1.6之间。可持续性参数的均方根误差最小。障碍物参数被发现具有最大的RMSE。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信