{"title":"印度尼西亚总统大选中公众知识决策树标准的比较","authors":"Siti Masripah, Lestari Yusuf","doi":"10.33480/inti.v18i2.5065","DOIUrl":null,"url":null,"abstract":"In the Presidential election, gen z as a new voter, must know in advance who the presidential candidate will be in the 2024 election as well as the election process, because if voters do not know and do not understand, it will cause the wrong choice will result in their votes cannot be used even to abstain. What factors cause milennials and gen z generations to not know about elections can be determined using a decision tree. Therefore, in this study, a questionnaire was given to millennials and gen z generation to find out whether voters know the presidential candidate to be elected. The data from the questionnaire is processed to become training data and testing data with a ratio of 70:30. Then measure the accuracy level using the C4.5 algorithm with a comparison of splitting criteria, namely gain ratio, information gain and gini index. By knowing the right splitting criteria, the decision tree model can help overcome the problem of overfitting in the data. Overfitting occurs when the model is too complex in memorizing training data, thus failing to generalize well to read new data. The calculation results show the difference in accuracy values between Gain ratio, Information gain and Gini index, namely 81.67%, 83.33% and 83.33%. It can be concluded that for the use of Algorithm C4.5 splitting criteria Gain ratio and Gini index have the same accuracy value for accuracy measurement in this study.","PeriodicalId":197142,"journal":{"name":"INTI Nusa Mandiri","volume":"18 18","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PERBANDINGAN KRITERIA DECISION TREE PADA PENGETAHUAN MASYARAKAT PADA PEMILIHAN UMUM PRESIDEN INDONESIA\",\"authors\":\"Siti Masripah, Lestari Yusuf\",\"doi\":\"10.33480/inti.v18i2.5065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the Presidential election, gen z as a new voter, must know in advance who the presidential candidate will be in the 2024 election as well as the election process, because if voters do not know and do not understand, it will cause the wrong choice will result in their votes cannot be used even to abstain. What factors cause milennials and gen z generations to not know about elections can be determined using a decision tree. Therefore, in this study, a questionnaire was given to millennials and gen z generation to find out whether voters know the presidential candidate to be elected. The data from the questionnaire is processed to become training data and testing data with a ratio of 70:30. Then measure the accuracy level using the C4.5 algorithm with a comparison of splitting criteria, namely gain ratio, information gain and gini index. By knowing the right splitting criteria, the decision tree model can help overcome the problem of overfitting in the data. Overfitting occurs when the model is too complex in memorizing training data, thus failing to generalize well to read new data. The calculation results show the difference in accuracy values between Gain ratio, Information gain and Gini index, namely 81.67%, 83.33% and 83.33%. It can be concluded that for the use of Algorithm C4.5 splitting criteria Gain ratio and Gini index have the same accuracy value for accuracy measurement in this study.\",\"PeriodicalId\":197142,\"journal\":{\"name\":\"INTI Nusa Mandiri\",\"volume\":\"18 18\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"INTI Nusa Mandiri\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33480/inti.v18i2.5065\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"INTI Nusa Mandiri","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33480/inti.v18i2.5065","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在总统大选中,z 世代作为新选民,必须提前了解 2024 年大选的总统候选人是谁以及选举过程,因为如果选民不知道、不了解,就会造成错误的选择,导致自己的选票无法使用甚至弃权。导致千禧一代和 Z 世代不了解选举的因素可以通过决策树来确定。因此,本研究向千禧一代和 z 世代发放了调查问卷,以了解选民是否了解将要选举的总统候选人。问卷中的数据经过处理后成为训练数据和测试数据,两者的比例为 70:30。然后使用 C4.5 算法,通过比较拆分标准(即增益比、信息增益和基尼指数)来衡量准确率水平。 通过了解正确的拆分标准,决策树模型可以帮助克服数据中的过拟合问题。当模型在记忆训练数据时过于复杂,从而不能很好地泛化到读取新数据时,就会出现过拟合。计算结果表明,增益比、信息增益和基尼指数的准确度值存在差异,分别为 81.67%、83.33% 和 83.33%。由此可以得出结论,在本研究中,使用 C4.5 算法分割标准时,增益比和基尼指数具有相同的准确度测量值。
PERBANDINGAN KRITERIA DECISION TREE PADA PENGETAHUAN MASYARAKAT PADA PEMILIHAN UMUM PRESIDEN INDONESIA
In the Presidential election, gen z as a new voter, must know in advance who the presidential candidate will be in the 2024 election as well as the election process, because if voters do not know and do not understand, it will cause the wrong choice will result in their votes cannot be used even to abstain. What factors cause milennials and gen z generations to not know about elections can be determined using a decision tree. Therefore, in this study, a questionnaire was given to millennials and gen z generation to find out whether voters know the presidential candidate to be elected. The data from the questionnaire is processed to become training data and testing data with a ratio of 70:30. Then measure the accuracy level using the C4.5 algorithm with a comparison of splitting criteria, namely gain ratio, information gain and gini index. By knowing the right splitting criteria, the decision tree model can help overcome the problem of overfitting in the data. Overfitting occurs when the model is too complex in memorizing training data, thus failing to generalize well to read new data. The calculation results show the difference in accuracy values between Gain ratio, Information gain and Gini index, namely 81.67%, 83.33% and 83.33%. It can be concluded that for the use of Algorithm C4.5 splitting criteria Gain ratio and Gini index have the same accuracy value for accuracy measurement in this study.