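Accreditation Documents Profiling Based On The Used of English Terms Using Text mining Algorithm : A case study on 24 accreditation document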

Syahrul Iman, A. Wibawa, S. Sumpeno, Arbintoro Mas
Published in: 2022 International Seminar on Application for Technology of Information and Communication (iSemantic)
Publication date: 2022-09-17
DOI: 10.1109/iSemantic55962.2022.9920369 (https://doi.org/10.1109/iSemantic55962.2022.9920369)
Citations: 0

Abstract

Study program accreditation remains the standard for evaluating the feasibility and quality of the education process worldwide. Normally, a study program undergoing accreditation prepares a report document that represents its whole education process, especially from the standpoint of quality, and submits it to the national accreditation board. Self-assessment and evaluation are the keys to creating a proper accreditation document. In Indonesia, the accreditation process is run by the National Board of Higher Education Accreditation (BAN-PT). The study program must prepare and submit a document that represents the overall process of education, following the national standard and format determined by BAN-PT. In this study, we hypothesized that text mining can be used to evaluate an accreditation document by analyzing the English terms used in it. The analysis selects the proper English terms in an accreditation document and scores the document by their number. We further hypothesized that the higher the score a document obtains, the higher the national accreditation rank it was awarded. We analyzed 24 accreditation documents from similar study programs, each approximately 200 pages long. A text processing technique was implemented to filter unimportant words out of the documents. English phrases were detected in each sentence, English terms were extracted from those phrases, and further filtering was then performed to obtain the final set of English terms. The selected English terms were then scored for each accreditation document. Validation was done by mapping the scoring result to the accreditation rank obtained by each document. The results showed that the higher a document's English-term score, the higher its national accreditation rank. However, two of the 24 documents did not follow this pattern.
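
The pipeline described in the abstract (clean the text, detect English terms, score each document, compare scores with accreditation ranks) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say which dictionaries, filters, or scoring formula were used. The word lists `ENGLISH_VOCAB` and `STOPWORDS`, the sample sentences, the numeric rank encoding, and the pairwise agreement measure are all assumptions made for illustration, and the sketch works on single tokens rather than phrases.

```python
import re

# Hypothetical word lists for illustration only; the paper does not
# state which dictionary or filter was actually used.
ENGLISH_VOCAB = {"outcome", "based", "education", "tracer", "study", "roadmap", "the", "of"}
STOPWORDS = {"the", "of", "and", "in"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def english_terms(text):
    """Keep tokens found in the English vocabulary that are not stopwords."""
    return [t for t in tokenize(text) if t in ENGLISH_VOCAB and t not in STOPWORDS]


def score(text):
    """Score a document by its number of English term occurrences."""
    return len(english_terms(text))


def rank_agreement(scores, ranks):
    """Fraction of document pairs whose score order matches their rank order.

    Assumed encoding: a larger rank number means a better accreditation rank.
    Pairs tied on score or on rank are skipped.
    """
    agree = total = 0
    for i in range(len(scores)):
        for j in range(i + 1, len(scores)):
            ds = scores[i] - scores[j]
            dr = ranks[i] - ranks[j]
            if ds == 0 or dr == 0:
                continue
            total += 1
            agree += (ds > 0) == (dr > 0)
    return agree / total if total else 0.0


# Made-up sample sentences standing in for full accreditation documents.
docs = {
    "doc_a": "Kurikulum menerapkan outcome based education dan tracer study.",
    "doc_b": "Program studi menyusun roadmap penelitian.",
    "doc_c": "Dokumen ini disusun oleh program studi.",
}
ranks = [3, 2, 1]  # hypothetical numeric ranks for doc_a, doc_b, doc_c

scores = [score(text) for text in docs.values()]
print(scores)                         # English-term score per document
print(rank_agreement(scores, ranks))  # share of pairs ordered consistently
```

An agreement value below 1.0 would correspond to documents, like the two reported in the abstract, whose scores do not line up with their ranks.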