{"title":"Evaluation of Machine Learning to Early Detection of Highly Cited Papers","authors":"G. M. Binmakhashen, Hamdi A. Al-Jamimi","doi":"10.1109/CDMA54072.2022.00006","DOIUrl":null,"url":null,"abstract":"As one of the fastest-growing topics, machine learning has many applications that span through different domains including image and signal recognition, text mining, information retrieval, robotics, etc. It enables information extraction and analysis for better insights and decision-based systems. The Web of Science(WoS) citation database is a leading organization that provides citation data of high-quality published research. WoS has its metrics to label published articles as Highly Cited Paper(HCP). Machine learning (ML) can help researchers in identifying the key characteristics of HCP. Moreover, it can allow research evaluation units forecasting significant scientific articles. In other words, it may allow researchers and/or research evaluators to detect potential scientific breakthrough ideas and stay current. In this study, more than 26 thousand records of published articles indexed by WoS were analyzed. All the records are drawn from the Technology research area as defined by WoS. Four ML algorithms are evaluated to verify the HCP common factors influence in raising citations and interest in scientific articles. The ensemble algorithms show promising results to identify HCP articles using only four factors.","PeriodicalId":313042,"journal":{"name":"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDMA54072.2022.00006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
As one of the fastest-growing topics, machine learning has many applications that span through different domains including image and signal recognition, text mining, information retrieval, robotics, etc. It enables information extraction and analysis for better insights and decision-based systems. The Web of Science(WoS) citation database is a leading organization that provides citation data of high-quality published research. WoS has its metrics to label published articles as Highly Cited Paper(HCP). Machine learning (ML) can help researchers in identifying the key characteristics of HCP. Moreover, it can allow research evaluation units forecasting significant scientific articles. In other words, it may allow researchers and/or research evaluators to detect potential scientific breakthrough ideas and stay current. In this study, more than 26 thousand records of published articles indexed by WoS were analyzed. All the records are drawn from the Technology research area as defined by WoS. Four ML algorithms are evaluated to verify the HCP common factors influence in raising citations and interest in scientific articles. The ensemble algorithms show promising results to identify HCP articles using only four factors.
作为发展最快的课题之一,机器学习在图像和信号识别、文本挖掘、信息检索、机器人等不同领域有着广泛的应用。它支持信息提取和分析,以获得更好的见解和基于决策的系统。Web of Science(WoS)引文数据库是提供高质量已发表研究的引文数据的领先组织。WoS有自己的指标来将发表的文章标记为高被引论文(HCP)。机器学习(ML)可以帮助研究人员识别HCP的关键特征。此外,它可以让研究评价单位预测重要的科学文章。换句话说,它可以让研究人员和/或研究评估人员发现潜在的科学突破性想法,并保持最新状态。在这项研究中,我们分析了超过2.6万条由WoS索引的已发表文章记录。所有记录均取自WoS定义的技术研究区域。评估了四种机器学习算法,以验证HCP共同因素对提高科学文章的引用和兴趣的影响。集成算法显示了有希望的结果,识别HCP文章仅使用四个因素。