{"title":"基于集成的学术文章语境分类模型:一种推荐系统的路径","authors":"Md. Abdullah-Al-Kafi, S. Banshal, N. Sultana","doi":"10.55835/64417860148a6e8ce52b6bd2","DOIUrl":null,"url":null,"abstract":"Articles from multiple research areas tend to use similar words as keywords and titles. On the other hand, selecting a publishing source can be problematic for authors in the initial stage of a scholarly study. So, context-based classification is important where the abstracts, keywords, and titles get similar attention. The aim of this study is to create a tool that uses ensemble learning to classify scholarly articles and recommends sources. This study uses 38 classes for the Web of Science dataset and 40 Classes for the Dimension dataset without grouping them. In all experiment setups models using abstracts achieved the best result as giving a more contextual understanding. Based on this ensemble-based approach, a recommender system has been outlined to recommend probable sources for a given article based on Title, Keywords, and Abstract.","PeriodicalId":334841,"journal":{"name":"27th International Conference on Science, Technology and Innovation Indicators (STI 2023)","volume":"300 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Ensemble-Based Model to Classify Scholarly Articles on Context: A Path to Recommender System\",\"authors\":\"Md. Abdullah-Al-Kafi, S. Banshal, N. Sultana\",\"doi\":\"10.55835/64417860148a6e8ce52b6bd2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Articles from multiple research areas tend to use similar words as keywords and titles. On the other hand, selecting a publishing source can be problematic for authors in the initial stage of a scholarly study. So, context-based classification is important where the abstracts, keywords, and titles get similar attention. The aim of this study is to create a tool that uses ensemble learning to classify scholarly articles and recommends sources. This study uses 38 classes for the Web of Science dataset and 40 Classes for the Dimension dataset without grouping them. In all experiment setups models using abstracts achieved the best result as giving a more contextual understanding. Based on this ensemble-based approach, a recommender system has been outlined to recommend probable sources for a given article based on Title, Keywords, and Abstract.\",\"PeriodicalId\":334841,\"journal\":{\"name\":\"27th International Conference on Science, Technology and Innovation Indicators (STI 2023)\",\"volume\":\"300 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"27th International Conference on Science, Technology and Innovation Indicators (STI 2023)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.55835/64417860148a6e8ce52b6bd2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"27th International Conference on Science, Technology and Innovation Indicators (STI 2023)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55835/64417860148a6e8ce52b6bd2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
来自多个研究领域的文章倾向于使用类似的关键词和标题。另一方面,在学术研究的初始阶段,选择出版来源对作者来说可能是一个问题。因此,在摘要、关键字和标题得到类似关注的情况下,基于上下文的分类非常重要。本研究的目的是创建一个使用集成学习对学术文章进行分类和推荐来源的工具。本研究对Web of Science数据集使用了38个类,对Dimension数据集使用了40个类,没有对它们进行分组。在所有的实验设置中,使用摘要的模型获得了最好的结果,因为它提供了更多的上下文理解。基于这种基于集成的方法,已经概述了一个推荐系统,根据标题,关键词和摘要为给定文章推荐可能的来源。
An Ensemble-Based Model to Classify Scholarly Articles on Context: A Path to Recommender System
Articles from multiple research areas tend to use similar words as keywords and titles. On the other hand, selecting a publishing source can be problematic for authors in the initial stage of a scholarly study. So, context-based classification is important where the abstracts, keywords, and titles get similar attention. The aim of this study is to create a tool that uses ensemble learning to classify scholarly articles and recommends sources. This study uses 38 classes for the Web of Science dataset and 40 Classes for the Dimension dataset without grouping them. In all experiment setups models using abstracts achieved the best result as giving a more contextual understanding. Based on this ensemble-based approach, a recommender system has been outlined to recommend probable sources for a given article based on Title, Keywords, and Abstract.