{"title":"Open research knowledge graph for structuring scholarly contributions using transformers","authors":"Mehboob Ali, Abdullah Malik, Maryam Bashir","doi":"10.1109/ICACS55311.2023.10089637","DOIUrl":null,"url":null,"abstract":"More research papers are being published now than have ever been at any point in history. It is becoming increasingly difficult for the researchers to keep up with the papers that are being published in even a very narrow domain. This study proposes to build an open research knowledge graph (ORKG) that shows the scholarly contributions of the published papers. The paper makes use of natural language processing techniques and state-of-the-art deep learning models to achieve this task. The system generates a knowledge graph after performing four main steps including sentence classification, phrase extraction, triple formation (and classification) and finally, knowledge graph generation. Different state-of-the-art deep learning models such as RoBERTa have been used for classification and phrase extraction tasks whereas triple formation was performed using different heuristics. Finally, a knowledge graph is generated through which an end-user can identify the scholarly contributions in scholarly article. Experimental results are compared against other systems and show encouraging results.","PeriodicalId":357522,"journal":{"name":"2023 4th International Conference on Advancements in Computational Sciences (ICACS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference on Advancements in Computational Sciences (ICACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACS55311.2023.10089637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
More research papers are being published now than have ever been at any point in history. It is becoming increasingly difficult for the researchers to keep up with the papers that are being published in even a very narrow domain. This study proposes to build an open research knowledge graph (ORKG) that shows the scholarly contributions of the published papers. The paper makes use of natural language processing techniques and state-of-the-art deep learning models to achieve this task. The system generates a knowledge graph after performing four main steps including sentence classification, phrase extraction, triple formation (and classification) and finally, knowledge graph generation. Different state-of-the-art deep learning models such as RoBERTa have been used for classification and phrase extraction tasks whereas triple formation was performed using different heuristics. Finally, a knowledge graph is generated through which an end-user can identify the scholarly contributions in scholarly article. Experimental results are compared against other systems and show encouraging results.