{"title":"利用知识图谱和大型语言模型生成并由人类专家评估有趣的研究想法","authors":"Xuemei Gu, Mario Krenn","doi":"arxiv-2405.17044","DOIUrl":null,"url":null,"abstract":"Advanced artificial intelligence (AI) systems with access to millions of\nresearch papers could inspire new research ideas that may not be conceived by\nhumans alone. However, how interesting are these AI-generated ideas, and how\ncan we improve their quality? Here, we introduce SciMuse, a system that uses an\nevolving knowledge graph built from more than 58 million scientific papers to\ngenerate personalized research ideas via an interface to GPT-4. We conducted a\nlarge-scale human evaluation with over 100 research group leaders from the Max\nPlanck Society, who ranked more than 4,000 personalized research ideas based on\ntheir level of interest. This evaluation allows us to understand the\nrelationships between scientific interest and the core properties of the\nknowledge graph. We find that data-efficient machine learning can predict\nresearch interest with high precision, allowing us to optimize the\ninterest-level of generated research ideas. This work represents a step towards\nan artificial scientific muse that could catalyze unforeseen collaborations and\nsuggest interesting avenues for scientists.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"15 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models\",\"authors\":\"Xuemei Gu, Mario Krenn\",\"doi\":\"arxiv-2405.17044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Advanced artificial intelligence (AI) systems with access to millions of\\nresearch papers could inspire new research ideas that may not be conceived by\\nhumans alone. However, how interesting are these AI-generated ideas, and how\\ncan we improve their quality? Here, we introduce SciMuse, a system that uses an\\nevolving knowledge graph built from more than 58 million scientific papers to\\ngenerate personalized research ideas via an interface to GPT-4. We conducted a\\nlarge-scale human evaluation with over 100 research group leaders from the Max\\nPlanck Society, who ranked more than 4,000 personalized research ideas based on\\ntheir level of interest. This evaluation allows us to understand the\\nrelationships between scientific interest and the core properties of the\\nknowledge graph. We find that data-efficient machine learning can predict\\nresearch interest with high precision, allowing us to optimize the\\ninterest-level of generated research ideas. This work represents a step towards\\nan artificial scientific muse that could catalyze unforeseen collaborations and\\nsuggest interesting avenues for scientists.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"15 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.17044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.17044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models
Advanced artificial intelligence (AI) systems with access to millions of
research papers could inspire new research ideas that may not be conceived by
humans alone. However, how interesting are these AI-generated ideas, and how
can we improve their quality? Here, we introduce SciMuse, a system that uses an
evolving knowledge graph built from more than 58 million scientific papers to
generate personalized research ideas via an interface to GPT-4. We conducted a
large-scale human evaluation with over 100 research group leaders from the Max
Planck Society, who ranked more than 4,000 personalized research ideas based on
their level of interest. This evaluation allows us to understand the
relationships between scientific interest and the core properties of the
knowledge graph. We find that data-efficient machine learning can predict
research interest with high precision, allowing us to optimize the
interest-level of generated research ideas. This work represents a step towards
an artificial scientific muse that could catalyze unforeseen collaborations and
suggest interesting avenues for scientists.