Simplifying synthesis of the expanding glioblastoma literature: a topic modeling approach.

IF 4.3 3区 材料科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
ACS Applied Electronic Materials Pub Date : 2024-09-01 Epub Date: 2024-07-11 DOI:10.1007/s11060-024-04762-8
Mert Karabacak, Pemla Jagtiani, Alejandro Carrasquilla, Ankita Jain, Isabelle M Germano, Konstantinos Margetis
{"title":"Simplifying synthesis of the expanding glioblastoma literature: a topic modeling approach.","authors":"Mert Karabacak, Pemla Jagtiani, Alejandro Carrasquilla, Ankita Jain, Isabelle M Germano, Konstantinos Margetis","doi":"10.1007/s11060-024-04762-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Our study aims to discover the leading topics within glioblastoma (GB) research, and to examine if these topics have \"hot\" or \"cold\" trends. Additionally, we aim to showcase the potential of natural language processing (NLP) in facilitating research syntheses, offering an efficient strategy to dissect the landscape of academic literature in the realm of GB research.</p><p><strong>Methods: </strong>The Scopus database was queried using \"glioblastoma\" as the search term, in the \"TITLE\" and \"KEY\" fields. BERTopic, an NLP-based topic modeling (TM) method, was used for probabilistic TM. We specified a minimum topic size of 300 documents and 5% probability cutoff for outlier detection. We labeled topics based on keywords and representative documents and visualized them with word clouds. Linear regression models were utilized to identify \"hot\" and \"cold\" topic trends per decade.</p><p><strong>Results: </strong>Our TM analysis categorized 43,329 articles into 15 distinct topics. The most common topics were Genomics, Survival, Drug Delivery, and Imaging, while the least common topics were Surgical Resection, MGMT Methylation, and Exosomes. The hottest topics over the 2020s were Viruses and Oncolytic Therapy, Anticancer Compounds, and Exosomes, while the cold topics were Surgical Resection, Angiogenesis, and Tumor Metabolism.</p><p><strong>Conclusion: </strong>Our NLP methodology provided an extensive analysis of GB literature, revealing valuable insights about historical and contemporary patterns difficult to discern with traditional techniques. The outcomes offer guidance for research directions, policy, and identifying emerging trends. Our approach could be applied across research disciplines to summarize and examine scholarly literature, guiding future exploration.</p>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11060-024-04762-8","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: Our study aims to discover the leading topics within glioblastoma (GB) research, and to examine if these topics have "hot" or "cold" trends. Additionally, we aim to showcase the potential of natural language processing (NLP) in facilitating research syntheses, offering an efficient strategy to dissect the landscape of academic literature in the realm of GB research.

Methods: The Scopus database was queried using "glioblastoma" as the search term, in the "TITLE" and "KEY" fields. BERTopic, an NLP-based topic modeling (TM) method, was used for probabilistic TM. We specified a minimum topic size of 300 documents and 5% probability cutoff for outlier detection. We labeled topics based on keywords and representative documents and visualized them with word clouds. Linear regression models were utilized to identify "hot" and "cold" topic trends per decade.

Results: Our TM analysis categorized 43,329 articles into 15 distinct topics. The most common topics were Genomics, Survival, Drug Delivery, and Imaging, while the least common topics were Surgical Resection, MGMT Methylation, and Exosomes. The hottest topics over the 2020s were Viruses and Oncolytic Therapy, Anticancer Compounds, and Exosomes, while the cold topics were Surgical Resection, Angiogenesis, and Tumor Metabolism.

Conclusion: Our NLP methodology provided an extensive analysis of GB literature, revealing valuable insights about historical and contemporary patterns difficult to discern with traditional techniques. The outcomes offer guidance for research directions, policy, and identifying emerging trends. Our approach could be applied across research disciplines to summarize and examine scholarly literature, guiding future exploration.

Abstract Image

简化不断扩展的胶质母细胞瘤文献综述:一种主题建模方法。
目的:我们的研究旨在发现胶质母细胞瘤(GB)研究中的热门话题,并探讨这些话题是 "热门 "还是 "冷门"。此外,我们还旨在展示自然语言处理(NLP)在促进研究综述方面的潜力,为剖析 GB 研究领域的学术文献提供一种有效的策略:在 Scopus 数据库的 "标题 "和 "关键字 "字段中使用 "胶质母细胞瘤 "作为检索词进行查询。BERTopic 是一种基于 NLP 的主题建模(TM)方法,用于概率 TM。我们规定最小主题大小为 300 个文档,离群点检测的概率截止值为 5%。我们根据关键词和代表性文档对主题进行标注,并通过词云将其可视化。我们利用线性回归模型来确定每十年的 "热门 "和 "冷门 "话题趋势:我们的 TM 分析将 43,329 篇文章分为 15 个不同的主题。最常见的主题是基因组学、生存、给药和成像,而最不常见的主题是手术切除、MGMT 甲基化和外泌体。2020 年代最热门的话题是病毒和肿瘤溶解疗法、抗癌化合物和外泌体,而冷门话题是手术切除、血管生成和肿瘤代谢:我们的 NLP 方法对国标文献进行了广泛的分析,揭示了传统技术难以发现的历史和当代模式的宝贵见解。这些成果为研究方向、政策和识别新兴趋势提供了指导。我们的方法可以应用于各个研究学科,总结和研究学术文献,指导未来的探索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
4.30%
发文量
567
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信