Two Decades of Rheumatology Research (2000-2023): A Dynamic Topic Modeling Perspective

Alfredo Madrid-García, Dalifer Freites-Núñez, Luis Rodríguez-Rodríguez
{"title":"Two Decades of Rheumatology Research (2000-2023): A Dynamic Topic Modeling Perspective","authors":"Alfredo Madrid-García, Dalifer Freites-Núñez, Luis Rodríguez-Rodríguez","doi":"10.1101/2024.06.06.24308533","DOIUrl":null,"url":null,"abstract":"Background: Rheumatology has experience notably changes in last decades. New drugs, including biologic agents and janus kinase inhibitors, have bloosom. Concepts such as window of opportunity, arthralgia suspicious for progression, or difficult-to-treat rheumatoid arthritis have appeared; and new management approaches and strategies such as treat-to-target have become popular. Statistical learning methods, gene therapy, telemedicine or precision medicine are other advancements that have gained relevance in the field. To better characterise the research landscape and advances in rheumatology, automatic and efficient approaches based on natural language processing should be used. The objective of this study is to use topic modeling techniques to uncover key topics and trends in the rheumatology research conducted in the last 23 years. Methods: This study analysed 96,004 abstracts published between 2000 and December 31, 2023, drawn from 34 specialised rheumatology journals obtained from PubMed. BERTopic, a novel topic modeling approach that considers semantic relationships among words and their context, was used to uncover topics. Up to 30 different models were trained. Based on the number of topics, outliers and topic coherence score, two of them were finally selected, and the topics manually labeled by two rheumatologists. Word clouds and hierarchical clustering visualizations were computed. Finally, hot and cold trends were identified using linear regression models. Results: Abstracts were classified into 45 and 47 topics. The most frequent topics were rheumatoid arthritis, systemic lupus erythematosus and osteoarthritis. Expected topics such as COVID-19 or JAK inhibitors were identified after conducting the dynamic topic modeling. Topics such as spinal surgery or bone fractures have gained relevance in last years, however, antiphospholipid syndrome, or septic arthritis have lost momentum. Conclusions: Our study utilized advanced natural language processing techniques to analyse the rheumatology research landscape, and identify key themes and emerging trends. The results highlight the dynamic and varied nature of rheumatology research, illustrating how interest in certain topics have shifted over time.","PeriodicalId":18505,"journal":{"name":"medRxiv","volume":" 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.06.06.24308533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Rheumatology has experience notably changes in last decades. New drugs, including biologic agents and janus kinase inhibitors, have bloosom. Concepts such as window of opportunity, arthralgia suspicious for progression, or difficult-to-treat rheumatoid arthritis have appeared; and new management approaches and strategies such as treat-to-target have become popular. Statistical learning methods, gene therapy, telemedicine or precision medicine are other advancements that have gained relevance in the field. To better characterise the research landscape and advances in rheumatology, automatic and efficient approaches based on natural language processing should be used. The objective of this study is to use topic modeling techniques to uncover key topics and trends in the rheumatology research conducted in the last 23 years. Methods: This study analysed 96,004 abstracts published between 2000 and December 31, 2023, drawn from 34 specialised rheumatology journals obtained from PubMed. BERTopic, a novel topic modeling approach that considers semantic relationships among words and their context, was used to uncover topics. Up to 30 different models were trained. Based on the number of topics, outliers and topic coherence score, two of them were finally selected, and the topics manually labeled by two rheumatologists. Word clouds and hierarchical clustering visualizations were computed. Finally, hot and cold trends were identified using linear regression models. Results: Abstracts were classified into 45 and 47 topics. The most frequent topics were rheumatoid arthritis, systemic lupus erythematosus and osteoarthritis. Expected topics such as COVID-19 or JAK inhibitors were identified after conducting the dynamic topic modeling. Topics such as spinal surgery or bone fractures have gained relevance in last years, however, antiphospholipid syndrome, or septic arthritis have lost momentum. Conclusions: Our study utilized advanced natural language processing techniques to analyse the rheumatology research landscape, and identify key themes and emerging trends. The results highlight the dynamic and varied nature of rheumatology research, illustrating how interest in certain topics have shifted over time.
风湿病学研究的二十年(2000-2023 年):动态主题建模视角
背景:风湿病学在过去几十年中经历了显著的变化。包括生物制剂和破伤风激酶抑制剂在内的新药层出不穷。机会之窗、疑似进展性关节痛或难以治疗的类风湿关节炎等概念已经出现;靶向治疗等新的管理方法和策略也开始流行。统计学习方法、基因治疗、远程医疗或精准医疗是该领域取得的其他进展。为了更好地描述风湿病学的研究状况和进展,应使用基于自然语言处理的自动高效方法。本研究的目的是利用主题建模技术来揭示过去 23 年中风湿病学研究的关键主题和趋势。研究方法本研究分析了2000年至2023年12月31日期间发表的96004篇摘要,这些摘要来自PubMed上的34种风湿病学专业期刊。BERTopic 是一种新颖的主题建模方法,它考虑了词语及其上下文之间的语义关系,用于发现主题。训练了多达 30 个不同的模型。根据主题数量、离群值和主题一致性得分,最终选出其中两个,并由两位风湿病学家对主题进行人工标注。计算了词云和分层聚类可视化。最后,利用线性回归模型确定了冷热趋势。结果摘要分为 45 个主题和 47 个主题。最常见的主题是类风湿性关节炎、系统性红斑狼疮和骨关节炎。在进行动态主题建模后,确定了 COVID-19 或 JAK 抑制剂等预期主题。脊柱手术或骨折等话题在过去几年中的相关性有所提高,但抗磷脂综合征或化脓性关节炎的相关性有所下降。结论我们的研究利用先进的自然语言处理技术分析了风湿病学的研究现状,并确定了关键主题和新兴趋势。研究结果凸显了风湿病学研究的动态性和多样性,说明了人们对某些主题的兴趣是如何随着时间的推移而变化的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信