Utilizing Deep Learning for Detecting Adverse Drug Events in Structured and Unstructured Regulatory Drug Data Sets.

IF 4.5 Q2 PHARMACOLOGY & PHARMACY

Pharmaceutical Medicine Pub Date : 2022-10-01 Epub Date: 2022-07-24 DOI:10.1007/s40290-022-00434-y

Benjamin M Knisely, Qais Hatim, Monifa Vaughn-Cooke

{"title":"Utilizing Deep Learning for Detecting Adverse Drug Events in Structured and Unstructured Regulatory Drug Data Sets.","authors":"Benjamin M Knisely, Qais Hatim, Monifa Vaughn-Cooke","doi":"10.1007/s40290-022-00434-y","DOIUrl":null,"url":null,"abstract":"Background: The US Food and Drug Administration (FDA) collects and retains several data sets on post-market drugs and associated adverse events (AEs). The FDA Adverse Event Reporting System (FAERS) contains millions of AE reports submitted by the public when a medication is suspected to have caused an AE. The FDA monitors these reports to identify drug safety issues that were undetected during the premarket evaluation of these products. These reports contain patient narratives that provide information regarding the AE that needs to be coded using standardized terminology to enable aggregation of reports for further review. Additionally, the FDA collects structured drug product labels (SPLs) that facilitate standardized distribution of information regarding marketed medical products. Manufacturers are currently not required to code labels with associated AEs.Objectives: Approaches for automated classification of reports by preferred terminology could enhance regulatory efficiency. The goal of this work was to assess the suitability of manually annotated FDA FAERS and SPL data sets to be subjected to predictive modeling.Methods: A recurrent neural network (RNN) was proposed as a proof-of-concept model for automated extraction of preferred AE terminology. A separate RNN was fit and cross-validated on two regulatory data sets with varying properties. First, the researchers trained and cross-validated a model on 325 annotated FAERS patient narratives for a sample of AE terms. A model was then trained and validated on a data set of 100 SPLs.Results: Model cross-validation results for product labels demonstrated that the model performed at least as well as more conventional models for all but one of the terms selected based on F1-score. Model results for the FAERS data set were mixed.Conclusions: This work successfully demonstrated a proof-of-concept machine learning approach to automatically detect AEs in several textual regulatory data sets to support post-market regulatory activities. Limited instances of each AE class likely prohibited models from generalizing data effectively. Additional data may permit more robust validation.","PeriodicalId":19778,"journal":{"name":"Pharmaceutical Medicine","volume":"36 5","pages":"307-317"},"PeriodicalIF":4.5000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutical Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s40290-022-00434-y","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/7/24 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The US Food and Drug Administration (FDA) collects and retains several data sets on post-market drugs and associated adverse events (AEs). The FDA Adverse Event Reporting System (FAERS) contains millions of AE reports submitted by the public when a medication is suspected to have caused an AE. The FDA monitors these reports to identify drug safety issues that were undetected during the premarket evaluation of these products. These reports contain patient narratives that provide information regarding the AE that needs to be coded using standardized terminology to enable aggregation of reports for further review. Additionally, the FDA collects structured drug product labels (SPLs) that facilitate standardized distribution of information regarding marketed medical products. Manufacturers are currently not required to code labels with associated AEs.

Objectives: Approaches for automated classification of reports by preferred terminology could enhance regulatory efficiency. The goal of this work was to assess the suitability of manually annotated FDA FAERS and SPL data sets to be subjected to predictive modeling.

Methods: A recurrent neural network (RNN) was proposed as a proof-of-concept model for automated extraction of preferred AE terminology. A separate RNN was fit and cross-validated on two regulatory data sets with varying properties. First, the researchers trained and cross-validated a model on 325 annotated FAERS patient narratives for a sample of AE terms. A model was then trained and validated on a data set of 100 SPLs.

Results: Model cross-validation results for product labels demonstrated that the model performed at least as well as more conventional models for all but one of the terms selected based on F1-score. Model results for the FAERS data set were mixed.

Conclusions: This work successfully demonstrated a proof-of-concept machine learning approach to automatically detect AEs in several textual regulatory data sets to support post-market regulatory activities. Limited instances of each AE class likely prohibited models from generalizing data effectively. Additional data may permit more robust validation.

查看原文本刊更多论文

利用深度学习在结构化和非结构化监管药物数据集中检测药物不良事件。

背景:美国食品和药物管理局(FDA)收集并保留了一些上市后药物和相关不良事件(ae)的数据集。FDA不良事件报告系统(FAERS)包含数百万份公众在怀疑药物引起不良事件时提交的不良事件报告。FDA监督这些报告，以确定在这些产品上市前评估期间未发现的药物安全问题。这些报告包含患者叙述，提供有关AE的信息，需要使用标准化术语进行编码，以便汇总报告以供进一步审查。此外，FDA收集结构化药品标签(SPLs)，以促进有关已上市医疗产品信息的标准化分发。目前不要求制造商在标签上标注相关的AEs。目的:采用首选术语对报告进行自动分类的方法可以提高监管效率。这项工作的目的是评估人工注释的FDA FAERS和SPL数据集是否适合进行预测建模。方法:提出递归神经网络(RNN)作为自动提取首选声发射术语的概念验证模型。一个单独的RNN在两个具有不同属性的调节数据集上进行拟合和交叉验证。首先，研究人员训练并交叉验证了325例FAERS患者叙述的AE术语样本模型。然后在100个SPLs数据集上对模型进行训练和验证。结果:产品标签的模型交叉验证结果表明，该模型对基于f1得分选择的所有术语的表现至少与更传统的模型一样好。FAERS数据集的模型结果是混合的。结论:这项工作成功地证明了一种概念验证的机器学习方法，可以自动检测几个文本监管数据集中的ae，以支持上市后的监管活动。每个AE类的有限实例可能会阻止模型有效地泛化数据。额外的数据可能允许更可靠的验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Pharmaceutical Medicine PHARMACOLOGY & PHARMACY-

CiteScore

5.10

自引率

4.00%

发文量

期刊介绍： Pharmaceutical Medicine is a specialist discipline concerned with medical aspects of the discovery, development, evaluation, registration, regulation, monitoring, marketing, distribution and pricing of medicines, drug-device and drug-diagnostic combinations. The Journal disseminates information to support the community of professionals working in these highly inter-related functions. Key areas include translational medicine, clinical trial design, pharmacovigilance, clinical toxicology, drug regulation, clinical pharmacology, biostatistics and pharmacoeconomics. The Journal includes:Overviews of contentious or emerging issues.Comprehensive narrative reviews that provide an authoritative source of information on topical issues.Systematic reviews that collate empirical evidence to answer a specific research question, using explicit, systematic methods as outlined by PRISMA statement.Original research articles reporting the results of well-designed studies with a strong link to wider areas of clinical research.Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in Pharmaceutical Medicine may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.All manuscripts are subject to peer review by international experts. Letters to the Editor are welcomed and will be considered for publication.