An automated algorithm using free-text clinical notes to improve identification of transgender people.

IF 2.5 4区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Informatics for Health & Social Care Pub Date : 2021-03-02 Epub Date: 2020-11-17 DOI:10.1080/17538157.2020.1828890
Fagen Xie, Darios Getahun, Virginia P Quinn, Theresa M Im, Richard Contreras, Michael J Silverberg, Tisha C Baird, Rebecca Nash, Lee Cromwell, Douglas Roblin, Trenton Hoffman, Michael Goodman
{"title":"An automated algorithm using free-text clinical notes to improve identification of transgender people.","authors":"Fagen Xie,&nbsp;Darios Getahun,&nbsp;Virginia P Quinn,&nbsp;Theresa M Im,&nbsp;Richard Contreras,&nbsp;Michael J Silverberg,&nbsp;Tisha C Baird,&nbsp;Rebecca Nash,&nbsp;Lee Cromwell,&nbsp;Douglas Roblin,&nbsp;Trenton Hoffman,&nbsp;Michael Goodman","doi":"10.1080/17538157.2020.1828890","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate identification of transgender persons is a critical first step in conducting transgender health studies. To develop an automated algorithm for identifying transgender individuals from electronic medical records (EMR) using free-text clinical notes. The development and validation of the algorithm was based on data from an integrated healthcare system that served as a participating site in the multicenter Study of Transition Outcomes and Gender. The training and test datasets each contained a total of 300 individuals identified between 2006 and 2014. Both datasets underwent a full medical record review by experienced research abstractors. The validated algorithm was then implemented to identify transgender individuals in the EMR using all clinical notes of patients that received care between January 1, 2015 and June 30, 2018. Validation of the algorithm against the full chart review demonstrated a high degree of accuracy with 97% sensitivity, 95% specificity, 94% positive predictive value, and 97% negative predictive value. The algorithm classified 7,409 individuals (3.5%) as \"Definitely transgender\" and 679 individuals (0.3%) as \"Probably transgender\" out of 212,138 candidates with a total of 378,641 clinical notes. The computerized NLP algorithm can support essential efforts to improve the health of transgender people.</p>","PeriodicalId":54984,"journal":{"name":"Informatics for Health & Social Care","volume":"46 1","pages":"18-28"},"PeriodicalIF":2.5000,"publicationDate":"2021-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17538157.2020.1828890","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for Health & Social Care","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17538157.2020.1828890","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/11/17 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 6

Abstract

Accurate identification of transgender persons is a critical first step in conducting transgender health studies. To develop an automated algorithm for identifying transgender individuals from electronic medical records (EMR) using free-text clinical notes. The development and validation of the algorithm was based on data from an integrated healthcare system that served as a participating site in the multicenter Study of Transition Outcomes and Gender. The training and test datasets each contained a total of 300 individuals identified between 2006 and 2014. Both datasets underwent a full medical record review by experienced research abstractors. The validated algorithm was then implemented to identify transgender individuals in the EMR using all clinical notes of patients that received care between January 1, 2015 and June 30, 2018. Validation of the algorithm against the full chart review demonstrated a high degree of accuracy with 97% sensitivity, 95% specificity, 94% positive predictive value, and 97% negative predictive value. The algorithm classified 7,409 individuals (3.5%) as "Definitely transgender" and 679 individuals (0.3%) as "Probably transgender" out of 212,138 candidates with a total of 378,641 clinical notes. The computerized NLP algorithm can support essential efforts to improve the health of transgender people.

一个使用自由文本临床记录的自动算法,以提高对变性人的识别。
准确识别跨性别者是开展跨性别健康研究的关键第一步。开发一种自动算法,用于使用自由文本临床记录从电子医疗记录(EMR)中识别跨性别者。该算法的开发和验证基于一个综合医疗保健系统的数据,该系统作为多中心过渡结果和性别研究的参与站点。训练和测试数据集各包含2006年至2014年间确定的300个个体。两个数据集都由经验丰富的研究摘要人员进行了完整的医疗记录审查。然后实施经过验证的算法,使用2015年1月1日至2018年6月30日期间接受治疗的患者的所有临床记录,在EMR中识别跨性别者。对整个图表的验证表明,该算法具有很高的准确性,灵敏度为97%,特异性为95%,阳性预测值为94%,阴性预测值为97%。该算法将212138名候选人中的7409人(3.5%)分类为“绝对跨性别者”,679人(0.3%)分类为“可能跨性别者”,总共有378641份临床记录。计算机化的NLP算法可以支持改善变性人健康的必要努力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.10
自引率
4.20%
发文量
21
审稿时长
>12 weeks
期刊介绍: Informatics for Health & Social Care promotes evidence-based informatics as applied to the domain of health and social care. It showcases informatics research and practice within the many and diverse contexts of care; it takes personal information, both its direct and indirect use, as its central focus. The scope of the Journal is broad, encompassing both the properties of care information and the life-cycle of associated information systems. Consideration of the properties of care information will necessarily include the data itself, its representation, structure, and associated processes, as well as the context of its use, highlighting the related communication, computational, cognitive, social and ethical aspects. Consideration of the life-cycle of care information systems includes full range from requirements, specifications, theoretical models and conceptual design through to sustainable implementations, and the valuation of impacts. Empirical evidence experiences related to implementation are particularly welcome. Informatics in Health & Social Care seeks to consolidate and add to the core knowledge within the disciplines of Health and Social Care Informatics. The Journal therefore welcomes scientific papers, case studies and literature reviews. Examples of novel approaches are particularly welcome. Articles might, for example, show how care data is collected and transformed into useful and usable information, how informatics research is translated into practice, how specific results can be generalised, or perhaps provide case studies that facilitate learning from experience.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信