JournalADE: Creation and validation of a novel program for automated data extraction (ADE) to assess authorship gender representation

IF 0.2 Q4 ORTHOPEDICS
Emily L. Larson, Shivani Pandya, S. Stewart, Jessica Schmerler, S. Jabori, Helen Xun, Kriti Jain, Dawn LaPorte, A. Aiyer
{"title":"JournalADE: Creation and validation of a novel program for automated data extraction (ADE) to assess authorship gender representation","authors":"Emily L. Larson, Shivani Pandya, S. Stewart, Jessica Schmerler, S. Jabori, Helen Xun, Kriti Jain, Dawn LaPorte, A. Aiyer","doi":"10.1097/bco.0000000000001272","DOIUrl":null,"url":null,"abstract":"\n \n Analyses of gender in academic authorship are key to characterizing representation in surgical fields, but current methods of manual data collection are time-consuming and error prone. The purpose of this study was to design a program to automatically extract publication data and verify the accuracy of this program in comparison to manually-collected data in a pilot study of three orthopaedic surgery journals.\n \n \n \n Publications from three orthopaedic subspecialty journals between January 2019 and June 2021 were identified via PubMed search. For each publication, online publication date, journal issue month, first author name, and senior author name were collected from PubMed listings by hand and programmatically in a Python script (JournalADE). Gender was determined using Gender API.\n \n \n \n The percent of publications for which manually- and program-collected online publication dates were within 14 days of each other was above 95% for all journals. There was 98.3% (95% CI=97.84-98.76%) agreement for online publication date, with a mean difference of 6.43 (SD 0.87) days. Journal issue month agreement was 99.6% (95% CI=99.37-99.83%). Agreement for first author gender was 97.33% (95% CI=96.75-97.91%) and for senior author gender was 96.77% (95% CI=96.14-97.4%). Estimated labor time for manual collection was 100 hr, compared to 15 min for JournalADE.\n \n \n \n When comparing the JournalADE- and manually-collected data, rates of agreement were high at a fraction of the time. This supports the efficacy of JournalADE and sets the stage for its use in future studies of gender in authorship.\n","PeriodicalId":10732,"journal":{"name":"Current Orthopaedic Practice","volume":null,"pages":null},"PeriodicalIF":0.2000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Orthopaedic Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/bco.0000000000001272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0

Abstract

Analyses of gender in academic authorship are key to characterizing representation in surgical fields, but current methods of manual data collection are time-consuming and error prone. The purpose of this study was to design a program to automatically extract publication data and verify the accuracy of this program in comparison to manually-collected data in a pilot study of three orthopaedic surgery journals. Publications from three orthopaedic subspecialty journals between January 2019 and June 2021 were identified via PubMed search. For each publication, online publication date, journal issue month, first author name, and senior author name were collected from PubMed listings by hand and programmatically in a Python script (JournalADE). Gender was determined using Gender API. The percent of publications for which manually- and program-collected online publication dates were within 14 days of each other was above 95% for all journals. There was 98.3% (95% CI=97.84-98.76%) agreement for online publication date, with a mean difference of 6.43 (SD 0.87) days. Journal issue month agreement was 99.6% (95% CI=99.37-99.83%). Agreement for first author gender was 97.33% (95% CI=96.75-97.91%) and for senior author gender was 96.77% (95% CI=96.14-97.4%). Estimated labor time for manual collection was 100 hr, compared to 15 min for JournalADE. When comparing the JournalADE- and manually-collected data, rates of agreement were high at a fraction of the time. This supports the efficacy of JournalADE and sets the stage for its use in future studies of gender in authorship.
JournalADE:创建并验证用于自动数据提取(ADE)的新型程序,以评估作者的性别代表性
对学术作者的性别进行分析是描述外科领域代表性的关键,但目前的人工数据收集方法既费时又容易出错。本研究的目的是设计一个自动提取论文数据的程序,并在三本骨科外科期刊的试点研究中验证该程序与人工收集数据的准确性。 通过 PubMed 搜索确定了 2019 年 1 月至 2021 年 6 月期间三本骨科亚专科期刊上的论文。每篇论文的在线发表日期、期刊发行月份、第一作者姓名和资深作者姓名都是通过手工和 Python 脚本 (JournalADE) 程序从 PubMed 列表中收集的。性别通过性别 API 确定。 在所有期刊中,人工和程序收集的在线发表日期相差不超过 14 天的论文比例均在 95% 以上。在线出版日期的一致性为 98.3% (95% CI=97.84-98.76%) ,平均相差 6.43 天 (SD 0.87)。期刊出版月份的一致性为 99.6% (95% CI=99.37-99.83%)。第一作者性别的一致性为 97.33% (95% CI=96.75-97.91%),资深作者性别的一致性为 96.77% (95% CI=96.14-97.4%)。人工收集估计需要 100 小时,而 JournalADE 只需 15 分钟。 比较JournalADE和人工收集的数据,两者的一致率很高,只用了一小部分时间。这证明了JournalADE的有效性,并为其在未来作者性别研究中的应用奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.60
自引率
0.00%
发文量
107
期刊介绍: Lippincott Williams & Wilkins is a leading international publisher of professional health information for physicians, nurses, specialized clinicians and students. For a complete listing of titles currently published by Lippincott Williams & Wilkins and detailed information about print, online, and other offerings, please visit the LWW Online Store. Current Orthopaedic Practice is a peer-reviewed, general orthopaedic journal that translates clinical research into best practices for diagnosing, treating, and managing musculoskeletal disorders. The journal publishes original articles in the form of clinical research, invited special focus reviews and general reviews, as well as original articles on innovations in practice, case reports, point/counterpoint, and diagnostic imaging.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信