Emily L. Larson, Shivani Pandya, S. Stewart, Jessica Schmerler, S. Jabori, Helen Xun, Kriti Jain, Dawn LaPorte, A. Aiyer
{"title":"JournalADE: Creation and validation of a novel program for automated data extraction (ADE) to assess authorship gender representation","authors":"Emily L. Larson, Shivani Pandya, S. Stewart, Jessica Schmerler, S. Jabori, Helen Xun, Kriti Jain, Dawn LaPorte, A. Aiyer","doi":"10.1097/bco.0000000000001272","DOIUrl":null,"url":null,"abstract":"\n \n Analyses of gender in academic authorship are key to characterizing representation in surgical fields, but current methods of manual data collection are time-consuming and error prone. The purpose of this study was to design a program to automatically extract publication data and verify the accuracy of this program in comparison to manually-collected data in a pilot study of three orthopaedic surgery journals.\n \n \n \n Publications from three orthopaedic subspecialty journals between January 2019 and June 2021 were identified via PubMed search. For each publication, online publication date, journal issue month, first author name, and senior author name were collected from PubMed listings by hand and programmatically in a Python script (JournalADE). Gender was determined using Gender API.\n \n \n \n The percent of publications for which manually- and program-collected online publication dates were within 14 days of each other was above 95% for all journals. There was 98.3% (95% CI=97.84-98.76%) agreement for online publication date, with a mean difference of 6.43 (SD 0.87) days. Journal issue month agreement was 99.6% (95% CI=99.37-99.83%). Agreement for first author gender was 97.33% (95% CI=96.75-97.91%) and for senior author gender was 96.77% (95% CI=96.14-97.4%). Estimated labor time for manual collection was 100 hr, compared to 15 min for JournalADE.\n \n \n \n When comparing the JournalADE- and manually-collected data, rates of agreement were high at a fraction of the time. This supports the efficacy of JournalADE and sets the stage for its use in future studies of gender in authorship.\n","PeriodicalId":10732,"journal":{"name":"Current Orthopaedic Practice","volume":null,"pages":null},"PeriodicalIF":0.2000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Orthopaedic Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/bco.0000000000001272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
Abstract
Analyses of gender in academic authorship are key to characterizing representation in surgical fields, but current methods of manual data collection are time-consuming and error prone. The purpose of this study was to design a program to automatically extract publication data and verify the accuracy of this program in comparison to manually-collected data in a pilot study of three orthopaedic surgery journals.
Publications from three orthopaedic subspecialty journals between January 2019 and June 2021 were identified via PubMed search. For each publication, online publication date, journal issue month, first author name, and senior author name were collected from PubMed listings by hand and programmatically in a Python script (JournalADE). Gender was determined using Gender API.
The percent of publications for which manually- and program-collected online publication dates were within 14 days of each other was above 95% for all journals. There was 98.3% (95% CI=97.84-98.76%) agreement for online publication date, with a mean difference of 6.43 (SD 0.87) days. Journal issue month agreement was 99.6% (95% CI=99.37-99.83%). Agreement for first author gender was 97.33% (95% CI=96.75-97.91%) and for senior author gender was 96.77% (95% CI=96.14-97.4%). Estimated labor time for manual collection was 100 hr, compared to 15 min for JournalADE.
When comparing the JournalADE- and manually-collected data, rates of agreement were high at a fraction of the time. This supports the efficacy of JournalADE and sets the stage for its use in future studies of gender in authorship.
期刊介绍:
Lippincott Williams & Wilkins is a leading international publisher of professional health information for physicians, nurses, specialized clinicians and students. For a complete listing of titles currently published by Lippincott Williams & Wilkins and detailed information about print, online, and other offerings, please visit the LWW Online Store. Current Orthopaedic Practice is a peer-reviewed, general orthopaedic journal that translates clinical research into best practices for diagnosing, treating, and managing musculoskeletal disorders. The journal publishes original articles in the form of clinical research, invited special focus reviews and general reviews, as well as original articles on innovations in practice, case reports, point/counterpoint, and diagnostic imaging.