Journal of methods and measurement in the social sciences最新文献

Machine Learning Method for High-Dimensional Education Data 高维教育数据的机器学习方法

Journal of methods and measurement in the social sciences Pub Date : 2022-10-01 DOI: 10.2458/jmmss.5396

Haiyan Bai, Xing Liu, F. Bai, Yuting Chen, Randyll Pandohie

引用次数: 0

Comparing human coding to two natural language processing algorithms in aspirations of people affected by Duchenne Muscular Dystrophy 比较人类编码与两种自然语言处理算法对Duchenne肌肉营养不良患者愿望的影响

Journal of methods and measurement in the social sciences Pub Date : 2022-10-01 DOI: 10.2458/jmmss.5397

C. Schwartz, Roland B. Stark, Elijah Biletch, Richard B. B. Stuart

{"title":"Comparing human coding to two natural language processing algorithms in aspirations of people affected by Duchenne Muscular Dystrophy","authors":"C. Schwartz, Roland B. Stark, Elijah Biletch, Richard B. B. Stuart","doi":"10.2458/jmmss.5397","DOIUrl":"https://doi.org/10.2458/jmmss.5397","url":null,"abstract":"Qualitative methods can enhance our understanding of constructs that have not been well portrayed and enable nuanced depiction of experience from study participants who have not been broadly studied. However, qualitative data require time and effort to train raters to achieve validity and reliability. This study compares recent advances in Natural Language Processing (NLP) models with human coding. This web-based study (N=1,253; 3,046 free-text entries, averaging 64 characters per entry) included people with Duchenne Muscular Dystrophy (DMD), their siblings, and a representative comparison group. Human raters (n=6) were trained over multiple sessions in content analysis as per a comprehensive codebook. Three prompts addressed distinct aspects of participants’ aspirations. Unsupervised NLP was implemented using Latent Dirichlet Allocation (LDA), which extracts latent topics across all the free-text entries. Supervised NLP was done using a Bidirectional Encoder Representations from Transformers (BERT) model, which requires training the algorithm to recognize relevant human-coded themes across free-text entries. We compared the human-, LDA-, and BERT-coded themes. Study sample contained 286 people with DMD, 355 DMD siblings, and 997 comparison participants, age 8-69. Human coders generated 95 codes across the three prompts and had an average inter-rater reliability (Fleiss’s kappa) of 0.77, with minimal rater-effect (pseudo R2=4%). Compared to human coders, LDA does not yield easily interpretable themes. BERT correctly classified only 61-70% of the validation set. LDA and BERT required technical expertise to program and took approximately 1.15 minutes per open-text entry, compared to 1.18 minutes for human raters including training time. LDA and BERT provide potentially viable approaches to analyzing large-scale qualitative data, but both have limitations. When text entries are short, LDA yields latent topics that are hard to interpret. BERT accurately identified only about two thirds of new statements. Humans provided reliable and cost-effective coding in the web-based context. The upfront training enables BERT to process enormous quantities of text data in future work, which should examine NLP’s predictive accuracy given different quantities of training data.","PeriodicalId":90602,"journal":{"name":"Journal of methods and measurement in the social sciences","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48496402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Invitation for COVID-19 Submissions 新冠肺炎提交邀请函

Journal of methods and measurement in the social sciences Pub Date : 2022-10-01 DOI: 10.2458/jmmss.5395

E. Board

引用次数: 0

Binary Classification: An Introductory Machine Learning Tutorial for Social Scientists 二元分类:面向社会科学家的机器学习入门教程

Journal of methods and measurement in the social sciences Pub Date : 2021-12-12 DOI: 10.2458/jmmss.5186

Vivian P. Ta, Leonardo Carrico, Arthur Bousquet

引用次数: 0

Journal of Methods and Measurement in the Social Sciences 社会科学方法与测量杂志

Journal of methods and measurement in the social sciences Pub Date : 2021-12-12 DOI: 10.2458/jmmss.5185

Editorial Board

引用次数: 1

The Modern Biased Information Test: Proposing alternatives for implicit measures 现代偏倚信息检验:为隐性措施提出替代方案

Journal of methods and measurement in the social sciences Pub Date : 2021-12-12 DOI: 10.2458/jmmss.2966

A. Figueredo, V. Smith-Castro, Mateo Peñaherrera-Aguirre

引用次数: 0

From the Editors 来自编辑

Journal of methods and measurement in the social sciences Pub Date : 2021-11-01 DOI: 10.2458/jmmss.3058

E. Board

引用次数: 0

In Defense of Fishing 为渔业辩护

Journal of methods and measurement in the social sciences Pub Date : 2021-11-01 DOI: 10.2458/jmmss.3063

R. Byrne

引用次数: 1

Echoes from the Past: Meaning in Measures, Environments, and Predictions 过去的回声：测量、环境和预测中的意义

Journal of methods and measurement in the social sciences Pub Date : 2021-11-01 DOI: 10.2458/jmmss.3064

B. Krauss

引用次数: 0

Marvel Cinematic Universe Introductions 漫威电影宇宙简介

Journal of methods and measurement in the social sciences Pub Date : 2021-11-01 DOI: 10.2458/jmmss.3066

A. Weiss

引用次数: 1