Using generative AI for large-scale qualitative analysis of social media posts to understand why people leave computer science

IF 3.4 2区 工程技术 Q1 EDUCATION & EDUCATIONAL RESEARCH
Amanda Ross, Andrew Katz
{"title":"Using generative AI for large-scale qualitative analysis of social media posts to understand why people leave computer science","authors":"Amanda Ross,&nbsp;Andrew Katz","doi":"10.1002/jee.70036","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Computer science faces a persistent attrition problem, with people leaving the field at a rate that exceeds new entrants. Given the increasing demand for computing jobs, it is essential to focus on reducing the number of individuals exiting the field.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>This study investigates why individuals leave the computer science field across various stages and contexts, addressing two questions: (1) What are the reasons for leaving? (2) What external factors influence these decisions?</p>\n </section>\n \n <section>\n \n <h3> Method</h3>\n \n <p>This large-scale qualitative study collected over 10,000 Reddit posts using keyword-based scraping. Using generative AI, we refined the dataset, filtering it down to 263 relevant posts. Generative AI was then used for thematic analysis on this subset of posts, utilizing the established GATOS method. We extend this approach by integrating a human-in-the-loop process to contextualize the identified themes within social cognitive career theory.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Findings reveal diverse reasons for leaving, including job dissatisfaction, interests in other fields, psychological factors, academic challenges, health concerns, and industry issues. Influential factors include background, transition requirements, alternative field characteristics, and personal circumstances. Although the extent varied, all of these reasons and factors were observed at every departure stage.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>These findings provide important insights that can help inform industry and academic policies and practices. Additionally, we contribute to the development of more efficient, scalable workflows for future qualitative research using generative AI.</p>\n </section>\n </div>","PeriodicalId":50206,"journal":{"name":"Journal of Engineering Education","volume":"114 4","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jee.70036","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Engineering Education","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jee.70036","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Computer science faces a persistent attrition problem, with people leaving the field at a rate that exceeds new entrants. Given the increasing demand for computing jobs, it is essential to focus on reducing the number of individuals exiting the field.

Purpose

This study investigates why individuals leave the computer science field across various stages and contexts, addressing two questions: (1) What are the reasons for leaving? (2) What external factors influence these decisions?

Method

This large-scale qualitative study collected over 10,000 Reddit posts using keyword-based scraping. Using generative AI, we refined the dataset, filtering it down to 263 relevant posts. Generative AI was then used for thematic analysis on this subset of posts, utilizing the established GATOS method. We extend this approach by integrating a human-in-the-loop process to contextualize the identified themes within social cognitive career theory.

Results

Findings reveal diverse reasons for leaving, including job dissatisfaction, interests in other fields, psychological factors, academic challenges, health concerns, and industry issues. Influential factors include background, transition requirements, alternative field characteristics, and personal circumstances. Although the extent varied, all of these reasons and factors were observed at every departure stage.

Conclusions

These findings provide important insights that can help inform industry and academic policies and practices. Additionally, we contribute to the development of more efficient, scalable workflows for future qualitative research using generative AI.

Abstract Image

使用生成式人工智能对社交媒体帖子进行大规模定性分析,以了解人们离开计算机科学的原因
计算机科学面临着一个持续的人员流失问题,人们离开这个领域的速度超过了新进入者。鉴于对计算机工作的需求不断增加,有必要将重点放在减少退出该领域的个人数量上。本研究调查了个人在不同阶段和背景下离开计算机科学领域的原因,解决了两个问题:(1)离开的原因是什么?(2)哪些外部因素影响这些决策?方法本研究采用基于关键词的抓取技术,收集了1万多条Reddit帖子。使用生成式人工智能,我们改进了数据集,将其过滤到263个相关帖子。然后使用生成式人工智能对该帖子子集进行主题分析,利用已建立的GATOS方法。我们通过整合人在循环过程来扩展这种方法,将社会认知职业理论中确定的主题置于背景中。研究结果显示,学生离职的原因多种多样,包括对工作不满意、对其他领域的兴趣、心理因素、学业挑战、健康问题和行业问题。影响因素包括背景、过渡要求、替代领域特征和个人情况。虽然程度不同,但在每个出发阶段都观察到所有这些原因和因素。这些发现提供了重要的见解,可以帮助告知行业和学术政策和实践。此外,我们还利用生成式人工智能为未来的定性研究开发更高效、可扩展的工作流程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Engineering Education
Journal of Engineering Education 工程技术-工程:综合
CiteScore
12.20
自引率
11.80%
发文量
47
审稿时长
>12 weeks
期刊介绍: The Journal of Engineering Education (JEE) serves to cultivate, disseminate, and archive scholarly research in engineering education.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信