Elizabeth Pleasants, Ndola Prata, Ushma D Upadhyay, Cassondra Marshall, Coye Cheshire
{"title":"使用自然语言处理描述2022年期间堕胎在线社区的使用:Reddit帖子的动态主题建模分析。","authors":"Elizabeth Pleasants, Ndola Prata, Ushma D Upadhyay, Cassondra Marshall, Coye Cheshire","doi":"10.2196/72771","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Abortion access in the United States has been in a state of rapid change and increasing restriction since the Dobbs v Jackson Women's Health Organization decision from the US Supreme Court in June 2022. With further constraints on access to abortion since Dobbs, the internet and online communities are playing an increasingly important role in people's abortion trajectories. There is a need for a broader understanding of how online resources are used for abortion and how they may reflect changes in the sociopolitical and legal context of abortion access. Research using online information and leveraging methods to work efficiently with large textual datasets has the potential to accelerate knowledge generation and provide novel insights into changing abortion-related experiences following Dobbs, helping address these knowledge gaps.</p><p><strong>Objective: </strong>This project sought to use natural language processing techniques, specifically topic modeling, to explore the content of posts to 1 online community for abortion (r/abortion) in 2022 and assess how community use changed during that time.</p><p><strong>Methods: </strong>This analysis described and explored posts shared throughout 2022 and for 3 subperiods of interest: before the Dobbs leak (December 24, 2021-May 1, 2022), Dobbs leak to decision (May 2, 2022-June 23, 2022), and after the Dobbs decision (June 24, 2022-December 23, 2022). We used topic modeling to obtain descriptive topics for the year and each subperiod and then classified posts. Topics were then aggregated into conceptual groups based on a combination of quantitative and qualitative assessments. The proportion of posts classified in each conceptual group was used to assess change in community interests across the 3 study subperiods.</p><p><strong>Results: </strong>The 7273 posts shared in r/abortion in 2022 included in our analyses were categorized into 8 conceptual groups: abortion decision-making, navigating abortion access barriers, clinical abortion care, medication abortion processes, postabortion physical experiences, potential pregnancy, and self-managed abortion processes. Posts related to navigating access barriers were most common. The proportion of posts about abortion decision-making and self-management changed significantly across study periods (P=.006 and P<.001, respectively); abortion decision-making posts were more common before the Dobbs leak, whereas those related to self-management increased following the leak and decision.</p><p><strong>Conclusions: </strong>This analysis provides a holistic view of r/abortion posts in 2022, highlighting the important role of online communities as abortion-supportive online resources and changing interests among posters with abortion policy changes. As policies and pathways to abortion access continue to change across the United States, approaches leveraging natural language processing with sufficiently large samples of textual data present opportunities for timely monitoring, with the potential to reflect a broad range of abortion experiences, including those of people who have limited or no interaction with clinical abortion care.</p>","PeriodicalId":73554,"journal":{"name":"JMIR infodemiology","volume":"5 ","pages":"e72771"},"PeriodicalIF":3.5000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Natural Language Processing to Describe the Use of an Online Community for Abortion During 2022: Dynamic Topic Modeling Analysis of Reddit Posts.\",\"authors\":\"Elizabeth Pleasants, Ndola Prata, Ushma D Upadhyay, Cassondra Marshall, Coye Cheshire\",\"doi\":\"10.2196/72771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Abortion access in the United States has been in a state of rapid change and increasing restriction since the Dobbs v Jackson Women's Health Organization decision from the US Supreme Court in June 2022. With further constraints on access to abortion since Dobbs, the internet and online communities are playing an increasingly important role in people's abortion trajectories. There is a need for a broader understanding of how online resources are used for abortion and how they may reflect changes in the sociopolitical and legal context of abortion access. Research using online information and leveraging methods to work efficiently with large textual datasets has the potential to accelerate knowledge generation and provide novel insights into changing abortion-related experiences following Dobbs, helping address these knowledge gaps.</p><p><strong>Objective: </strong>This project sought to use natural language processing techniques, specifically topic modeling, to explore the content of posts to 1 online community for abortion (r/abortion) in 2022 and assess how community use changed during that time.</p><p><strong>Methods: </strong>This analysis described and explored posts shared throughout 2022 and for 3 subperiods of interest: before the Dobbs leak (December 24, 2021-May 1, 2022), Dobbs leak to decision (May 2, 2022-June 23, 2022), and after the Dobbs decision (June 24, 2022-December 23, 2022). We used topic modeling to obtain descriptive topics for the year and each subperiod and then classified posts. Topics were then aggregated into conceptual groups based on a combination of quantitative and qualitative assessments. The proportion of posts classified in each conceptual group was used to assess change in community interests across the 3 study subperiods.</p><p><strong>Results: </strong>The 7273 posts shared in r/abortion in 2022 included in our analyses were categorized into 8 conceptual groups: abortion decision-making, navigating abortion access barriers, clinical abortion care, medication abortion processes, postabortion physical experiences, potential pregnancy, and self-managed abortion processes. Posts related to navigating access barriers were most common. The proportion of posts about abortion decision-making and self-management changed significantly across study periods (P=.006 and P<.001, respectively); abortion decision-making posts were more common before the Dobbs leak, whereas those related to self-management increased following the leak and decision.</p><p><strong>Conclusions: </strong>This analysis provides a holistic view of r/abortion posts in 2022, highlighting the important role of online communities as abortion-supportive online resources and changing interests among posters with abortion policy changes. As policies and pathways to abortion access continue to change across the United States, approaches leveraging natural language processing with sufficiently large samples of textual data present opportunities for timely monitoring, with the potential to reflect a broad range of abortion experiences, including those of people who have limited or no interaction with clinical abortion care.</p>\",\"PeriodicalId\":73554,\"journal\":{\"name\":\"JMIR infodemiology\",\"volume\":\"5 \",\"pages\":\"e72771\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR infodemiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/72771\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR infodemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/72771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Using Natural Language Processing to Describe the Use of an Online Community for Abortion During 2022: Dynamic Topic Modeling Analysis of Reddit Posts.
Background: Abortion access in the United States has been in a state of rapid change and increasing restriction since the Dobbs v Jackson Women's Health Organization decision from the US Supreme Court in June 2022. With further constraints on access to abortion since Dobbs, the internet and online communities are playing an increasingly important role in people's abortion trajectories. There is a need for a broader understanding of how online resources are used for abortion and how they may reflect changes in the sociopolitical and legal context of abortion access. Research using online information and leveraging methods to work efficiently with large textual datasets has the potential to accelerate knowledge generation and provide novel insights into changing abortion-related experiences following Dobbs, helping address these knowledge gaps.
Objective: This project sought to use natural language processing techniques, specifically topic modeling, to explore the content of posts to 1 online community for abortion (r/abortion) in 2022 and assess how community use changed during that time.
Methods: This analysis described and explored posts shared throughout 2022 and for 3 subperiods of interest: before the Dobbs leak (December 24, 2021-May 1, 2022), Dobbs leak to decision (May 2, 2022-June 23, 2022), and after the Dobbs decision (June 24, 2022-December 23, 2022). We used topic modeling to obtain descriptive topics for the year and each subperiod and then classified posts. Topics were then aggregated into conceptual groups based on a combination of quantitative and qualitative assessments. The proportion of posts classified in each conceptual group was used to assess change in community interests across the 3 study subperiods.
Results: The 7273 posts shared in r/abortion in 2022 included in our analyses were categorized into 8 conceptual groups: abortion decision-making, navigating abortion access barriers, clinical abortion care, medication abortion processes, postabortion physical experiences, potential pregnancy, and self-managed abortion processes. Posts related to navigating access barriers were most common. The proportion of posts about abortion decision-making and self-management changed significantly across study periods (P=.006 and P<.001, respectively); abortion decision-making posts were more common before the Dobbs leak, whereas those related to self-management increased following the leak and decision.
Conclusions: This analysis provides a holistic view of r/abortion posts in 2022, highlighting the important role of online communities as abortion-supportive online resources and changing interests among posters with abortion policy changes. As policies and pathways to abortion access continue to change across the United States, approaches leveraging natural language processing with sufficiently large samples of textual data present opportunities for timely monitoring, with the potential to reflect a broad range of abortion experiences, including those of people who have limited or no interaction with clinical abortion care.