Case reports unlocked: Harnessing large language models to advance research on child maltreatment

IF 3.4 2区 心理学 Q1 FAMILY STUDIES
Dragan Stoll , Samuel Wehrli , David Lätsch
{"title":"Case reports unlocked: Harnessing large language models to advance research on child maltreatment","authors":"Dragan Stoll ,&nbsp;Samuel Wehrli ,&nbsp;David Lätsch","doi":"10.1016/j.chiabu.2024.107202","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Research on child protective services (CPS) is impeded by a lack of high-quality structured data. Crucial information on cases is often documented in case files, but only in narrative form. Researchers have applied automated language processing to extract structured data from these narratives, but this has been limited to classification tasks of fairly low complexity. Large language models (LLMs) may work for more challenging tasks.</div></div><div><h3>Objective</h3><div>We aimed to extract structured data from narrative casework reports by applying LLMs to distinguish between different subtypes of violence: child sexual abuse, child physical abuse, a child witnessing domestic violence, and a child being physically aggressive.</div></div><div><h3>Methods</h3><div>We developed a four-stage pipeline comprising of (1) text segmentation, (2) text segment classification, and subsequent labeling of (3) casework reports, and (4) cases. All CPS reports (<em>N</em> = 29,770) between 2008 and 2022 from Switzerland's largest CPS provider were collected. 28,223 text segments were extracted based on pre-defined keywords. Two human reviewers annotated random samples of text segments and reports for training and validation. Model performance was compared against human-coded test data.</div></div><div><h3>Results</h3><div>The best-performing LLM (Mixtral-8x7B) classified text segments with an accuracy of 87 %, outperforming agreement between the two human reviewers (77 %). The model also correctly labelled casework reports with an accuracy of 87 %, but only when disregarding non-extracted text segments in stage (1).</div></div><div><h3>Conclusions</h3><div>LLMs can replicate human coding of text documents even for highly complex tasks that require contextual information. This may considerably advance research on CPS. Transparency can be achieved by backtracking labeling decisions to individual text segments. Keyword-based text segmentation was identified as a weak point, and the potential for bias that may occur at several stages of the process requires attention.</div></div>","PeriodicalId":51343,"journal":{"name":"Child Abuse & Neglect","volume":"160 ","pages":"Article 107202"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Child Abuse & Neglect","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0145213424005957","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FAMILY STUDIES","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Research on child protective services (CPS) is impeded by a lack of high-quality structured data. Crucial information on cases is often documented in case files, but only in narrative form. Researchers have applied automated language processing to extract structured data from these narratives, but this has been limited to classification tasks of fairly low complexity. Large language models (LLMs) may work for more challenging tasks.

Objective

We aimed to extract structured data from narrative casework reports by applying LLMs to distinguish between different subtypes of violence: child sexual abuse, child physical abuse, a child witnessing domestic violence, and a child being physically aggressive.

Methods

We developed a four-stage pipeline comprising of (1) text segmentation, (2) text segment classification, and subsequent labeling of (3) casework reports, and (4) cases. All CPS reports (N = 29,770) between 2008 and 2022 from Switzerland's largest CPS provider were collected. 28,223 text segments were extracted based on pre-defined keywords. Two human reviewers annotated random samples of text segments and reports for training and validation. Model performance was compared against human-coded test data.

Results

The best-performing LLM (Mixtral-8x7B) classified text segments with an accuracy of 87 %, outperforming agreement between the two human reviewers (77 %). The model also correctly labelled casework reports with an accuracy of 87 %, but only when disregarding non-extracted text segments in stage (1).

Conclusions

LLMs can replicate human coding of text documents even for highly complex tasks that require contextual information. This may considerably advance research on CPS. Transparency can be achieved by backtracking labeling decisions to individual text segments. Keyword-based text segmentation was identified as a weak point, and the potential for bias that may occur at several stages of the process requires attention.
解锁案例报告:利用大型语言模型推进儿童虐待研究。
背景:儿童保护服务(CPS)的研究受到缺乏高质量结构化数据的阻碍。案件的关键信息通常记录在案件档案中,但仅以叙述形式记录。研究人员已经应用自动语言处理从这些叙述中提取结构化数据,但这仅限于相当低复杂性的分类任务。大型语言模型(llm)可能适用于更具挑战性的任务。目的:我们旨在通过应用法学硕士来区分不同亚型的暴力:儿童性虐待、儿童身体虐待、儿童目睹家庭暴力和儿童身体攻击,从叙述性案例报告中提取结构化数据。方法:我们开发了一个四阶段的管道,包括(1)文本分割,(2)文本片段分类,以及随后的标记(3)案例报告,(4)案例。收集了2008年至2022年间瑞士最大的CPS提供商的所有CPS报告(N = 29,770)。基于预定义关键字提取28223个文本片段。两名人工审稿人对文本片段和报告的随机样本进行注释,以进行训练和验证。将模型性能与人工编码的测试数据进行比较。结果:表现最好的LLM (Mixtral-8x7B)分类文本片段的准确率为87%,优于两位人类审稿人之间的协议(77%)。该模型还正确地标记了案例报告,准确率为87%,但仅当忽略阶段(1)中未提取的文本片段时。结论:法学硕士可以复制文本文档的人类编码,即使是需要上下文信息的高度复杂的任务。这可能会大大推进CPS的研究。透明度可以通过回溯标签决策到单个文本段来实现。基于关键字的文本分割被确定为一个弱点,并且可能在过程的几个阶段发生的潜在偏差需要注意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.40
自引率
10.40%
发文量
397
期刊介绍: Official Publication of the International Society for Prevention of Child Abuse and Neglect. Child Abuse & Neglect The International Journal, provides an international, multidisciplinary forum on all aspects of child abuse and neglect, with special emphasis on prevention and treatment; the scope extends further to all those aspects of life which either favor or hinder child development. While contributions will primarily be from the fields of psychology, psychiatry, social work, medicine, nursing, law enforcement, legislature, education, and anthropology, the Journal encourages the concerned lay individual and child-oriented advocate organizations to contribute.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信