Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer
{"title":"用不完全访谈数据预测校园暴力风险:一种自动评估方法。","authors":"Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer","doi":"10.1093/jamiaopen/ooaf084","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.</p><p><strong>Materials and methods: </strong>Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (<i>n</i> <b>≤</b> 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.</p><p><strong>Results: </strong>Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.</p><p><strong>Discussion and conclusion: </strong>ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf084"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313018/pdf/","citationCount":"0","resultStr":"{\"title\":\"Forecasting school violence risk with incomplete interview data: an automated assessment approach.\",\"authors\":\"Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer\",\"doi\":\"10.1093/jamiaopen/ooaf084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.</p><p><strong>Materials and methods: </strong>Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (<i>n</i> <b>≤</b> 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.</p><p><strong>Results: </strong>Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.</p><p><strong>Discussion and conclusion: </strong>ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.</p>\",\"PeriodicalId\":36278,\"journal\":{\"name\":\"JAMIA Open\",\"volume\":\"8 4\",\"pages\":\"ooaf084\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313018/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JAMIA Open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/jamiaopen/ooaf084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooaf084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Forecasting school violence risk with incomplete interview data: an automated assessment approach.
Objectives: School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.
Materials and methods: Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (n≤ 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.
Results: Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.
Discussion and conclusion: ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.