Detecting aggression in clinical treatment videos

Walker S. Arce , Seth G. Walker , Jordan DeBrine , Benjamin S. Riggan , James E. Gehringer
{"title":"Detecting aggression in clinical treatment videos","authors":"Walker S. Arce ,&nbsp;Seth G. Walker ,&nbsp;Jordan DeBrine ,&nbsp;Benjamin S. Riggan ,&nbsp;James E. Gehringer","doi":"10.1016/j.mlwa.2023.100515","DOIUrl":null,"url":null,"abstract":"<div><p>Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Considering the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, where both used a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that the capability of detecting aggression from clinical videos is possible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"14 ","pages":"Article 100515"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000683/pdfft?md5=7be193e80aa9244b29f8609ccc55e9e6&pid=1-s2.0-S2666827023000683-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827023000683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Considering the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, where both used a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that the capability of detecting aggression from clinical videos is possible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.

在临床治疗视频中检测攻击性
许多临床空间配备了集中的视频记录系统来监控病人与病人之间的互动。考虑到人们对基于视频的机器学习方法的兴趣日益增加,使用这些临床记录来自动收集观察数据的潜力是显而易见的。为了探索这一点,我们的临床团队训练了编码员,并对7名患者的功能评估和治疗过程进行了视频注释。常用的临床软件在调整行为和视频数据方面存在固有的局限性,因此采用自定义软件工具来解决这一功能差距。在为这个工具开发了一个基于canvas的编码员培训课程后,一个由六名训练有素的编码员组成的团队注释了82.33小时的数据。考虑了两种机器学习方法,其中都使用卷积神经网络作为视频特征提取器。第一种方法使用循环网络作为提取特征的分类器,第二种方法使用Transformer架构。这两个模型都产生了有希望的指标,表明从临床视频中检测攻击的能力是可能的和可推广的。模型性能与特征提取器在ImageNet上的性能直接相关,在ImageNet上,ConvNeXtXL产生了性能最好的模型。这可以应用于自动化患者事件响应,以提高患者和临床医生的安全性,并可以直接集成到现有的视频管理系统中进行实时分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Machine learning with applications
Machine learning with applications Management Science and Operations Research, Artificial Intelligence, Computer Science Applications
自引率
0.00%
发文量
0
审稿时长
98 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信