Saandeep Aathreya , Tara Nourivandi , Alison Salloum , Leigh J. Ruth , Eric A. Storch , Shaun Canavan
{"title":"创伤后应激障碍儿童的多模式、基于情境的数据集","authors":"Saandeep Aathreya , Tara Nourivandi , Alison Salloum , Leigh J. Ruth , Eric A. Storch , Shaun Canavan","doi":"10.1016/j.patrec.2025.05.003","DOIUrl":null,"url":null,"abstract":"<div><div>The conventional method of diagnosing Post Traumatic Stress Disorder by a clinician has been subjective in nature by taking specific events/context in consideration. Developing AI-based solutions to these sensitive areas calls for adopting similar methodologies. Considering this, we propose a de-identified dataset of children subjects who are clinically diagnosed with/without PTSD in multiple contexts. This datset can help facilitate future research in this area. For each subject, in the dataset, the participant undergoes several sessions with clinicians and/or guardian that brings out various emotional response from the participant. We collect videos of these sessions and for each video, we extract several facial features that detach the identity information of the subjects. These include facial landmarks, head pose, action units (AU), and eye gaze. To evaluate this dataset, we propose a baseline approach to identifying PTSD using the encoded action unit (AU) intensities of the video frames as the features. We show that AU intensities intrinsically captures the expressiveness of the subject and can be leveraged in modeling PTSD solutions. The AU features are used to train a transformer for classification where we propose encoding the low-dimensional AU intensity vectors using a learnable Fourier representation. We show that this encoding, combined with a standard Multilayer Perceptron (MLP) mapping of AU intensities yields a superior result when compared to its individual counterparts. We apply the approach to various contexts of PTSD discussions (e.g., Clinician-child discussion) and our experiments show that using context is essential in classifying videos of children.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"196 ","pages":"Pages 228-235"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multimodal, context-based dataset of children with Post Traumatic Stress Disorder\",\"authors\":\"Saandeep Aathreya , Tara Nourivandi , Alison Salloum , Leigh J. Ruth , Eric A. Storch , Shaun Canavan\",\"doi\":\"10.1016/j.patrec.2025.05.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The conventional method of diagnosing Post Traumatic Stress Disorder by a clinician has been subjective in nature by taking specific events/context in consideration. Developing AI-based solutions to these sensitive areas calls for adopting similar methodologies. Considering this, we propose a de-identified dataset of children subjects who are clinically diagnosed with/without PTSD in multiple contexts. This datset can help facilitate future research in this area. For each subject, in the dataset, the participant undergoes several sessions with clinicians and/or guardian that brings out various emotional response from the participant. We collect videos of these sessions and for each video, we extract several facial features that detach the identity information of the subjects. These include facial landmarks, head pose, action units (AU), and eye gaze. To evaluate this dataset, we propose a baseline approach to identifying PTSD using the encoded action unit (AU) intensities of the video frames as the features. We show that AU intensities intrinsically captures the expressiveness of the subject and can be leveraged in modeling PTSD solutions. The AU features are used to train a transformer for classification where we propose encoding the low-dimensional AU intensity vectors using a learnable Fourier representation. We show that this encoding, combined with a standard Multilayer Perceptron (MLP) mapping of AU intensities yields a superior result when compared to its individual counterparts. We apply the approach to various contexts of PTSD discussions (e.g., Clinician-child discussion) and our experiments show that using context is essential in classifying videos of children.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"196 \",\"pages\":\"Pages 228-235\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167865525001928\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525001928","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Multimodal, context-based dataset of children with Post Traumatic Stress Disorder
The conventional method of diagnosing Post Traumatic Stress Disorder by a clinician has been subjective in nature by taking specific events/context in consideration. Developing AI-based solutions to these sensitive areas calls for adopting similar methodologies. Considering this, we propose a de-identified dataset of children subjects who are clinically diagnosed with/without PTSD in multiple contexts. This datset can help facilitate future research in this area. For each subject, in the dataset, the participant undergoes several sessions with clinicians and/or guardian that brings out various emotional response from the participant. We collect videos of these sessions and for each video, we extract several facial features that detach the identity information of the subjects. These include facial landmarks, head pose, action units (AU), and eye gaze. To evaluate this dataset, we propose a baseline approach to identifying PTSD using the encoded action unit (AU) intensities of the video frames as the features. We show that AU intensities intrinsically captures the expressiveness of the subject and can be leveraged in modeling PTSD solutions. The AU features are used to train a transformer for classification where we propose encoding the low-dimensional AU intensity vectors using a learnable Fourier representation. We show that this encoding, combined with a standard Multilayer Perceptron (MLP) mapping of AU intensities yields a superior result when compared to its individual counterparts. We apply the approach to various contexts of PTSD discussions (e.g., Clinician-child discussion) and our experiments show that using context is essential in classifying videos of children.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.