Dan Vilenchik, Julie Cwikel, Yaakob Ezra, Tuvia Hausdorff, Mor Lazarov, Ruslan Sergienko, Rachel Abramovitz, Ilana Schmidt, Alison Stern Perez
{"title":"Method Matters: Enhancing Voice-Based Depression Detection With a New Data Collection Framework","authors":"Dan Vilenchik, Julie Cwikel, Yaakob Ezra, Tuvia Hausdorff, Mor Lazarov, Ruslan Sergienko, Rachel Abramovitz, Ilana Schmidt, Alison Stern Perez","doi":"10.1155/da/4839334","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Depression accounts for a major share of global disability-adjusted life-years (DALYs). Diagnosis typically requires a psychiatrist or lengthy self-assessments, which can be challenging for symptomatic individuals. Developing reliable, noninvasive, and accessible detection methods is a healthcare priority. Voice analysis offers a promising approach for early depression detection, potentially improving treatment access and reducing costs. This paper presents a novel pipeline for depression detection that addresses several critical challenges in the field, including data imbalance, label quality, and model generalizability. Our study utilizes a high-quality, high-depression-prevalence dataset collected from a specialized chronic pain clinic, enabling robust depression detection even with a limited sample size. We obtained a lift in the accuracy of up to 15% over the 50–50 baseline in our 52-patient dataset using a 3-fold cross-validation test (which means the train set is <i>n</i> = 34, std 2.8%, <i>p</i>-value 0.01). We further show that combining voice-only acoustic features with a single self-report question (subject unit of distress [SUDs]) significantly improves predictive accuracy. While relying on SUDs is not always good practice, our data collection setting lacked incentives to misrepresent depression status; SUDs were highly reliable, giving 86% accuracy; adding acoustic features raises it to 92%, exceeding the stand-alone potential of SUDs with a <i>p</i>-value 0.1. Further data collection will enhance accuracy, supporting a rapid, noninvasive depression detection method that overcomes clinical barriers. These findings offer a promising tool for early depression detection across clinical settings.</p>\n </div>","PeriodicalId":55179,"journal":{"name":"Depression and Anxiety","volume":"2025 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/da/4839334","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Depression and Anxiety","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/da/4839334","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Depression accounts for a major share of global disability-adjusted life-years (DALYs). Diagnosis typically requires a psychiatrist or lengthy self-assessments, which can be challenging for symptomatic individuals. Developing reliable, noninvasive, and accessible detection methods is a healthcare priority. Voice analysis offers a promising approach for early depression detection, potentially improving treatment access and reducing costs. This paper presents a novel pipeline for depression detection that addresses several critical challenges in the field, including data imbalance, label quality, and model generalizability. Our study utilizes a high-quality, high-depression-prevalence dataset collected from a specialized chronic pain clinic, enabling robust depression detection even with a limited sample size. We obtained a lift in the accuracy of up to 15% over the 50–50 baseline in our 52-patient dataset using a 3-fold cross-validation test (which means the train set is n = 34, std 2.8%, p-value 0.01). We further show that combining voice-only acoustic features with a single self-report question (subject unit of distress [SUDs]) significantly improves predictive accuracy. While relying on SUDs is not always good practice, our data collection setting lacked incentives to misrepresent depression status; SUDs were highly reliable, giving 86% accuracy; adding acoustic features raises it to 92%, exceeding the stand-alone potential of SUDs with a p-value 0.1. Further data collection will enhance accuracy, supporting a rapid, noninvasive depression detection method that overcomes clinical barriers. These findings offer a promising tool for early depression detection across clinical settings.
期刊介绍:
Depression and Anxiety is a scientific journal that focuses on the study of mood and anxiety disorders, as well as related phenomena in humans. The journal is dedicated to publishing high-quality research and review articles that contribute to the understanding and treatment of these conditions. The journal places a particular emphasis on articles that contribute to the clinical evaluation and care of individuals affected by mood and anxiety disorders. It prioritizes the publication of treatment-related research and review papers, as well as those that present novel findings that can directly impact clinical practice. The journal's goal is to advance the field by disseminating knowledge that can lead to better diagnosis, treatment, and management of these disorders, ultimately improving the quality of life for those who suffer from them.