Parisa Khodabakhshi, Masoud Mahootchi, Hadi Mosadegh
{"title":"社交媒体文本多层次抑郁检测的新型混合聚类和分类框架","authors":"Parisa Khodabakhshi, Masoud Mahootchi, Hadi Mosadegh","doi":"10.1016/j.engappai.2025.111952","DOIUrl":null,"url":null,"abstract":"<div><div>Depression is a prevalent psychiatric condition worldwide, with significant social and economic implications. Despite its high incidence, many individuals with depression remain undiagnosed and untreated. Meanwhile, people increasingly use social media platforms to express their emotions and thoughts. Consequently, leveraging these platforms for depression detection may help address several related challenges. This paper proposes a three-stage methodology, based on text mining techniques, to determine the severity of depression in individuals who post textual content on social media. In the proposed framework, each post is transformed into a vector of numerical features using established feature extraction methods. Principal Component Analysis is then applied to select the most informative features for identifying whether a post indicates depression via a classification algorithm. If depression is detected, the method clusters the relevant posts based on their characteristics, grouping similar texts together. A second classification model is then applied within each cluster to determine the level of depression. To equip the model with effective algorithms at each stage, the Taguchi method is used to identify the best combination of feature extraction, clustering, and classification techniques. Specifically, Bidirectional Encoder Representations from Transformers (BERT) is used for deep contextual feature extraction, Deep Embedded Clustering (DEC) is employed for clustering, and Support Vector Machine (SVM) is used for classification. Numerical results show that the proposed approach can accurately classify individuals’ posts into one of four depression levels: non-depressed, low, moderate, and severe. These findings suggest that social networks offer a platform for assessing mental health through textual analysis.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"160 ","pages":"Article 111952"},"PeriodicalIF":8.0000,"publicationDate":"2025-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel hybrid clustering and classification framework for multi-level depression detection in social media texts\",\"authors\":\"Parisa Khodabakhshi, Masoud Mahootchi, Hadi Mosadegh\",\"doi\":\"10.1016/j.engappai.2025.111952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Depression is a prevalent psychiatric condition worldwide, with significant social and economic implications. Despite its high incidence, many individuals with depression remain undiagnosed and untreated. Meanwhile, people increasingly use social media platforms to express their emotions and thoughts. Consequently, leveraging these platforms for depression detection may help address several related challenges. This paper proposes a three-stage methodology, based on text mining techniques, to determine the severity of depression in individuals who post textual content on social media. In the proposed framework, each post is transformed into a vector of numerical features using established feature extraction methods. Principal Component Analysis is then applied to select the most informative features for identifying whether a post indicates depression via a classification algorithm. If depression is detected, the method clusters the relevant posts based on their characteristics, grouping similar texts together. A second classification model is then applied within each cluster to determine the level of depression. To equip the model with effective algorithms at each stage, the Taguchi method is used to identify the best combination of feature extraction, clustering, and classification techniques. Specifically, Bidirectional Encoder Representations from Transformers (BERT) is used for deep contextual feature extraction, Deep Embedded Clustering (DEC) is employed for clustering, and Support Vector Machine (SVM) is used for classification. Numerical results show that the proposed approach can accurately classify individuals’ posts into one of four depression levels: non-depressed, low, moderate, and severe. These findings suggest that social networks offer a platform for assessing mental health through textual analysis.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"160 \",\"pages\":\"Article 111952\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-08-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625019608\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625019608","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A novel hybrid clustering and classification framework for multi-level depression detection in social media texts
Depression is a prevalent psychiatric condition worldwide, with significant social and economic implications. Despite its high incidence, many individuals with depression remain undiagnosed and untreated. Meanwhile, people increasingly use social media platforms to express their emotions and thoughts. Consequently, leveraging these platforms for depression detection may help address several related challenges. This paper proposes a three-stage methodology, based on text mining techniques, to determine the severity of depression in individuals who post textual content on social media. In the proposed framework, each post is transformed into a vector of numerical features using established feature extraction methods. Principal Component Analysis is then applied to select the most informative features for identifying whether a post indicates depression via a classification algorithm. If depression is detected, the method clusters the relevant posts based on their characteristics, grouping similar texts together. A second classification model is then applied within each cluster to determine the level of depression. To equip the model with effective algorithms at each stage, the Taguchi method is used to identify the best combination of feature extraction, clustering, and classification techniques. Specifically, Bidirectional Encoder Representations from Transformers (BERT) is used for deep contextual feature extraction, Deep Embedded Clustering (DEC) is employed for clustering, and Support Vector Machine (SVM) is used for classification. Numerical results show that the proposed approach can accurately classify individuals’ posts into one of four depression levels: non-depressed, low, moderate, and severe. These findings suggest that social networks offer a platform for assessing mental health through textual analysis.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.