Renzhi Sheng , Kai Zhang , Le Chang , Yijing Chu , Han Lin , Juan Tu
{"title":"一种整合文本词性和序列特征的语义融合凹陷识别模型","authors":"Renzhi Sheng , Kai Zhang , Le Chang , Yijing Chu , Han Lin , Juan Tu","doi":"10.1016/j.apacoust.2025.111084","DOIUrl":null,"url":null,"abstract":"<div><div>Depression is a mental disorder that significantly impairs patients’ emotional and functional well-being status. With the breakthrough development of natural language processing technology, text-based depression recognition has emerged as a promising non-invasive diagnostic approach. Nevertheless, traditional models often suffer from limited feature representation and contextual ambiguity, particularly in Chinese. This study proposes a novel semantic fusion depression recognition model (SFDRM) was proposed by integrating part-of-speech (POS) features with sequential features of text to address these challenges. First of all, an improved TF-IDF algorithm was developed by incorporating the sentiment dictionary with semantic rules, thereafter extracting sentiment-oriented POS based. Second, global semantic representations were extracted using BERT, and local contextual features are captured via BiLSTM. Finally, a self-attention mechanism was introduced to dynamically fuse these multi-granular features, enabling the model to emphasize salient information. Experiments conducted on two publicly available Chinese depression corpora (EATD and CMDC) showed that SFDDM has higher accuracy (0.86) and F1 score (0.80) compared to traditional and single feature models. This indicated that the model effectively addressed the ambiguity problem in Chinese sentiment analysis, and effectively improved the generalization ability and robustness of the model. Therefore, the current SFDRM might provide a more effective and universal solution for intelligent depression screening based on Chinese language patterns, and lays the foundation for the integration of acoustic features in multimodal mental health assessment in the future.</div></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":"242 ","pages":"Article 111084"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A semantic fusion depression recognition model integrating textual part-of-speech and sequential features\",\"authors\":\"Renzhi Sheng , Kai Zhang , Le Chang , Yijing Chu , Han Lin , Juan Tu\",\"doi\":\"10.1016/j.apacoust.2025.111084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Depression is a mental disorder that significantly impairs patients’ emotional and functional well-being status. With the breakthrough development of natural language processing technology, text-based depression recognition has emerged as a promising non-invasive diagnostic approach. Nevertheless, traditional models often suffer from limited feature representation and contextual ambiguity, particularly in Chinese. This study proposes a novel semantic fusion depression recognition model (SFDRM) was proposed by integrating part-of-speech (POS) features with sequential features of text to address these challenges. First of all, an improved TF-IDF algorithm was developed by incorporating the sentiment dictionary with semantic rules, thereafter extracting sentiment-oriented POS based. Second, global semantic representations were extracted using BERT, and local contextual features are captured via BiLSTM. Finally, a self-attention mechanism was introduced to dynamically fuse these multi-granular features, enabling the model to emphasize salient information. Experiments conducted on two publicly available Chinese depression corpora (EATD and CMDC) showed that SFDDM has higher accuracy (0.86) and F1 score (0.80) compared to traditional and single feature models. This indicated that the model effectively addressed the ambiguity problem in Chinese sentiment analysis, and effectively improved the generalization ability and robustness of the model. Therefore, the current SFDRM might provide a more effective and universal solution for intelligent depression screening based on Chinese language patterns, and lays the foundation for the integration of acoustic features in multimodal mental health assessment in the future.</div></div>\",\"PeriodicalId\":55506,\"journal\":{\"name\":\"Applied Acoustics\",\"volume\":\"242 \",\"pages\":\"Article 111084\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Acoustics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0003682X25005560\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X25005560","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
A semantic fusion depression recognition model integrating textual part-of-speech and sequential features
Depression is a mental disorder that significantly impairs patients’ emotional and functional well-being status. With the breakthrough development of natural language processing technology, text-based depression recognition has emerged as a promising non-invasive diagnostic approach. Nevertheless, traditional models often suffer from limited feature representation and contextual ambiguity, particularly in Chinese. This study proposes a novel semantic fusion depression recognition model (SFDRM) was proposed by integrating part-of-speech (POS) features with sequential features of text to address these challenges. First of all, an improved TF-IDF algorithm was developed by incorporating the sentiment dictionary with semantic rules, thereafter extracting sentiment-oriented POS based. Second, global semantic representations were extracted using BERT, and local contextual features are captured via BiLSTM. Finally, a self-attention mechanism was introduced to dynamically fuse these multi-granular features, enabling the model to emphasize salient information. Experiments conducted on two publicly available Chinese depression corpora (EATD and CMDC) showed that SFDDM has higher accuracy (0.86) and F1 score (0.80) compared to traditional and single feature models. This indicated that the model effectively addressed the ambiguity problem in Chinese sentiment analysis, and effectively improved the generalization ability and robustness of the model. Therefore, the current SFDRM might provide a more effective and universal solution for intelligent depression screening based on Chinese language patterns, and lays the foundation for the integration of acoustic features in multimodal mental health assessment in the future.
期刊介绍:
Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense.
Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems.
Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.