{"title":"基于文本数据的社交媒体抑郁分类方法研究","authors":"Yiqing Zhou, Junzi Zhang","doi":"10.1109/ISoIRS57349.2022.00030","DOIUrl":null,"url":null,"abstract":"The cause of depression is not well understood by researchers at present, but its existence has seriously harmed people's health. Therefore, it is very important to quickly judge whether it is depression in today's society. In the work, the text dataset from social media is first vectorized with Bag-of-words and self-training Word2Vec, and then the text represented by Bag-of-words is trained and tested in different types of traditional machine learning classification methods. For text represented by Word2Vec, CNN and RNN algorithms are added for training. Finally, we compare and analyse the classification effects of traditional machine learning algorithms and deep learning algorithms with different text vectorization representations on text dataset, so as to find a better classification model of depression based on text dataset. The experimental results show that CNN and Logistic Regression model are better in the task of depression classification based on text dataset.","PeriodicalId":405065,"journal":{"name":"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Study on Social Media Depression Classification Method Based on Text Data\",\"authors\":\"Yiqing Zhou, Junzi Zhang\",\"doi\":\"10.1109/ISoIRS57349.2022.00030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The cause of depression is not well understood by researchers at present, but its existence has seriously harmed people's health. Therefore, it is very important to quickly judge whether it is depression in today's society. In the work, the text dataset from social media is first vectorized with Bag-of-words and self-training Word2Vec, and then the text represented by Bag-of-words is trained and tested in different types of traditional machine learning classification methods. For text represented by Word2Vec, CNN and RNN algorithms are added for training. Finally, we compare and analyse the classification effects of traditional machine learning algorithms and deep learning algorithms with different text vectorization representations on text dataset, so as to find a better classification model of depression based on text dataset. The experimental results show that CNN and Logistic Regression model are better in the task of depression classification based on text dataset.\",\"PeriodicalId\":405065,\"journal\":{\"name\":\"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)\",\"volume\":\"120 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISoIRS57349.2022.00030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Intelligent Robotics and Systems (ISoIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISoIRS57349.2022.00030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Study on Social Media Depression Classification Method Based on Text Data
The cause of depression is not well understood by researchers at present, but its existence has seriously harmed people's health. Therefore, it is very important to quickly judge whether it is depression in today's society. In the work, the text dataset from social media is first vectorized with Bag-of-words and self-training Word2Vec, and then the text represented by Bag-of-words is trained and tested in different types of traditional machine learning classification methods. For text represented by Word2Vec, CNN and RNN algorithms are added for training. Finally, we compare and analyse the classification effects of traditional machine learning algorithms and deep learning algorithms with different text vectorization representations on text dataset, so as to find a better classification model of depression based on text dataset. The experimental results show that CNN and Logistic Regression model are better in the task of depression classification based on text dataset.