Kimia Hemmatirad, H. Bagherzadeh, Ehsan Fazl-Ersi, Abedin Vahedian
{"title":"基于多层次支持向量机的社交媒体心理疾病风险检测","authors":"Kimia Hemmatirad, H. Bagherzadeh, Ehsan Fazl-Ersi, Abedin Vahedian","doi":"10.1109/CFIS49607.2020.9238692","DOIUrl":null,"url":null,"abstract":"As shown by several previous studies, personal narratives through social media are often indicative of one's psychological state. In particular, mental illnesses such as depression were found to be associated with distinct linguistic patterns. However, many people with mental illness still do not receive full treatment. In this paper, we study mental illnesses through people's choice of words in expressing themselves on two popular social media platforms, Reddit and Twitter. Our goal is to develop an empirical model to detect and diagnose major mental disorders in individuals. We build a substantial dataset of posts made by people suffering from mental illnesses and the control ones, and in order to generate numerical feature from text we apply text cleaning and Word2Vec language modeling, and then for classification we used SVM machine which classifies posts and users with high accuracy. We achieve an accuracy of 95% on Twitter users and an accuracy of 73% on the Reddit challenge.","PeriodicalId":128323,"journal":{"name":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Detection of Mental Illness Risk on Social Media through Multi-level SVMs\",\"authors\":\"Kimia Hemmatirad, H. Bagherzadeh, Ehsan Fazl-Ersi, Abedin Vahedian\",\"doi\":\"10.1109/CFIS49607.2020.9238692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As shown by several previous studies, personal narratives through social media are often indicative of one's psychological state. In particular, mental illnesses such as depression were found to be associated with distinct linguistic patterns. However, many people with mental illness still do not receive full treatment. In this paper, we study mental illnesses through people's choice of words in expressing themselves on two popular social media platforms, Reddit and Twitter. Our goal is to develop an empirical model to detect and diagnose major mental disorders in individuals. We build a substantial dataset of posts made by people suffering from mental illnesses and the control ones, and in order to generate numerical feature from text we apply text cleaning and Word2Vec language modeling, and then for classification we used SVM machine which classifies posts and users with high accuracy. We achieve an accuracy of 95% on Twitter users and an accuracy of 73% on the Reddit challenge.\",\"PeriodicalId\":128323,\"journal\":{\"name\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CFIS49607.2020.9238692\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CFIS49607.2020.9238692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detection of Mental Illness Risk on Social Media through Multi-level SVMs
As shown by several previous studies, personal narratives through social media are often indicative of one's psychological state. In particular, mental illnesses such as depression were found to be associated with distinct linguistic patterns. However, many people with mental illness still do not receive full treatment. In this paper, we study mental illnesses through people's choice of words in expressing themselves on two popular social media platforms, Reddit and Twitter. Our goal is to develop an empirical model to detect and diagnose major mental disorders in individuals. We build a substantial dataset of posts made by people suffering from mental illnesses and the control ones, and in order to generate numerical feature from text we apply text cleaning and Word2Vec language modeling, and then for classification we used SVM machine which classifies posts and users with high accuracy. We achieve an accuracy of 95% on Twitter users and an accuracy of 73% on the Reddit challenge.