Alex X. Wang, S. Chukova, Colin Simpson, Binh P. Nguyen
{"title":"Data-centric AI to Improve Early Detection of Mental Illness","authors":"Alex X. Wang, S. Chukova, Colin Simpson, Binh P. Nguyen","doi":"10.1109/SSP53291.2023.10207938","DOIUrl":null,"url":null,"abstract":"The growth of information technology and advancements in artificial intelligence (AI) have made data creation and usage more prevalent. AI research can be grouped into two categories: model-centric and data-centric. Model-centric AI focuses on using the same data and making changes to model hyper-parameters, architectures, and other configurations. Data-centric AI, on the other hand, prioritizes improving existing data or incorporating new data to improve the performance of machine learning (ML) algorithms. Data-centric AI can greatly improve the performance of machine learning models by improving data quality, increasing data diversity, and using advanced data augmentation methods. The use of ML for early detection of mental health issues is vital due to its ability to identify issues early, provide personalized treatments, detect patterns, and increase accessibility to mental health services. While there have been numerous mental illness detection studies using model-centric approaches, there is a lack of research from a data-centric AI perspective. This study aims to address this gap by comparing established tabular data synthesis methods to explore the impact of synthetic data and data-centric AI on the early detection of mental health issues.","PeriodicalId":296346,"journal":{"name":"2023 IEEE Statistical Signal Processing Workshop (SSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Statistical Signal Processing Workshop (SSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSP53291.2023.10207938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The growth of information technology and advancements in artificial intelligence (AI) have made data creation and usage more prevalent. AI research can be grouped into two categories: model-centric and data-centric. Model-centric AI focuses on using the same data and making changes to model hyper-parameters, architectures, and other configurations. Data-centric AI, on the other hand, prioritizes improving existing data or incorporating new data to improve the performance of machine learning (ML) algorithms. Data-centric AI can greatly improve the performance of machine learning models by improving data quality, increasing data diversity, and using advanced data augmentation methods. The use of ML for early detection of mental health issues is vital due to its ability to identify issues early, provide personalized treatments, detect patterns, and increase accessibility to mental health services. While there have been numerous mental illness detection studies using model-centric approaches, there is a lack of research from a data-centric AI perspective. This study aims to address this gap by comparing established tabular data synthesis methods to explore the impact of synthetic data and data-centric AI on the early detection of mental health issues.