Towards breastfeeding self-efficacy and postpartum depression estimation based on analysis of free-speech interviews through natural language processing
Luz Itzel Valdeolivar-Hernandez, M. E. Flores Quijano, Juan Carlos Echeverría-Arjonilla, J. Perez-Gonzalez, O. Piña-Ramírez
{"title":"Towards breastfeeding self-efficacy and postpartum depression estimation based on analysis of free-speech interviews through natural language processing","authors":"Luz Itzel Valdeolivar-Hernandez, M. E. Flores Quijano, Juan Carlos Echeverría-Arjonilla, J. Perez-Gonzalez, O. Piña-Ramírez","doi":"10.1117/12.2669883","DOIUrl":null,"url":null,"abstract":"Edinburgh Postpartum Depression (EPDS) and Breastfeeding Self-Efficacy (BSES) scales are standardized questionnaires to screen for postpartum depression and breastfeeding performance self-perception. On the other hand, Natural Language Processing (NLP) is a machine learning technique that analyses the human language to extract relevant and computer-interpretable information. In this work we proposed the application of an NLP toolchain that includes a typical preprocessing stage and the probabilistic topic modeling performed through the Latent Dirichlet Allocation (LDA) to find out the two most relevant topics within each of six study groups (low, medium, and high scores of BSES and EPDS). Each topic LDA-modeled consisted of 30-word/terms (tokens) which are organized in Venn diagrams, contrasting the mutually exclusive tokens within the low and high scores on each scale. Coherence and log-Perplexity topic modeling performance metrics, were computed. We found that LDA-models have distinguishable tokens between low and high scores of the BSES and EPDS. However, the most remarkable findings were two subset of tokens, one related to newborn care and another to newborn intake, respectively correlated to low and high postpartum depression risk according to EPDS.","PeriodicalId":147201,"journal":{"name":"Symposium on Medical Information Processing and Analysis","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Symposium on Medical Information Processing and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2669883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Edinburgh Postpartum Depression (EPDS) and Breastfeeding Self-Efficacy (BSES) scales are standardized questionnaires to screen for postpartum depression and breastfeeding performance self-perception. On the other hand, Natural Language Processing (NLP) is a machine learning technique that analyses the human language to extract relevant and computer-interpretable information. In this work we proposed the application of an NLP toolchain that includes a typical preprocessing stage and the probabilistic topic modeling performed through the Latent Dirichlet Allocation (LDA) to find out the two most relevant topics within each of six study groups (low, medium, and high scores of BSES and EPDS). Each topic LDA-modeled consisted of 30-word/terms (tokens) which are organized in Venn diagrams, contrasting the mutually exclusive tokens within the low and high scores on each scale. Coherence and log-Perplexity topic modeling performance metrics, were computed. We found that LDA-models have distinguishable tokens between low and high scores of the BSES and EPDS. However, the most remarkable findings were two subset of tokens, one related to newborn care and another to newborn intake, respectively correlated to low and high postpartum depression risk according to EPDS.