Deepali Kulkarni , Abhijit Ghosh , Amey Girdhari , Shaomin Liu , L. Alexander Vance , Melissa Unruh , Joydeep Sarkar
{"title":"用三重丢失增强预训练的上下文嵌入,作为从电子健康记录衍生的精神健康临床笔记中提取临床特征的有效微调方法","authors":"Deepali Kulkarni , Abhijit Ghosh , Amey Girdhari , Shaomin Liu , L. Alexander Vance , Melissa Unruh , Joydeep Sarkar","doi":"10.1016/j.nlp.2023.100045","DOIUrl":null,"url":null,"abstract":"<div><p>The development and application of real-world evidence in the field of mental health trails other therapeutic areas like oncology and cardiovascular diseases, largely because of the lack of frequent, structured outcomes measures in routine clinical care. A wealth of valuable patient-level clinical data resides in an unstructured format in clinical notes documented at each clinical encounter. Manual extraction of this information is not scalable, and heterogeneity in recording patterns and the heavily context-dependent nature of the content renders keyword-based automated searches of little practical value. While state-of-the-art natural language processing (NLP) models based on the transformer architecture have been developed for information extraction tasks in the mental health space, they are not trained on unstructured clinical data that capture the nuances of different dimensions of mental health (e.g., symptomology, social history, etc.). We have developed a novel transformer architecture-based NLP model to capture core clinical features of patients with major depressive disorder (MDD). Initialized on MentalBERT model weights, we pre-trained our model further on clinical notes from routine mental health care and fine-tuned using triplet loss, an effective feature embedding regularizer which boosts classification and extraction of 3 specific features in patients with MDD: anhedonia, suicidal ideation with plan or intent (SP), and suicidal ideation without plan or intent (SI) or where plan or intent are unknown. Training and testing data were annotated by mental health clinicians. Using triplet loss for fine tuning led to improvement in model performance benchmarked against other standard models (MentalBERT and BioClinicalBERT) on the same tasks, achieving F1 scores of 0.99 for anhedonia, 0.94 for SP, and 0.88 for SI. Model robustness was tested by testing sensitivity of model predictions on modifications to test sentences. The application of such an NLP model can be further scaled to capture clinical features of other disorders as well as other domains like social history or history of illness.</p></div>","PeriodicalId":100944,"journal":{"name":"Natural Language Processing Journal","volume":"6 ","pages":"Article 100045"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949719123000420/pdfft?md5=f1fb11ae818a47f417a3249822e0ac8b&pid=1-s2.0-S2949719123000420-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Enhancing pre-trained contextual embeddings with triplet loss as an effective fine-tuning method for extracting clinical features from electronic health record derived mental health clinical notes\",\"authors\":\"Deepali Kulkarni , Abhijit Ghosh , Amey Girdhari , Shaomin Liu , L. Alexander Vance , Melissa Unruh , Joydeep Sarkar\",\"doi\":\"10.1016/j.nlp.2023.100045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The development and application of real-world evidence in the field of mental health trails other therapeutic areas like oncology and cardiovascular diseases, largely because of the lack of frequent, structured outcomes measures in routine clinical care. A wealth of valuable patient-level clinical data resides in an unstructured format in clinical notes documented at each clinical encounter. Manual extraction of this information is not scalable, and heterogeneity in recording patterns and the heavily context-dependent nature of the content renders keyword-based automated searches of little practical value. While state-of-the-art natural language processing (NLP) models based on the transformer architecture have been developed for information extraction tasks in the mental health space, they are not trained on unstructured clinical data that capture the nuances of different dimensions of mental health (e.g., symptomology, social history, etc.). We have developed a novel transformer architecture-based NLP model to capture core clinical features of patients with major depressive disorder (MDD). Initialized on MentalBERT model weights, we pre-trained our model further on clinical notes from routine mental health care and fine-tuned using triplet loss, an effective feature embedding regularizer which boosts classification and extraction of 3 specific features in patients with MDD: anhedonia, suicidal ideation with plan or intent (SP), and suicidal ideation without plan or intent (SI) or where plan or intent are unknown. Training and testing data were annotated by mental health clinicians. Using triplet loss for fine tuning led to improvement in model performance benchmarked against other standard models (MentalBERT and BioClinicalBERT) on the same tasks, achieving F1 scores of 0.99 for anhedonia, 0.94 for SP, and 0.88 for SI. Model robustness was tested by testing sensitivity of model predictions on modifications to test sentences. The application of such an NLP model can be further scaled to capture clinical features of other disorders as well as other domains like social history or history of illness.</p></div>\",\"PeriodicalId\":100944,\"journal\":{\"name\":\"Natural Language Processing Journal\",\"volume\":\"6 \",\"pages\":\"Article 100045\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2949719123000420/pdfft?md5=f1fb11ae818a47f417a3249822e0ac8b&pid=1-s2.0-S2949719123000420-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Language Processing Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949719123000420\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949719123000420","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhancing pre-trained contextual embeddings with triplet loss as an effective fine-tuning method for extracting clinical features from electronic health record derived mental health clinical notes
The development and application of real-world evidence in the field of mental health trails other therapeutic areas like oncology and cardiovascular diseases, largely because of the lack of frequent, structured outcomes measures in routine clinical care. A wealth of valuable patient-level clinical data resides in an unstructured format in clinical notes documented at each clinical encounter. Manual extraction of this information is not scalable, and heterogeneity in recording patterns and the heavily context-dependent nature of the content renders keyword-based automated searches of little practical value. While state-of-the-art natural language processing (NLP) models based on the transformer architecture have been developed for information extraction tasks in the mental health space, they are not trained on unstructured clinical data that capture the nuances of different dimensions of mental health (e.g., symptomology, social history, etc.). We have developed a novel transformer architecture-based NLP model to capture core clinical features of patients with major depressive disorder (MDD). Initialized on MentalBERT model weights, we pre-trained our model further on clinical notes from routine mental health care and fine-tuned using triplet loss, an effective feature embedding regularizer which boosts classification and extraction of 3 specific features in patients with MDD: anhedonia, suicidal ideation with plan or intent (SP), and suicidal ideation without plan or intent (SI) or where plan or intent are unknown. Training and testing data were annotated by mental health clinicians. Using triplet loss for fine tuning led to improvement in model performance benchmarked against other standard models (MentalBERT and BioClinicalBERT) on the same tasks, achieving F1 scores of 0.99 for anhedonia, 0.94 for SP, and 0.88 for SI. Model robustness was tested by testing sensitivity of model predictions on modifications to test sentences. The application of such an NLP model can be further scaled to capture clinical features of other disorders as well as other domains like social history or history of illness.