乳腺癌社区论坛中患者生成内容的多标签主题分类

B. Athira, S. M. Idicula, Josette F. Jones
{"title":"乳腺癌社区论坛中患者生成内容的多标签主题分类","authors":"B. Athira, S. M. Idicula, Josette F. Jones","doi":"10.1145/3418094.3418132","DOIUrl":null,"url":null,"abstract":"The research community has been noticing the importance of the online forums in healthcare in understanding the nuances of health-related problems. Breast Cancer is the most common malignance among women worldwide and its survival rate is steadily increasing thanks to early diagnosis and timely and effective treatment. Still how they manage the disease and maintain the quality of daily life are worth understanding. And it is in this context that the online posts of patients with breast cancer, discussing several topics, become patient generated content. The multi-label nature with these posts arising out of the combined contents of these posts can bring out a multitude of divulging conclusions. The resulting classification issues of these online posts under various categories of topics is examined in the present work. A semi-supervised multi-label classification, followed by refinement of multiple assignments of labels based on fuzzy logic and a neighborhood technique is proposed in the paper. Multiplicity of labels, occurs during the assignment of labels to clusters based on proximity. While extending the refined label set to a greater number of unlabeled posts clustered on the basis of proximity, it is observed that the proposed method could bring out more information on the description of posts. The results thus convey that the most discussed topic in the posts is about diagnosis, along with tangential reference to adverse drug effect, presumably to offer support in terms of information or viewpoint. The results show how the diverse nature of multiple labels can be effectively harnessed to draw conclusion from the potential of social media posts of patients' experience in critical health problems.","PeriodicalId":192804,"journal":{"name":"Proceedings of the 4th International Conference on Medical and Health Informatics","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-label Topic Classification of Patient Generated Content in a Breast-cancer Community Forum\",\"authors\":\"B. Athira, S. M. Idicula, Josette F. Jones\",\"doi\":\"10.1145/3418094.3418132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The research community has been noticing the importance of the online forums in healthcare in understanding the nuances of health-related problems. Breast Cancer is the most common malignance among women worldwide and its survival rate is steadily increasing thanks to early diagnosis and timely and effective treatment. Still how they manage the disease and maintain the quality of daily life are worth understanding. And it is in this context that the online posts of patients with breast cancer, discussing several topics, become patient generated content. The multi-label nature with these posts arising out of the combined contents of these posts can bring out a multitude of divulging conclusions. The resulting classification issues of these online posts under various categories of topics is examined in the present work. A semi-supervised multi-label classification, followed by refinement of multiple assignments of labels based on fuzzy logic and a neighborhood technique is proposed in the paper. Multiplicity of labels, occurs during the assignment of labels to clusters based on proximity. While extending the refined label set to a greater number of unlabeled posts clustered on the basis of proximity, it is observed that the proposed method could bring out more information on the description of posts. The results thus convey that the most discussed topic in the posts is about diagnosis, along with tangential reference to adverse drug effect, presumably to offer support in terms of information or viewpoint. The results show how the diverse nature of multiple labels can be effectively harnessed to draw conclusion from the potential of social media posts of patients' experience in critical health problems.\",\"PeriodicalId\":192804,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Medical and Health Informatics\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Medical and Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3418094.3418132\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Medical and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3418094.3418132","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

研究界已经注意到在线论坛在了解健康相关问题的细微差别方面的重要性。乳腺癌是全世界妇女中最常见的恶性肿瘤,由于早期诊断和及时有效的治疗,其生存率正在稳步上升。但他们如何控制疾病和维持日常生活质量仍值得了解。正是在这种背景下,乳腺癌患者的在线帖子,讨论了几个话题,成为了患者生成的内容。这些帖子的多标签性质源于这些帖子的综合内容,可以得出许多泄露性的结论。在本工作中审查了这些在线帖子在各种主题类别下的分类问题。本文提出了一种基于模糊逻辑和邻域技术的半监督多标签分类方法,并在此基础上对多个标签的分配进行了改进。标签的多重性,发生在根据邻近度给聚类分配标签的过程中。将改进后的标签集扩展到更多基于接近度聚类的未标记帖子,观察到所提出的方法可以提供更多关于帖子描述的信息。因此,结果表明,帖子中讨论最多的话题是关于诊断的,同时也偶尔提到药物不良反应,可能是为了提供信息或观点方面的支持。研究结果表明,如何有效地利用多重标签的多样性,从社交媒体上关于患者在严重健康问题上的经历的帖子中得出结论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-label Topic Classification of Patient Generated Content in a Breast-cancer Community Forum
The research community has been noticing the importance of the online forums in healthcare in understanding the nuances of health-related problems. Breast Cancer is the most common malignance among women worldwide and its survival rate is steadily increasing thanks to early diagnosis and timely and effective treatment. Still how they manage the disease and maintain the quality of daily life are worth understanding. And it is in this context that the online posts of patients with breast cancer, discussing several topics, become patient generated content. The multi-label nature with these posts arising out of the combined contents of these posts can bring out a multitude of divulging conclusions. The resulting classification issues of these online posts under various categories of topics is examined in the present work. A semi-supervised multi-label classification, followed by refinement of multiple assignments of labels based on fuzzy logic and a neighborhood technique is proposed in the paper. Multiplicity of labels, occurs during the assignment of labels to clusters based on proximity. While extending the refined label set to a greater number of unlabeled posts clustered on the basis of proximity, it is observed that the proposed method could bring out more information on the description of posts. The results thus convey that the most discussed topic in the posts is about diagnosis, along with tangential reference to adverse drug effect, presumably to offer support in terms of information or viewpoint. The results show how the diverse nature of multiple labels can be effectively harnessed to draw conclusion from the potential of social media posts of patients' experience in critical health problems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信