Authors: M. Magumba, Peter Nabende, Ernest Mwebaze
Published in: 2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA), 2017-09-01
DOI: 10.1109/CIAPP.2017.8167182 (https://doi.org/10.1109/CIAPP.2017.8167182)
Ontology driven machine learning approach for disease name extraction from Twitter messages
Twitter and social media as a whole have great potential as a source of disease surveillance data; however, the general messiness of tweets presents several challenges for standard information extraction methods. Current methods for disease surveillance on Twitter rely on inflexible keyword-based approaches that require messages to be pre-filtered by a disease name supplied a priori, and they therefore cannot detect new ailments. In this paper we present an ontology-based machine learning approach to extract disease names and expressions describing ailments from tweets, which may be employed as part of a larger general-purpose system for automated disease incidence monitoring. We also propose a simple methodology for automatic detection and correction of errors.
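
To make the limitation of keyword pre-filtering concrete, the following is a minimal sketch of such a baseline, not the method proposed in the paper. The keyword set `KNOWN_DISEASES`, the function `keyword_prefilter`, and the sample tweets are all hypothetical and chosen only for illustration.

```python
# Hypothetical a priori keyword list; a real system would use a curated vocabulary.
KNOWN_DISEASES = {"flu", "malaria", "cholera"}


def keyword_prefilter(tweets, keywords=KNOWN_DISEASES):
    """Keep only tweets that mention one of the disease names supplied in advance."""
    kept = []
    for tweet in tweets:
        # Crude tokenisation: split on whitespace, trim common punctuation, lowercase.
        tokens = {tok.strip(".,!?#@").lower() for tok in tweet.split()}
        if tokens & keywords:
            kept.append(tweet)
    return kept


# Made-up example messages, not data from the paper.
tweets = [
    "Down with the flu again, staying home today",
    "Half my office has this weird rash and fever going around",
    "Malaria cases rising in the district #health",
]

filtered = keyword_prefilter(tweets)
for tweet in filtered:
    print(tweet)
```

The second message describes an ailment without using any listed disease name, so this filter discards it. That is the kind of expression a fixed keyword list cannot capture.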