{"title":"为实时监测系统提取疾病事件","authors":"Minh-Tien Nguyen, Tri-Thanh Nguyen","doi":"10.1145/2542050.2542084","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a method that uses both semantic rules and machine learning to extract infectious disease events in Vietnamese electronic news, which can be used in a real-time system of monitoring the spread of diseases. Our method contains two important steps: detecting disease events from unstructured data and extracting information of the disease events. The event detection uses semantic rules and machine learning to detect a disease event; in the later step, Name Entity Recognition (NER), rules, and dictionaries are used to capture the event's information. The performance of detection step is ≈77,33% (F-score) and the precision of extraction step is ≈91,89%. These results are better that those of the experiments in which rules were not used. This indicates that our method is suitable for extracting disease events in Vietnamese text.","PeriodicalId":246033,"journal":{"name":"Proceedings of the 4th Symposium on Information and Communication Technology","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Extraction of disease events for a real-time monitoring system\",\"authors\":\"Minh-Tien Nguyen, Tri-Thanh Nguyen\",\"doi\":\"10.1145/2542050.2542084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a method that uses both semantic rules and machine learning to extract infectious disease events in Vietnamese electronic news, which can be used in a real-time system of monitoring the spread of diseases. Our method contains two important steps: detecting disease events from unstructured data and extracting information of the disease events. The event detection uses semantic rules and machine learning to detect a disease event; in the later step, Name Entity Recognition (NER), rules, and dictionaries are used to capture the event's information. The performance of detection step is ≈77,33% (F-score) and the precision of extraction step is ≈91,89%. These results are better that those of the experiments in which rules were not used. This indicates that our method is suitable for extracting disease events in Vietnamese text.\",\"PeriodicalId\":246033,\"journal\":{\"name\":\"Proceedings of the 4th Symposium on Information and Communication Technology\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th Symposium on Information and Communication Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2542050.2542084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th Symposium on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2542050.2542084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extraction of disease events for a real-time monitoring system
In this paper, we propose a method that uses both semantic rules and machine learning to extract infectious disease events in Vietnamese electronic news, which can be used in a real-time system of monitoring the spread of diseases. Our method contains two important steps: detecting disease events from unstructured data and extracting information of the disease events. The event detection uses semantic rules and machine learning to detect a disease event; in the later step, Name Entity Recognition (NER), rules, and dictionaries are used to capture the event's information. The performance of detection step is ≈77,33% (F-score) and the precision of extraction step is ≈91,89%. These results are better that those of the experiments in which rules were not used. This indicates that our method is suitable for extracting disease events in Vietnamese text.