M. Vasiu, L. Marghescu, Ioana Barbantan, R. Potolea
{"title":"跨文档概念增强","authors":"M. Vasiu, L. Marghescu, Ioana Barbantan, R. Potolea","doi":"10.1109/ICCP.2018.8516582","DOIUrl":null,"url":null,"abstract":"The current paper proposes a strategy for exploring and integrating related information extracted from unstructured documents with different degree of confidence, standardization and representation. The strategy was instantiated on the medical domain and designed for the English language. The goal of the proposed strategy was of augmenting the therapeutic information from patient leaflets with information extracted from clinical records. The approach proved to be a sound one as the information from the clinical records aligns with the information in the standardized sources. It confirmed the assumption that we can derive drug repositioning from clinical records and thus augmenting the existing medical knowledge. The reported metrics <95.14% precision, 83.3% recall>for patient leaflets and <94.07% precision, 87.27% recall >for EHRs measured for the concept extraction strategy, further support a good performance for the entities correlation approach. The degree of correlation between the extracted information from the two data sources reported as matches is of 85%.","PeriodicalId":259007,"journal":{"name":"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Cross Documents Concept Augmentation\",\"authors\":\"M. Vasiu, L. Marghescu, Ioana Barbantan, R. Potolea\",\"doi\":\"10.1109/ICCP.2018.8516582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The current paper proposes a strategy for exploring and integrating related information extracted from unstructured documents with different degree of confidence, standardization and representation. The strategy was instantiated on the medical domain and designed for the English language. The goal of the proposed strategy was of augmenting the therapeutic information from patient leaflets with information extracted from clinical records. The approach proved to be a sound one as the information from the clinical records aligns with the information in the standardized sources. It confirmed the assumption that we can derive drug repositioning from clinical records and thus augmenting the existing medical knowledge. The reported metrics <95.14% precision, 83.3% recall>for patient leaflets and <94.07% precision, 87.27% recall >for EHRs measured for the concept extraction strategy, further support a good performance for the entities correlation approach. The degree of correlation between the extracted information from the two data sources reported as matches is of 85%.\",\"PeriodicalId\":259007,\"journal\":{\"name\":\"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCP.2018.8516582\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCP.2018.8516582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The current paper proposes a strategy for exploring and integrating related information extracted from unstructured documents with different degree of confidence, standardization and representation. The strategy was instantiated on the medical domain and designed for the English language. The goal of the proposed strategy was of augmenting the therapeutic information from patient leaflets with information extracted from clinical records. The approach proved to be a sound one as the information from the clinical records aligns with the information in the standardized sources. It confirmed the assumption that we can derive drug repositioning from clinical records and thus augmenting the existing medical knowledge. The reported metrics <95.14% precision, 83.3% recall>for patient leaflets and <94.07% precision, 87.27% recall >for EHRs measured for the concept extraction strategy, further support a good performance for the entities correlation approach. The degree of correlation between the extracted information from the two data sources reported as matches is of 85%.