{"title":"基于空间约简启发式的模式匹配方法在生物医学文本中搜索缩略语定义对","authors":"P. C. Rafeeque, K. A. Abdul Nazeer","doi":"10.1109/ADCOM.2007.118","DOIUrl":null,"url":null,"abstract":"This paper deals with the problem of mining acronyms and their definitions from biomedical text. We propose an effective text mining system by using pattern matching method. Different stages of the design have been explained with pseudo code. We used space reduction heuristic constraints (D. Nadeau and P. Turney, 2005) which will increase the precision by reducing the number of candidate definitions and will include most of the true cases. The pattern matching method does not require training data to run as in the case of learning techniques. This will make the process simple and fast. Evaluation has been done by using three metrics - recall (measure of how much relevant information the system has extracted from text), precision (measure of how much information returned by the system is actually correct) and f-factor (combined value of recall and precision). Experimental results achieved 92% recall and 97.2% precision.","PeriodicalId":185608,"journal":{"name":"15th International Conference on Advanced Computing and Communications (ADCOM 2007)","volume":"29 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Text Mining for Finding Acronym-Definition Pairs from Biomedical Text Using Pattern Matching Method with Space Reduction Heuristics\",\"authors\":\"P. C. Rafeeque, K. A. Abdul Nazeer\",\"doi\":\"10.1109/ADCOM.2007.118\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with the problem of mining acronyms and their definitions from biomedical text. We propose an effective text mining system by using pattern matching method. Different stages of the design have been explained with pseudo code. We used space reduction heuristic constraints (D. Nadeau and P. Turney, 2005) which will increase the precision by reducing the number of candidate definitions and will include most of the true cases. The pattern matching method does not require training data to run as in the case of learning techniques. This will make the process simple and fast. Evaluation has been done by using three metrics - recall (measure of how much relevant information the system has extracted from text), precision (measure of how much information returned by the system is actually correct) and f-factor (combined value of recall and precision). Experimental results achieved 92% recall and 97.2% precision.\",\"PeriodicalId\":185608,\"journal\":{\"name\":\"15th International Conference on Advanced Computing and Communications (ADCOM 2007)\",\"volume\":\"29 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"15th International Conference on Advanced Computing and Communications (ADCOM 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ADCOM.2007.118\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"15th International Conference on Advanced Computing and Communications (ADCOM 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ADCOM.2007.118","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text Mining for Finding Acronym-Definition Pairs from Biomedical Text Using Pattern Matching Method with Space Reduction Heuristics
This paper deals with the problem of mining acronyms and their definitions from biomedical text. We propose an effective text mining system by using pattern matching method. Different stages of the design have been explained with pseudo code. We used space reduction heuristic constraints (D. Nadeau and P. Turney, 2005) which will increase the precision by reducing the number of candidate definitions and will include most of the true cases. The pattern matching method does not require training data to run as in the case of learning techniques. This will make the process simple and fast. Evaluation has been done by using three metrics - recall (measure of how much relevant information the system has extracted from text), precision (measure of how much information returned by the system is actually correct) and f-factor (combined value of recall and precision). Experimental results achieved 92% recall and 97.2% precision.