{"title":"Combining Information Extratction for Text Mining by Using Morphological Patterns and Knowledge Discovery Using Inductive Logic Programming","authors":"A. Christy, P. Thambidurai","doi":"10.1109/ISCIII.2007.367362","DOIUrl":null,"url":null,"abstract":"This paper introduces concepts and a rule-based model for information extraction (IE) strategy using unsupervised algorithm and inductive learning in a top-down fashion. We have used the natural language processing techniques for identifying the morphological patterns (features) and for constructing patterns based on which the necessary information is extracted. The extracted information is then used to discover knowledge in the form of if-then rules. We have considered the technical abstracts of two different domains, by relating the information extracted from the abstract part with the information provided in the conclusion part. The information gain is found as the result of knowledge discovery and we have found our system producing an accuracy of 90%.","PeriodicalId":314768,"journal":{"name":"2007 International Symposium on Computational Intelligence and Intelligent Informatics","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Symposium on Computational Intelligence and Intelligent Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCIII.2007.367362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces concepts and a rule-based model for information extraction (IE) strategy using unsupervised algorithm and inductive learning in a top-down fashion. We have used the natural language processing techniques for identifying the morphological patterns (features) and for constructing patterns based on which the necessary information is extracted. The extracted information is then used to discover knowledge in the form of if-then rules. We have considered the technical abstracts of two different domains, by relating the information extracted from the abstract part with the information provided in the conclusion part. The information gain is found as the result of knowledge discovery and we have found our system producing an accuracy of 90%.