{"title":"HCHIRSIMEX: An extended method for domain ontology learning based on conditional mutual information","authors":"O. Idrissi, B. Frikh, B. Ouhbi","doi":"10.1109/CIST.2014.7016600","DOIUrl":null,"url":null,"abstract":"This paper presents HCHIRSIMEX, an extended version of our previous algorithm HCHIRSIM for building domain ontology from web corpus. The new version introduces a novel measure based on the Conditional Mutual Information (CMI) statistic method to define the taxonomic relations and the similarity between selected concepts. By using this method, the ontology extracted by HCHIRSIMEX is more concise and contains a richer concept knowledge base compared with the previous version HCHIRSIM. To evaluate our new algorithm effectiveness, we apply the two algorithms and Sanchez et al. algorithm in Finance domain ontology constructed from the web. Then, we compare the obtained concepts with those on the “Financial glossary” provided by Yahoo.com.","PeriodicalId":106483,"journal":{"name":"2014 Third IEEE International Colloquium in Information Science and Technology (CIST)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Third IEEE International Colloquium in Information Science and Technology (CIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIST.2014.7016600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
This paper presents HCHIRSIMEX, an extended version of our previous algorithm HCHIRSIM for building domain ontology from web corpus. The new version introduces a novel measure based on the Conditional Mutual Information (CMI) statistic method to define the taxonomic relations and the similarity between selected concepts. By using this method, the ontology extracted by HCHIRSIMEX is more concise and contains a richer concept knowledge base compared with the previous version HCHIRSIM. To evaluate our new algorithm effectiveness, we apply the two algorithms and Sanchez et al. algorithm in Finance domain ontology constructed from the web. Then, we compare the obtained concepts with those on the “Financial glossary” provided by Yahoo.com.