{"title":"Graph Clustering Based Size Varying Rules for Lifelong Topic Modeling","authors":"Muhammad Taimoor Khan, S. Khalid, Furqan Aziz","doi":"10.1145/3309129.3309146","DOIUrl":null,"url":null,"abstract":"Lifelong learning topic models identify the hidden concepts discussed in the collection of documents. The concepts are represented as topics having groups of ordered words based on their relevance to the topic. Lifelong learning models have an automatic learning mechanism which allows continuous learning without external support. In the process, the model gets more knowledgeable with experience as it learns from the past in the form of rules. It is carries rules to the future and utilize them when a similar scenario arises. The existing lifelong learning topic models heavily rely on statistical measures to learn rules that leads to two limitations. The rules are evaluated for fixed number of words while ignoring the natural arrangement of words within the documents. Moreover, the rules have arbitrary orientation that causes repeated patterns of transferring the impact of a rule into a topic during the early iterations of the inference technique. In this research work, we introduce complex networks analysis for learning rules which addresses both of the limitations discussed. The rules are obtained through hierarchical clustering of the complex network that have different number of words within a rule and have directed orientation. The proposed approach improves the utilization of rules for improved quality of topics at higher performance with unidirectional rules on the standard lifelong learning dataset.","PeriodicalId":326530,"journal":{"name":"Proceedings of the 5th International Conference on Bioinformatics Research and Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Bioinformatics Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3309129.3309146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Lifelong learning topic models identify the hidden concepts discussed in the collection of documents. The concepts are represented as topics having groups of ordered words based on their relevance to the topic. Lifelong learning models have an automatic learning mechanism which allows continuous learning without external support. In the process, the model gets more knowledgeable with experience as it learns from the past in the form of rules. It is carries rules to the future and utilize them when a similar scenario arises. The existing lifelong learning topic models heavily rely on statistical measures to learn rules that leads to two limitations. The rules are evaluated for fixed number of words while ignoring the natural arrangement of words within the documents. Moreover, the rules have arbitrary orientation that causes repeated patterns of transferring the impact of a rule into a topic during the early iterations of the inference technique. In this research work, we introduce complex networks analysis for learning rules which addresses both of the limitations discussed. The rules are obtained through hierarchical clustering of the complex network that have different number of words within a rule and have directed orientation. The proposed approach improves the utilization of rules for improved quality of topics at higher performance with unidirectional rules on the standard lifelong learning dataset.