{"title":"Temporal Topic Correlation and Evolution","authors":"A. Prayote, Kallaya Songklang","doi":"10.1109/JCSSE.2018.8457380","DOIUrl":null,"url":null,"abstract":"This paper presents a technique to alternatively discover the temporal research topics correlation by using a topic model, Latent Dirichlet Allocation (LDA). LDA model assumes the documents as a mixture of topics that group the co-occurrence words with the certain probabilities. Hence the model is popularly used to extract the latent topics from document collections. However, LDA gives an independence assumption between topics and is unable to model the correlation between the topics. Motivated by above limitation, this study introduces a method for improving the topic correlation. The correlation of two topics from different time periods can occur when there exists a publication tagged by the two topics and these two topics are said to be co-occurred by this publication. LDA weights of these co-occurred topics are used in our model to calculate gross-correlation values. The number of publications in a topic co-occurrence is also used in the model. Therefore, we split dataset into groups with some common sub-dataset ordered by temporal timestamp of published year. The experiment results show the correlation between topics in different time periods and results can further support the research collaboration in future.","PeriodicalId":338973,"journal":{"name":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2018.8457380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a technique to alternatively discover the temporal research topics correlation by using a topic model, Latent Dirichlet Allocation (LDA). LDA model assumes the documents as a mixture of topics that group the co-occurrence words with the certain probabilities. Hence the model is popularly used to extract the latent topics from document collections. However, LDA gives an independence assumption between topics and is unable to model the correlation between the topics. Motivated by above limitation, this study introduces a method for improving the topic correlation. The correlation of two topics from different time periods can occur when there exists a publication tagged by the two topics and these two topics are said to be co-occurred by this publication. LDA weights of these co-occurred topics are used in our model to calculate gross-correlation values. The number of publications in a topic co-occurrence is also used in the model. Therefore, we split dataset into groups with some common sub-dataset ordered by temporal timestamp of published year. The experiment results show the correlation between topics in different time periods and results can further support the research collaboration in future.