{"title":"比较在智慧城市决策支持领域的Scopus出版物中发现新兴术语的统计方法","authors":"N. Shilov, Nikolay Teslia","doi":"10.23919/fruct49677.2020.9211000","DOIUrl":null,"url":null,"abstract":"Discovery of emerging research topics is an important task for scientists, conference organizers, policymakers, and scientific foundations. The paper aims at comparative analysis of statistical models that can be used for discovering emerging terms in a corpus of documents. Three models are evaluated based on calculation of the $TF*IDF$ and Energy measures. As a case study, a corpus of abstracts of scientific publications related to decision support in smart city is used that was downloaded from Scopus for 2015-2020. The models are compared and directions of future research to improve the results, namely usage of combinations of models, analysis of synonyms, and usage of additional rules for filtering out non-emerging terms, are identified.","PeriodicalId":149674,"journal":{"name":"2020 27th Conference of Open Innovations Association (FRUCT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparing Statistical Measures for Discovering Emerging Terms in Scopus Publications in the Area of Decision Support in Smart City\",\"authors\":\"N. Shilov, Nikolay Teslia\",\"doi\":\"10.23919/fruct49677.2020.9211000\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Discovery of emerging research topics is an important task for scientists, conference organizers, policymakers, and scientific foundations. The paper aims at comparative analysis of statistical models that can be used for discovering emerging terms in a corpus of documents. Three models are evaluated based on calculation of the $TF*IDF$ and Energy measures. As a case study, a corpus of abstracts of scientific publications related to decision support in smart city is used that was downloaded from Scopus for 2015-2020. The models are compared and directions of future research to improve the results, namely usage of combinations of models, analysis of synonyms, and usage of additional rules for filtering out non-emerging terms, are identified.\",\"PeriodicalId\":149674,\"journal\":{\"name\":\"2020 27th Conference of Open Innovations Association (FRUCT)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 27th Conference of Open Innovations Association (FRUCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/fruct49677.2020.9211000\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 27th Conference of Open Innovations Association (FRUCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/fruct49677.2020.9211000","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparing Statistical Measures for Discovering Emerging Terms in Scopus Publications in the Area of Decision Support in Smart City
Discovery of emerging research topics is an important task for scientists, conference organizers, policymakers, and scientific foundations. The paper aims at comparative analysis of statistical models that can be used for discovering emerging terms in a corpus of documents. Three models are evaluated based on calculation of the $TF*IDF$ and Energy measures. As a case study, a corpus of abstracts of scientific publications related to decision support in smart city is used that was downloaded from Scopus for 2015-2020. The models are compared and directions of future research to improve the results, namely usage of combinations of models, analysis of synonyms, and usage of additional rules for filtering out non-emerging terms, are identified.