T. Georgieva-Trifonova, Miroslav Galabov, D. Valcheva, Teodor Kalushkov
2019 3rd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), published 2019-10-01
DOI: 10.1109/ISMSIT.2019.8932866 (https://doi.org/10.1109/ISMSIT.2019.8932866)
Graph-Based Representation of Customer Reviews for Online Stores
This paper investigates a graph-based representation of the data required for the vector space model (VSM) and the PMI (pointwise mutual information)-enriched VSM used in text mining (e.g. text classification). It describes the transformation of a dataset of free-text reviews of online stores into graph-based form and proposes a format that can be used by the Neo4j graph database management system. Queries for retrieving the data required to train text mining models are considered, and the steps and actions for modifying that data when new data is received are specified. The advantages of the proposed graph-based representation for maintaining and extracting the current data needed to retrain data mining models are summarized; the aim of such retraining is to prevent the results of the respective data mining task from losing performance.
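
To make the idea concrete, the following is a minimal sketch, not the authors' schema or queries. It assumes a bipartite review-term graph held in plain Python dictionaries (standing in for a graph database), a toy set of labelled reviews invented for illustration, and PMI computed between a term and a class label at the review level. The abstract does not say which PMI variant the paper uses, so that choice, along with the names `build_graph`, `document_frequency`, and `pmi`, is hypothetical.

```python
import math
from collections import defaultdict

# Toy labelled reviews, invented for illustration only.
reviews = [
    ("fast delivery great price", "pos"),
    ("slow delivery bad service", "neg"),
    ("great service fast", "pos"),
    ("bad price slow", "neg"),
]

def build_graph(reviews):
    """Build review -> term edges (weighted by term count) and review -> label."""
    edges, labels = {}, {}
    for rid, (text, label) in enumerate(reviews):
        counts = defaultdict(int)
        for token in text.lower().split():
            counts[token] += 1
        edges[rid] = dict(counts)
        labels[rid] = label
    return edges, labels

def document_frequency(edges):
    """Number of reviews connected to each term (needed for VSM weighting)."""
    df = defaultdict(int)
    for terms in edges.values():
        for term in terms:
            df[term] += 1
    return dict(df)

def pmi(edges, labels, term, label):
    """PMI(term, label) = log2( P(term, label) / (P(term) * P(label)) ),
    with probabilities estimated as fractions of reviews."""
    n = len(edges)
    n_term = sum(1 for terms in edges.values() if term in terms)
    n_label = sum(1 for l in labels.values() if l == label)
    n_both = sum(
        1 for rid, terms in edges.items()
        if term in terms and labels[rid] == label
    )
    if n_both == 0:
        return float("-inf")  # term never co-occurs with this label
    return math.log2((n_both * n) / (n_term * n_label))

edges, labels = build_graph(reviews)
print(document_frequency(edges)["delivery"])   # 2
print(pmi(edges, labels, "great", "pos"))      # 1.0
print(pmi(edges, labels, "delivery", "pos"))   # 0.0
```

In this sketch, handling a new review only means adding one entry to `edges` and one to `labels`; the counts used by the VSM and PMI are then recomputed from the current graph, which mirrors the maintenance advantage the abstract describes.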