Trung Tran Van, Kien Vu Sy, Tuan Tran Anh, V. Duc, Thang Luu Quang, Phuong Hoang Xuan, V. Luu, Q. H. Bui, S. Pham
{"title":"Design of an GIS-based Investment Heatmap System using Topic Classification and NER","authors":"Trung Tran Van, Kien Vu Sy, Tuan Tran Anh, V. Duc, Thang Luu Quang, Phuong Hoang Xuan, V. Luu, Q. H. Bui, S. Pham","doi":"10.1109/KSE53942.2021.9648730","DOIUrl":null,"url":null,"abstract":"In recent years, Vietnam has received a significantly increasing Foreign Direct Investment (FDI) year on year. It has lead to the creation of a large number of social news that reflect to a certain extent the investment activities. Quantitatively extracting such information would be meaningful in analyzing market's direction. The objective of this study was to design a social listening system to identify key investment activities and trends over time using historical news data. First, we present the first-of-its-kind manually annotated investment domain-specific dataset for Vietnamese. Particularly, our dataset is annotated for join-tasks of 1) topic classification and 2) named entity recognition (NER) with newly-defined entity types. Second, empirical experiment was conducted using strong baselines on our dataset and show potential results with $\\mathrm{F}1=82.43$ for topic classification task, and $\\mathrm{F}1=92.15$ for NER task. Finally, we demonstrate the results on a Geographic Information System (GIS)-based heatmap system for the analysis of real-world social listening problem.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"17 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648730","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In recent years, Vietnam has received a significantly increasing Foreign Direct Investment (FDI) year on year. It has lead to the creation of a large number of social news that reflect to a certain extent the investment activities. Quantitatively extracting such information would be meaningful in analyzing market's direction. The objective of this study was to design a social listening system to identify key investment activities and trends over time using historical news data. First, we present the first-of-its-kind manually annotated investment domain-specific dataset for Vietnamese. Particularly, our dataset is annotated for join-tasks of 1) topic classification and 2) named entity recognition (NER) with newly-defined entity types. Second, empirical experiment was conducted using strong baselines on our dataset and show potential results with $\mathrm{F}1=82.43$ for topic classification task, and $\mathrm{F}1=92.15$ for NER task. Finally, we demonstrate the results on a Geographic Information System (GIS)-based heatmap system for the analysis of real-world social listening problem.