Title: Nowcasting of Corporate Research and Development trends through news article analysis by BERTopic: the case of Japanese electric company
Authors: Haruna Okazaki, Hiroshi Takahashi
Venue: 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)
Publication date: 2022-11-16
DOI: https://doi.org/10.1109/ICECCME55909.2022.9987867
Citations: 1
Abstract
Various means exist for obtaining information about a company. However, many of them are not timely and, from the investor's point of view, do not provide enough information. It is even more difficult to track the trends of diversified companies with multiple segments. This study therefore aims to extract timely information on the trends of each segment of a company from news data using BERTopic. The analysis targets news headlines of diversified electronics firms listed on the Japanese stock market. The sample period is 24 years, from 1996 to 2019, and 26,058 news items were analyzed. The analysis shows that (1) BERTopic can classify the target news into 46 topics, (2) company segments can be identified and trends in company activities extracted from the classified topics, and (3) the time-series transition of topics related to each segment can be visualized. In addition, (4) the results obtained from the analysis were used to determine the value of the company's investment in the market, and they were consistent with the descriptions in the annual reports. These results indicate that highly timely information on corporate trends, such as R&D, may be obtainable through the analysis of news headlines via BERTopic.
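The time-series view in point (3) can be illustrated with a minimal sketch. It assumes each headline has already been assigned a topic id (for example by a topic model such as BERTopic) and only shows the aggregation step: the share of each topic among a year's headlines. The data, the function name `topic_shares_by_year`, and the topic ids are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

# Hypothetical (year, topic_id) pairs. In practice the topic ids would come
# from a topic model fitted on the news headlines; these values are made up.
assignments = [
    (1996, 0), (1996, 0), (1996, 1),
    (1997, 1), (1997, 1), (1997, 2),
    (1998, 2), (1998, 2), (1998, 2),
]


def topic_shares_by_year(pairs):
    """Return {year: {topic_id: share of that year's headlines}}."""
    counts = defaultdict(Counter)
    for year, topic in pairs:
        counts[year][topic] += 1

    shares = {}
    for year, counter in sorted(counts.items()):
        total = sum(counter.values())
        # Normalize counts so that each year's shares sum to 1.
        shares[year] = {t: n / total for t, n in sorted(counter.items())}
    return shares


shares = topic_shares_by_year(assignments)
for year, row in shares.items():
    print(year, row)
```

Normalizing by the yearly total rather than using raw counts keeps years with very different news volumes comparable, which matters over a long sample period; whether the authors normalized in this way is not stated in the abstract.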