{"title":"Proposal of a Method Extracting Strategic Phrases from Japanese Enterprise Disclosure Documents","authors":"Shu Kawanami, Ken Hidema, K. Okada","doi":"10.1109/IIAI-AAI50415.2020.00106","DOIUrl":null,"url":null,"abstract":"Text mining is spreading in various domains due to remarkable computational capabilities and huge accumulation of digital data. However, application of text mining to enterprises' disclosure documents written in Japanese is still less common than application to SNS (Social Networking Service) messages or product reviews. In this study, we propose a new method to extract strategic phrases/words which potentially related to enterprises' various strategies, policies, initiatives, action plans, and so forth, as clue to understand the deep knowledge of business dynamic mechanism. The proposed method is able to extract strategic phrases/words from the various types of enterprises' disclosure documents written in Japanese, through the process which consists of steps as follows: (1) stratified morphological analysis; (2) classification of strategic phrases/words; (3) elimination of inadequate strategic phrases by employing dependency analysis; also (4) XBRL parsing and new sentences extraction for formal annual securities reports, as optional step. Moreover, it was demonstrated that the extracted results by the proposed method includes valuable strategic phrases/words.","PeriodicalId":188870,"journal":{"name":"2020 9th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 9th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI50415.2020.00106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Text mining is spreading in various domains due to remarkable computational capabilities and huge accumulation of digital data. However, application of text mining to enterprises' disclosure documents written in Japanese is still less common than application to SNS (Social Networking Service) messages or product reviews. In this study, we propose a new method to extract strategic phrases/words which potentially related to enterprises' various strategies, policies, initiatives, action plans, and so forth, as clue to understand the deep knowledge of business dynamic mechanism. The proposed method is able to extract strategic phrases/words from the various types of enterprises' disclosure documents written in Japanese, through the process which consists of steps as follows: (1) stratified morphological analysis; (2) classification of strategic phrases/words; (3) elimination of inadequate strategic phrases by employing dependency analysis; also (4) XBRL parsing and new sentences extraction for formal annual securities reports, as optional step. Moreover, it was demonstrated that the extracted results by the proposed method includes valuable strategic phrases/words.