Business demands for processing unstructured textual data – text mining techniques for companies to implement
Denitsa Zhecheva, Nayden Nenkov
Access Journal - Access to Science, Business, Innovation in the digital economy, published 2022-04-17
DOI: 10.46656/access.2022.3.2(2) (https://doi.org/10.46656/access.2022.3.2(2))
Citations: 2
Abstract
The rapid development of technology has pervasively changed the way people live and businesses operate. Sound business decisions are unthinkable without processing large amounts of data (both publicly available and collected for specific problems) with high accuracy and quality. The importance of unstructured data acquired from various sources is growing. Of particular value is the continuous flow of textual information generated every minute around the world in different forms (unstructured textual data), which is the subject of this article. The aim of the article is to provide an analytical overview of the main text processing methods applicable to the pragmatic analysis of information flows from companies, such as extraction, summarization, grouping, and categorization of text. Some methodologies are based on NLP (Natural Language Processing), others on Bayesian logic and statistical theory and practice. Based on a review of various publications on the topic, conclusions are drawn about the practical applicability of these methods. This allows for an objective choice of appropriate tools for processing unstructured information and for business intelligence. The results of the study can be used to improve managerial decision-making, raise the quality of employees' work, and reduce errors in overall marketing planning.