Business demands for processing unstructured textual data – text mining techniques for companies to implement

Access Journal - Access to Science, Business, Innovation in the digital economy Pub Date : 2022-04-17 DOI:10.46656/access.2022.3.2(2)

Denitsa Zhecheva, Nayden Nenkov

{"title":"Business demands for processing unstructured textual data – text mining techniques for companies to implement","authors":"Denitsa Zhecheva, Nayden Nenkov","doi":"10.46656/access.2022.3.2(2)","DOIUrl":null,"url":null,"abstract":"The rapid development of technology has caused a pervasive change in the way people and businesses live. Making sound business decisions is unthinkable without processing a large amount of data (publicly available and collected on the basis of problems) with high accuracy and quality. The importance of unstructured data acquires various sources is growing. Of particular value is the continuous flow of textual information that is generated every minute around the world in a different form (unstructured textual data). This is also the subject of this article. The aim of the article is to provide an analytical overview of the main methods of word processing that are applicable for pragmatic analysis of information flows from companies, such as: extraction, summarization, grouping and categorization of text. Some methodologies are based on NLP (Natural Language Processing), others on Bayesian logic and statistical theory and practice. From the review of various publications on the topic, conclusions are proposed for their practical applicability. This allows for an objective choice of appropriate tools for processing unstructured information and business intelligence. The results of the study can be successfully used to improve managerial decision-making, improve the quality of work of employees and reduce errors in overall marketing planning.","PeriodicalId":176153,"journal":{"name":"Access Journal - Access to Science, Business, Innovation in the digital economy","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Access Journal - Access to Science, Business, Innovation in the digital economy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46656/access.2022.3.2(2)","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The rapid development of technology has caused a pervasive change in the way people and businesses live. Making sound business decisions is unthinkable without processing a large amount of data (publicly available and collected on the basis of problems) with high accuracy and quality. The importance of unstructured data acquires various sources is growing. Of particular value is the continuous flow of textual information that is generated every minute around the world in a different form (unstructured textual data). This is also the subject of this article. The aim of the article is to provide an analytical overview of the main methods of word processing that are applicable for pragmatic analysis of information flows from companies, such as: extraction, summarization, grouping and categorization of text. Some methodologies are based on NLP (Natural Language Processing), others on Bayesian logic and statistical theory and practice. From the review of various publications on the topic, conclusions are proposed for their practical applicability. This allows for an objective choice of appropriate tools for processing unstructured information and business intelligence. The results of the study can be successfully used to improve managerial decision-making, improve the quality of work of employees and reduce errors in overall marketing planning.

查看原文本刊更多论文

业务对处理非结构化文本数据的需求——文本挖掘技术供公司实现

技术的快速发展已经引起了人们和企业生活方式的普遍变化。如果不以高精度和高质量处理大量数据(公开可用和根据问题收集的数据)，就无法做出合理的业务决策。获取各种来源的非结构化数据的重要性正在增长。特别有价值的是每分钟在世界各地以不同形式(非结构化文本数据)生成的连续的文本信息流。这也是本文的主题。本文的目的是提供适用于公司信息流的语用分析的主要文字处理方法的分析概述，例如:文本的提取，摘要，分组和分类。一些方法基于NLP(自然语言处理)，其他方法基于贝叶斯逻辑和统计理论和实践。从对该主题的各种出版物的回顾中，提出了具有实际适用性的结论。这允许客观地选择适当的工具来处理非结构化信息和业务智能。研究结果可以成功地用于改进管理决策，提高员工的工作质量，减少整体营销规划中的错误。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Access Journal - Access to Science, Business, Innovation in the digital economy

自引率

0.00%

发文量