Business demands for processing unstructured textual data – text mining techniques for companies to implement

Denitsa Zhecheva, Nayden Nenkov
{"title":"Business demands for processing unstructured textual data – text mining techniques for companies to implement","authors":"Denitsa Zhecheva, Nayden Nenkov","doi":"10.46656/access.2022.3.2(2)","DOIUrl":null,"url":null,"abstract":"The rapid development of technology has caused a pervasive change in the way people and businesses live. Making sound business decisions is unthinkable without processing a large amount of data (publicly available and collected on the basis of problems) with high accuracy and quality. The importance of unstructured data acquires various sources is growing. Of particular value is the continuous flow of textual information that is generated every minute around the world in a different form (unstructured textual data). This is also the subject of this article. The aim of the article is to provide an analytical overview of the main methods of word processing that are applicable for pragmatic analysis of information flows from companies, such as: extraction, summarization, grouping and categorization of text. Some methodologies are based on NLP (Natural Language Processing), others on Bayesian logic and statistical theory and practice. From the review of various publications on the topic, conclusions are proposed for their practical applicability. This allows for an objective choice of appropriate tools for processing unstructured information and business intelligence. The results of the study can be successfully used to improve managerial decision-making, improve the quality of work of employees and reduce errors in overall marketing planning.","PeriodicalId":176153,"journal":{"name":"Access Journal - Access to Science, Business, Innovation in the digital economy","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Access Journal - Access to Science, Business, Innovation in the digital economy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46656/access.2022.3.2(2)","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The rapid development of technology has caused a pervasive change in the way people and businesses live. Making sound business decisions is unthinkable without processing a large amount of data (publicly available and collected on the basis of problems) with high accuracy and quality. The importance of unstructured data acquires various sources is growing. Of particular value is the continuous flow of textual information that is generated every minute around the world in a different form (unstructured textual data). This is also the subject of this article. The aim of the article is to provide an analytical overview of the main methods of word processing that are applicable for pragmatic analysis of information flows from companies, such as: extraction, summarization, grouping and categorization of text. Some methodologies are based on NLP (Natural Language Processing), others on Bayesian logic and statistical theory and practice. From the review of various publications on the topic, conclusions are proposed for their practical applicability. This allows for an objective choice of appropriate tools for processing unstructured information and business intelligence. The results of the study can be successfully used to improve managerial decision-making, improve the quality of work of employees and reduce errors in overall marketing planning.
业务对处理非结构化文本数据的需求——文本挖掘技术供公司实现
技术的快速发展已经引起了人们和企业生活方式的普遍变化。如果不以高精度和高质量处理大量数据(公开可用和根据问题收集的数据),就无法做出合理的业务决策。获取各种来源的非结构化数据的重要性正在增长。特别有价值的是每分钟在世界各地以不同形式(非结构化文本数据)生成的连续的文本信息流。这也是本文的主题。本文的目的是提供适用于公司信息流的语用分析的主要文字处理方法的分析概述,例如:文本的提取,摘要,分组和分类。一些方法基于NLP(自然语言处理),其他方法基于贝叶斯逻辑和统计理论和实践。从对该主题的各种出版物的回顾中,提出了具有实际适用性的结论。这允许客观地选择适当的工具来处理非结构化信息和业务智能。研究结果可以成功地用于改进管理决策,提高员工的工作质量,减少整体营销规划中的错误。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信