Effectual Text Classification in Data Mining: A Practical Approach

I. Salem, Alaa Wagih Abdulqader, Atheel Sabih Shaker
{"title":"Effectual Text Classification in Data Mining: A Practical Approach","authors":"I. Salem, Alaa Wagih Abdulqader, Atheel Sabih Shaker","doi":"10.58496/mjbd/2023/007","DOIUrl":null,"url":null,"abstract":"Text classification is the process of setting records into classes that have already been set up based on what they say. It automatically puts texts in natural languages into categories that have already been set up. Text classification is the most crucial part of text retrieval systems, which find texts based on what the user requests, and text understanding systems, which change the text in some way, like by making summaries, answering questions, or pulling out data. Existing algorithms that use supervised learning to classify text automatically need enough examples to learn well. The algorithms for data mining are used to classify texts, as well as a review of the work that has been done on classifying texts. Design/Methodology/Approach: Data mining algorithms that are used to classify texts were talked about, and studies that looked at how these algorithms were used to classify texts were looked at, with a focus on comparative studies. Findings: No classifier can always do the best job because different datasets and situations lead to different classification accuracy. Implications for Real Life: When using data mining algorithms to classify text documents, it's important to keep in mind that the conditions of the data will affect how well the documents are classified. For this reason, the data should be well organized.","PeriodicalId":325612,"journal":{"name":"Mesopotamian Journal of Big Data","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mesopotamian Journal of Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.58496/mjbd/2023/007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Text classification is the process of setting records into classes that have already been set up based on what they say. It automatically puts texts in natural languages into categories that have already been set up. Text classification is the most crucial part of text retrieval systems, which find texts based on what the user requests, and text understanding systems, which change the text in some way, like by making summaries, answering questions, or pulling out data. Existing algorithms that use supervised learning to classify text automatically need enough examples to learn well. The algorithms for data mining are used to classify texts, as well as a review of the work that has been done on classifying texts. Design/Methodology/Approach: Data mining algorithms that are used to classify texts were talked about, and studies that looked at how these algorithms were used to classify texts were looked at, with a focus on comparative studies. Findings: No classifier can always do the best job because different datasets and situations lead to different classification accuracy. Implications for Real Life: When using data mining algorithms to classify text documents, it's important to keep in mind that the conditions of the data will affect how well the documents are classified. For this reason, the data should be well organized.
数据挖掘中的有效文本分类:一种实用方法
文本分类是根据记录所说的内容将记录设置为已经设置好的类别的过程。它会自动将自然语言的文本放入已经设置好的类别中。文本分类是文本检索系统和文本理解系统中最关键的部分,文本检索系统根据用户请求查找文本,文本理解系统以某种方式更改文本,例如通过摘要、回答问题或提取数据。现有的使用监督学习来自动分类文本的算法需要足够的例子才能学好。数据挖掘算法用于对文本进行分类,以及对已完成的文本分类工作的回顾。设计/方法论/方法:讨论了用于文本分类的数据挖掘算法,以及研究如何使用这些算法对文本进行分类的研究,重点是比较研究。发现:由于不同的数据集和情况导致不同的分类精度,没有分类器可以总是做得最好。对现实生活的影响:在使用数据挖掘算法对文本文档进行分类时,一定要记住,数据的条件会影响文档的分类效果。因此,数据应该组织得很好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信