F. Başkaya, Ilhan Aydin
{"title":"Haber metinlerinin farkli metin madenciliği yöntemleriyle siniflandirilmasi","authors":"F. Başkaya, Ilhan Aydin","doi":"10.1109/IDAP.2017.8090310","DOIUrl":null,"url":null,"abstract":"With the development of technology, people are entering the virtual world more and more. Parallel to this, the internet becomes a bigger network every day and it gets a complex structure depending on this growth. Achieving the desired information with structred data becomes an increasingly important problem. One of the useful ways to find solution for this problem is to divide this complex data into categories by text mining methods. By creating semantic similarities with this categorization, data can be achieved effectively and quickly. In this study, it is aimed to classify the news text data that have four different categories (economy, politics, sports and health) with different feature extraction and term weighting methods using different text mining techniques and to test the efficiency and success of the methods. By the proposed method, 100% classification success rate was obtained on news texts.","PeriodicalId":111721,"journal":{"name":"2017 International Artificial Intelligence and Data Processing Symposium (IDAP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Artificial Intelligence and Data Processing Symposium (IDAP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDAP.2017.8090310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着科技的发展,越来越多的人进入虚拟世界。与此同时,互联网每天都在成为一个更大的网络,它的结构也随着这种增长而变得复杂。利用结构化数据获取所需的信息已成为一个日益重要的问题。通过文本挖掘方法将这些复杂的数据进行分类是解决这一问题的有效方法之一。通过使用这种分类创建语义相似性,可以有效而快速地获得数据。本研究旨在使用不同的文本挖掘技术,对经济、政治、体育和健康四个不同类别的新闻文本数据,采用不同的特征提取和术语加权方法进行分类,并测试方法的效率和成功率。该方法对新闻文本的分类成功率达到100%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Haber metinlerinin farkli metin madenciliği yöntemleriyle siniflandirilmasi
With the development of technology, people are entering the virtual world more and more. Parallel to this, the internet becomes a bigger network every day and it gets a complex structure depending on this growth. Achieving the desired information with structred data becomes an increasingly important problem. One of the useful ways to find solution for this problem is to divide this complex data into categories by text mining methods. By creating semantic similarities with this categorization, data can be achieved effectively and quickly. In this study, it is aimed to classify the news text data that have four different categories (economy, politics, sports and health) with different feature extraction and term weighting methods using different text mining techniques and to test the efficiency and success of the methods. By the proposed method, 100% classification success rate was obtained on news texts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信