{"title":"一种土耳其语电影评论分类方法","authors":"M. Ceyhan, Z. Orhan, Dimitrios Alexios Karras","doi":"10.26417/328uno67t","DOIUrl":null,"url":null,"abstract":"Abstract Web 2.0 has given to all people the right to become a representative of a huge cast of informal media. The importance of this power is getting more evident everyday. Every social media actor can influence the rest of the world by one’s own opinions, feelings, and thoughts generously shared on multiple media. This information belonging to various fields of life can be very handy and be used to one’s advantage, gaining precious experience. One of the greatest problems that this poses is the huge number of data spread everywhere, which are difficult to process as row data per se. Social media and general sentiment text analysis is of much valuable use, accomplishing the task extracting pure gold out of raw mineral. The key point of this investigation is to characterize new reviews automatically. To start with, features selected out of all the word roots appearing in the comments were used to train the system according to known machine learning algorithms. Next, critical words determining positive or negative sense were extracted. Another strategy was attempted eliminating common terms and dealing only with the significant class-determining words to build vocabulary with them. Aparts from linear approach, vector based feature sets were prepared out all or some of the features. The outcomes acquired were analyzed and compared leading to important conclusions, emphasizing the importance of feature selection in text classification.","PeriodicalId":202400,"journal":{"name":"European Journal of Formal Sciences and Engineering","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Approach for Movie Review Classification in Turkish\",\"authors\":\"M. Ceyhan, Z. Orhan, Dimitrios Alexios Karras\",\"doi\":\"10.26417/328uno67t\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Web 2.0 has given to all people the right to become a representative of a huge cast of informal media. The importance of this power is getting more evident everyday. Every social media actor can influence the rest of the world by one’s own opinions, feelings, and thoughts generously shared on multiple media. This information belonging to various fields of life can be very handy and be used to one’s advantage, gaining precious experience. One of the greatest problems that this poses is the huge number of data spread everywhere, which are difficult to process as row data per se. Social media and general sentiment text analysis is of much valuable use, accomplishing the task extracting pure gold out of raw mineral. The key point of this investigation is to characterize new reviews automatically. To start with, features selected out of all the word roots appearing in the comments were used to train the system according to known machine learning algorithms. Next, critical words determining positive or negative sense were extracted. Another strategy was attempted eliminating common terms and dealing only with the significant class-determining words to build vocabulary with them. Aparts from linear approach, vector based feature sets were prepared out all or some of the features. The outcomes acquired were analyzed and compared leading to important conclusions, emphasizing the importance of feature selection in text classification.\",\"PeriodicalId\":202400,\"journal\":{\"name\":\"European Journal of Formal Sciences and Engineering\",\"volume\":\"68 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Formal Sciences and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26417/328uno67t\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Formal Sciences and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26417/328uno67t","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
Web 2.0赋予了所有人成为庞大的非正式媒体代表的权利。这种能力的重要性日益明显。每个社交媒体演员都可以通过在多种媒体上慷慨分享自己的观点、感受和想法来影响世界其他地方。这些信息属于生活的各个领域,可以非常方便,并被用来为自己的优势,获得宝贵的经验。这带来的最大问题之一是遍布各地的大量数据,很难将其作为行数据本身进行处理。社交媒体和一般情感文本分析非常有价值,可以完成从原始矿物中提取纯金的任务。这项研究的关键是自动描述新的评论。首先,从评论中出现的所有词根中选择的特征用于根据已知的机器学习算法训练系统。其次,提取决定积极或消极意义的关键词语。另一种策略是试图消除通用术语,只处理重要的类决定词,用它们来建立词汇。在线性方法的基础上,建立了基于向量的特征集,提取出全部或部分特征。对得到的结果进行分析和比较,得出重要结论,强调特征选择在文本分类中的重要性。
An Approach for Movie Review Classification in Turkish
Abstract Web 2.0 has given to all people the right to become a representative of a huge cast of informal media. The importance of this power is getting more evident everyday. Every social media actor can influence the rest of the world by one’s own opinions, feelings, and thoughts generously shared on multiple media. This information belonging to various fields of life can be very handy and be used to one’s advantage, gaining precious experience. One of the greatest problems that this poses is the huge number of data spread everywhere, which are difficult to process as row data per se. Social media and general sentiment text analysis is of much valuable use, accomplishing the task extracting pure gold out of raw mineral. The key point of this investigation is to characterize new reviews automatically. To start with, features selected out of all the word roots appearing in the comments were used to train the system according to known machine learning algorithms. Next, critical words determining positive or negative sense were extracted. Another strategy was attempted eliminating common terms and dealing only with the significant class-determining words to build vocabulary with them. Aparts from linear approach, vector based feature sets were prepared out all or some of the features. The outcomes acquired were analyzed and compared leading to important conclusions, emphasizing the importance of feature selection in text classification.