使用探索性数据分析和集成机器学习算法的评审系统

Ayan Banerjee, Tapan Chowdhury
{"title":"使用探索性数据分析和集成机器学习算法的评审系统","authors":"Ayan Banerjee, Tapan Chowdhury","doi":"10.1109/temsmet53515.2021.9768736","DOIUrl":null,"url":null,"abstract":"In the fast-growing technological advancements, every business venture is thriving for success. Irrespective of the business’s paradigm, each one has a reviewing system that envisages the working throughput of the system. The impact justifies the company’s current commercial state and also helps to bring in new customers. The proposed exploratory data analysis shows the relationship between the reviews on textual and non-textual parameters aimed to work on a user-based analysis. In this paper two different datasets are analyzed where the reviews are predicted based on extensive exploratory data analysis and feature engineering. This paper focused on examining how reviews are influenced based on the independent parameters from Zomato and Yelp datasets. These datasets have huge geographical differences that allowed us to obtain various diverse users-review coming from different backgrounds, cultures, and heritage. After feature construction, the predictive model is analyzed by applying supervised and ensemble machine learning algorithms such as Linear regression, Logistic regression, Decision trees, RandomForest, XGBoost, and GradientBoosting. The inferences obtained from both datasets gave us coherent results that validate the experimental procedures, extend the same analogy in different business paradigms, and offer a comprehensive prediction.","PeriodicalId":170546,"journal":{"name":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reviewing System using Exploratory Data Analysis and Ensemble Machine Learning Algorithms\",\"authors\":\"Ayan Banerjee, Tapan Chowdhury\",\"doi\":\"10.1109/temsmet53515.2021.9768736\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the fast-growing technological advancements, every business venture is thriving for success. Irrespective of the business’s paradigm, each one has a reviewing system that envisages the working throughput of the system. The impact justifies the company’s current commercial state and also helps to bring in new customers. The proposed exploratory data analysis shows the relationship between the reviews on textual and non-textual parameters aimed to work on a user-based analysis. In this paper two different datasets are analyzed where the reviews are predicted based on extensive exploratory data analysis and feature engineering. This paper focused on examining how reviews are influenced based on the independent parameters from Zomato and Yelp datasets. These datasets have huge geographical differences that allowed us to obtain various diverse users-review coming from different backgrounds, cultures, and heritage. After feature construction, the predictive model is analyzed by applying supervised and ensemble machine learning algorithms such as Linear regression, Logistic regression, Decision trees, RandomForest, XGBoost, and GradientBoosting. The inferences obtained from both datasets gave us coherent results that validate the experimental procedures, extend the same analogy in different business paradigms, and offer a comprehensive prediction.\",\"PeriodicalId\":170546,\"journal\":{\"name\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/temsmet53515.2021.9768736\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/temsmet53515.2021.9768736","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在快速发展的技术进步中,每一家企业都在走向成功。不管业务范例是什么,每个业务范例都有一个设想系统工作吞吐量的审查系统。这种影响证明了该公司目前的商业状态是合理的,也有助于吸引新客户。提出的探索性数据分析显示了对文本和非文本参数的审查之间的关系,旨在进行基于用户的分析。本文分析了两个不同的数据集,其中基于广泛的探索性数据分析和特征工程来预测评论。本文重点研究了基于Zomato和Yelp数据集的独立参数如何影响评论。这些数据集具有巨大的地理差异,这使我们能够获得来自不同背景、文化和遗产的各种不同用户的评论。在特征构建完成后,运用线性回归、逻辑回归、决策树、随机森林、XGBoost、梯度提升等监督和集成机器学习算法对预测模型进行分析。从两个数据集获得的推论为我们提供了一致的结果,验证了实验过程,将相同的类比扩展到不同的业务范式,并提供了全面的预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reviewing System using Exploratory Data Analysis and Ensemble Machine Learning Algorithms
In the fast-growing technological advancements, every business venture is thriving for success. Irrespective of the business’s paradigm, each one has a reviewing system that envisages the working throughput of the system. The impact justifies the company’s current commercial state and also helps to bring in new customers. The proposed exploratory data analysis shows the relationship between the reviews on textual and non-textual parameters aimed to work on a user-based analysis. In this paper two different datasets are analyzed where the reviews are predicted based on extensive exploratory data analysis and feature engineering. This paper focused on examining how reviews are influenced based on the independent parameters from Zomato and Yelp datasets. These datasets have huge geographical differences that allowed us to obtain various diverse users-review coming from different backgrounds, cultures, and heritage. After feature construction, the predictive model is analyzed by applying supervised and ensemble machine learning algorithms such as Linear regression, Logistic regression, Decision trees, RandomForest, XGBoost, and GradientBoosting. The inferences obtained from both datasets gave us coherent results that validate the experimental procedures, extend the same analogy in different business paradigms, and offer a comprehensive prediction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信