Characteristics Classification of Mobile Apps on Apple Store Using Clustering

Boxin Fu
{"title":"Characteristics Classification of Mobile Apps on Apple Store Using Clustering","authors":"Boxin Fu","doi":"10.4236/jdaip.2020.82004","DOIUrl":null,"url":null,"abstract":"This research is interested in the user ratings of Apps on Apple Stores. The purpose of this research is to have a better understanding of some characteristics of the good Apps on Apple Store so Apps makers can potentially focus on these traits to maximize their profit. The data for this research is collected from kaggle.com, and originally collected from iTunes Search API, according to the abstract of the data. Four different attributes contribute directly toward an App’s user rating: rating_count_tot, rating_count_ver, user_rating and user_rating_ver. The relationship between Apps receiving higher ratings and Apps receiving lower ratings is analyzed using Exploratory Data Analysis and Data Science technique “clustering” on their numerical attributes. Apps, which are represented as a data point, with similar characteristics in rating are classified as belonging to the same cluster, while common characteristics of all Apps in the same clusters are the determining traits of Apps for that cluster. Both techniques are achieved using Google Colab and libraries including pandas, numpy, seaborn, and matplotlib. The data reveals direct correlation from number of devices supported and languages supported to user rating and inverse correlation from size and price of the App to user rating. In conclusion, free small Apps that many different types of users are able to use are generally well rated by most users, according to the data.","PeriodicalId":71434,"journal":{"name":"数据分析和信息处理(英文)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"数据分析和信息处理(英文)","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.4236/jdaip.2020.82004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

This research is interested in the user ratings of Apps on Apple Stores. The purpose of this research is to have a better understanding of some characteristics of the good Apps on Apple Store so Apps makers can potentially focus on these traits to maximize their profit. The data for this research is collected from kaggle.com, and originally collected from iTunes Search API, according to the abstract of the data. Four different attributes contribute directly toward an App’s user rating: rating_count_tot, rating_count_ver, user_rating and user_rating_ver. The relationship between Apps receiving higher ratings and Apps receiving lower ratings is analyzed using Exploratory Data Analysis and Data Science technique “clustering” on their numerical attributes. Apps, which are represented as a data point, with similar characteristics in rating are classified as belonging to the same cluster, while common characteristics of all Apps in the same clusters are the determining traits of Apps for that cluster. Both techniques are achieved using Google Colab and libraries including pandas, numpy, seaborn, and matplotlib. The data reveals direct correlation from number of devices supported and languages supported to user rating and inverse correlation from size and price of the App to user rating. In conclusion, free small Apps that many different types of users are able to use are generally well rated by most users, according to the data.
基于聚类的苹果商店移动应用程序特征分类
这项研究关注的是苹果商店应用的用户评分。本研究的目的是为了更好地了解苹果商店中优秀应用程序的一些特征,以便应用程序制造商可以潜在地专注于这些特征,以最大化他们的利润。根据数据摘要,本研究的数据来源于kaggle.com,原始数据来源于iTunes Search API。有四个不同的属性直接影响应用的用户评级:rating_count_tot、rating_count_ver、user_rating和user_rating_ver。使用探索性数据分析和数据科学技术对其数值属性进行“聚类”,分析获得较高评级和较低评级的应用程序之间的关系。用数据点表示的具有相似特征的应用程序被归类为属于同一集群,而同一集群中所有应用程序的共同特征是该集群中应用程序的决定性特征。这两种技术都是使用谷歌Colab和包括pandas、numpy、seaborn和matplotlib在内的库实现的。数据显示,应用支持的设备数量和语言与用户评价呈正相关,而应用的大小和价格与用户评价呈负相关。综上所述,数据显示,许多不同类型的用户都能使用的免费小应用通常都得到了大多数用户的好评。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
91
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信