{"title":"基于搜索查询和安装的应用和游戏预测用户的人口特征","authors":"Ghazal Kalhor, B. Bahrak","doi":"10.1109/CSICC58665.2023.10105350","DOIUrl":null,"url":null,"abstract":"Employing various strategies to catch interest in using online app stores has become a common trend in recent decades. One of the dominant factors in determining the success of online businesses in this area is whether they have information about users' demographic features such as gender or age. In this study, we try to detect these features based on the lists of installed applications, installed games, and searched queries collected from an Iranian mobile application store. For this goal, we use a wide range of machine learning techniques to identify which model has the highest performance in these classification and regression tasks. Our findings show that we can detect genders with a balanced accuracy of 0.76. We also achieve 10.02 as the RMSE for age predictions and the ROC AUC of 0.81 in determining users' age groups.","PeriodicalId":127277,"journal":{"name":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","volume":"129 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting Users' Demographic Features Based on Searched Queries and Installed Apps and Games\",\"authors\":\"Ghazal Kalhor, B. Bahrak\",\"doi\":\"10.1109/CSICC58665.2023.10105350\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Employing various strategies to catch interest in using online app stores has become a common trend in recent decades. One of the dominant factors in determining the success of online businesses in this area is whether they have information about users' demographic features such as gender or age. In this study, we try to detect these features based on the lists of installed applications, installed games, and searched queries collected from an Iranian mobile application store. For this goal, we use a wide range of machine learning techniques to identify which model has the highest performance in these classification and regression tasks. Our findings show that we can detect genders with a balanced accuracy of 0.76. We also achieve 10.02 as the RMSE for age predictions and the ROC AUC of 0.81 in determining users' age groups.\",\"PeriodicalId\":127277,\"journal\":{\"name\":\"2023 28th International Computer Conference, Computer Society of Iran (CSICC)\",\"volume\":\"129 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 28th International Computer Conference, Computer Society of Iran (CSICC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSICC58665.2023.10105350\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 28th International Computer Conference, Computer Society of Iran (CSICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSICC58665.2023.10105350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting Users' Demographic Features Based on Searched Queries and Installed Apps and Games
Employing various strategies to catch interest in using online app stores has become a common trend in recent decades. One of the dominant factors in determining the success of online businesses in this area is whether they have information about users' demographic features such as gender or age. In this study, we try to detect these features based on the lists of installed applications, installed games, and searched queries collected from an Iranian mobile application store. For this goal, we use a wide range of machine learning techniques to identify which model has the highest performance in these classification and regression tasks. Our findings show that we can detect genders with a balanced accuracy of 0.76. We also achieve 10.02 as the RMSE for age predictions and the ROC AUC of 0.81 in determining users' age groups.