应用数据聚类方法创建房地产目标用户群

Herald of Khmelnytskyi National University. Technical sciences Pub Date : 2023-04-27 DOI:10.31891/2307-5732-2023-319-1-300-307

Olexandr Tkachyk

{"title":"应用数据聚类方法创建房地产目标用户群","authors":"Olexandr Tkachyk","doi":"10.31891/2307-5732-2023-319-1-300-307","DOIUrl":null,"url":null,"abstract":"In this paper applied unsupervised clustering to a dataset examines the application of k-means clustering to create target user groups for a real estate platform. The goal is to segment the user base into meaningful groups to better understand their preferences and behaviors, and tailor marketing campaigns and product features to the needs of each group. The key step in the application of k-means clustering to real estate data is data preparation. Real estate data can be particularly messy and incomplete, and thus requires careful cleaning and normalization before clustering can be applied. Data preparation includes several key steps, such as removing irrelevant or redundant features, creating new features as feature scaling is also an important step in data preparation. K-means clustering is sensitive to the scale of the data, so features may need to be normalized to ensure that they are on the same scale, handling missing or erroneous data, and scaling or transforming features to ensure they are on the same scale. Dataset of 2000 customers interested in real estate with the various types of data was taken as a basis. Then the data was observed, investigated and based on results it was prepared for clustering by doing data cleaning as irrelevant data or empty data points may include features that do not significantly contribute to the clustering process, data normalization as it is necessary to ensure that all features are on the same scale, feature selection to determine most relevant features for clustering, feature encoding and dimensionality reduction which was achieved through principal component analysis (PCA). By carefully cleaning, normalizing, and selecting relevant features, clustering algorithms such as k-means were applied more effectively and target user groups were identified.","PeriodicalId":386560,"journal":{"name":"Herald of Khmelnytskyi National University. Technical sciences","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"APPLYING DATA CLUSTERING METHODS FOR CREATING TARGETING USER GROUPS FOR REAL ESTATE\",\"authors\":\"Olexandr Tkachyk\",\"doi\":\"10.31891/2307-5732-2023-319-1-300-307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper applied unsupervised clustering to a dataset examines the application of k-means clustering to create target user groups for a real estate platform. The goal is to segment the user base into meaningful groups to better understand their preferences and behaviors, and tailor marketing campaigns and product features to the needs of each group. The key step in the application of k-means clustering to real estate data is data preparation. Real estate data can be particularly messy and incomplete, and thus requires careful cleaning and normalization before clustering can be applied. Data preparation includes several key steps, such as removing irrelevant or redundant features, creating new features as feature scaling is also an important step in data preparation. K-means clustering is sensitive to the scale of the data, so features may need to be normalized to ensure that they are on the same scale, handling missing or erroneous data, and scaling or transforming features to ensure they are on the same scale. Dataset of 2000 customers interested in real estate with the various types of data was taken as a basis. Then the data was observed, investigated and based on results it was prepared for clustering by doing data cleaning as irrelevant data or empty data points may include features that do not significantly contribute to the clustering process, data normalization as it is necessary to ensure that all features are on the same scale, feature selection to determine most relevant features for clustering, feature encoding and dimensionality reduction which was achieved through principal component analysis (PCA). By carefully cleaning, normalizing, and selecting relevant features, clustering algorithms such as k-means were applied more effectively and target user groups were identified.\",\"PeriodicalId\":386560,\"journal\":{\"name\":\"Herald of Khmelnytskyi National University. Technical sciences\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Herald of Khmelnytskyi National University. Technical sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31891/2307-5732-2023-319-1-300-307\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Herald of Khmelnytskyi National University. Technical sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31891/2307-5732-2023-319-1-300-307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文将无监督聚类应用于数据集，研究了k-means聚类在房地产平台创建目标用户组中的应用。目标是将用户群划分为有意义的群体，以便更好地了解他们的偏好和行为，并根据每个群体的需求定制营销活动和产品功能。将k-means聚类方法应用于房地产数据的关键步骤是数据准备。房地产数据可能特别混乱和不完整，因此在应用聚类之前需要仔细清理和规范化。数据准备包括几个关键步骤，例如删除不相关或冗余的特征，创建新特征，因为特征缩放也是数据准备中的重要步骤。K-means聚类对数据的规模很敏感，所以特征可能需要规范化以确保它们在相同的规模上，处理缺失或错误的数据，并缩放或转换特征以确保它们在相同的规模上。以2000个对房地产感兴趣的客户的数据集为基础，并以各种类型的数据为基础。然后对数据进行观察、调查，并根据结果进行数据清理，为聚类做准备，因为不相关数据或空数据点可能包括对聚类过程没有显著贡献的特征，数据归一化是必要的，以确保所有特征都在相同的尺度上，特征选择以确定聚类最相关的特征，通过主成分分析(PCA)实现特征编码和降维。通过仔细清理、归一化和选择相关特征，可以更有效地应用k-means等聚类算法，并识别目标用户群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

APPLYING DATA CLUSTERING METHODS FOR CREATING TARGETING USER GROUPS FOR REAL ESTATE

In this paper applied unsupervised clustering to a dataset examines the application of k-means clustering to create target user groups for a real estate platform. The goal is to segment the user base into meaningful groups to better understand their preferences and behaviors, and tailor marketing campaigns and product features to the needs of each group. The key step in the application of k-means clustering to real estate data is data preparation. Real estate data can be particularly messy and incomplete, and thus requires careful cleaning and normalization before clustering can be applied. Data preparation includes several key steps, such as removing irrelevant or redundant features, creating new features as feature scaling is also an important step in data preparation. K-means clustering is sensitive to the scale of the data, so features may need to be normalized to ensure that they are on the same scale, handling missing or erroneous data, and scaling or transforming features to ensure they are on the same scale. Dataset of 2000 customers interested in real estate with the various types of data was taken as a basis. Then the data was observed, investigated and based on results it was prepared for clustering by doing data cleaning as irrelevant data or empty data points may include features that do not significantly contribute to the clustering process, data normalization as it is necessary to ensure that all features are on the same scale, feature selection to determine most relevant features for clustering, feature encoding and dimensionality reduction which was achieved through principal component analysis (PCA). By carefully cleaning, normalizing, and selecting relevant features, clustering algorithms such as k-means were applied more effectively and target user groups were identified.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Herald of Khmelnytskyi National University. Technical sciences

自引率

0.00%

发文量