{"title":"Analysis of the Relevance between Title of Product and Search Term","authors":"Jiayi Clien, Yutao Wei, Yukang Zou","doi":"10.1109/MLISE57402.2022.00041","DOIUrl":null,"url":null,"abstract":"In recent years, the product recommendation algorithm of e-commerce platforms has become more and more important. In this paper, we built a Random Forest Regression model for the problem of predicting the correlation of “search term” and “product title We found this model for the dataset by describing and attributing the products. In the process of numerically calculating features, we applied two types of feature engineering. The first method is to describe and attribute the numbers to the number of words or the length of the sentence. The second method is to use string similarity characteristics to calculate the distance between “search term” and “product title On the results, we got a similar histogram of the correlation scores between the training dataset and the results from the test dataset. The RMSE of the relevance between the training dataset and the predicted value is 0.3179 which indicates that the model is working well.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLISE57402.2022.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the product recommendation algorithm of e-commerce platforms has become more and more important. In this paper, we built a Random Forest Regression model for the problem of predicting the correlation of “search term” and “product title We found this model for the dataset by describing and attributing the products. In the process of numerically calculating features, we applied two types of feature engineering. The first method is to describe and attribute the numbers to the number of words or the length of the sentence. The second method is to use string similarity characteristics to calculate the distance between “search term” and “product title On the results, we got a similar histogram of the correlation scores between the training dataset and the results from the test dataset. The RMSE of the relevance between the training dataset and the predicted value is 0.3179 which indicates that the model is working well.