{"title":"机器学习@ Amazon","authors":"R. Rastogi","doi":"10.1145/2778865.2778867","DOIUrl":null,"url":null,"abstract":"In this talk, I will first provide an overview of the key Machine Learning (ML) applications we are developing at Amazon. I will then describe a matrix factorization model that we have developed for making product recommendations âĂŞ the salient characteristics of the model are: (1) It uses a Bayesian approach to handle data sparsity, (2) It leverages user and item features to handle the cold start problem, and (3) It introduces latent variables to handle multiple personas associated with a user account (e.g. family members). Our experimental results with synthetic and real-life datasets show that leveraging user and item features, and incorporating user personas enables our model to provide lower RMSE and perplexity compared to baselines.","PeriodicalId":116839,"journal":{"name":"Proceedings of the 2nd IKDD Conference on Data Sciences","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine Learning @ Amazon\",\"authors\":\"R. Rastogi\",\"doi\":\"10.1145/2778865.2778867\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this talk, I will first provide an overview of the key Machine Learning (ML) applications we are developing at Amazon. I will then describe a matrix factorization model that we have developed for making product recommendations âĂŞ the salient characteristics of the model are: (1) It uses a Bayesian approach to handle data sparsity, (2) It leverages user and item features to handle the cold start problem, and (3) It introduces latent variables to handle multiple personas associated with a user account (e.g. family members). Our experimental results with synthetic and real-life datasets show that leveraging user and item features, and incorporating user personas enables our model to provide lower RMSE and perplexity compared to baselines.\",\"PeriodicalId\":116839,\"journal\":{\"name\":\"Proceedings of the 2nd IKDD Conference on Data Sciences\",\"volume\":\"61 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd IKDD Conference on Data Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2778865.2778867\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd IKDD Conference on Data Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2778865.2778867","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this talk, I will first provide an overview of the key Machine Learning (ML) applications we are developing at Amazon. I will then describe a matrix factorization model that we have developed for making product recommendations âĂŞ the salient characteristics of the model are: (1) It uses a Bayesian approach to handle data sparsity, (2) It leverages user and item features to handle the cold start problem, and (3) It introduces latent variables to handle multiple personas associated with a user account (e.g. family members). Our experimental results with synthetic and real-life datasets show that leveraging user and item features, and incorporating user personas enables our model to provide lower RMSE and perplexity compared to baselines.