{"title":"基于潜在变量挖掘的EM聚类算法研究与实现","authors":"Qiang Yue, Zhong-yu Hu, Dongping Li","doi":"10.12783/DTCSE/CCNT2020/35417","DOIUrl":null,"url":null,"abstract":"Clustering analysis is one of the hot research fields in data mining. EM algorithm is an effective method to realize maximum likelihood estimation, which is mainly used for parameter estimation of incomplete data. It greatly simplifies the likelihood function equation by assuming the existence of latent variable, while maximum likelihood estimation is a commonly used parameter estimation method, and the EM algorithm makes its application more extensive. This paper takes clustering algorithm as the main research object, introduces the basic idea of maximum likelihood estimation, describes the basic theory of EM algorithm, and realizes EM algorithm. The experimental results show that compared with the traditional clustering algorithm, the EM algorithm has better convergence and clustering ability.","PeriodicalId":11066,"journal":{"name":"DEStech Transactions on Computer Science and Engineering","volume":"30 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research and Implementation of EM Clustering Algorithm Based on Latent Variable Mining\",\"authors\":\"Qiang Yue, Zhong-yu Hu, Dongping Li\",\"doi\":\"10.12783/DTCSE/CCNT2020/35417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering analysis is one of the hot research fields in data mining. EM algorithm is an effective method to realize maximum likelihood estimation, which is mainly used for parameter estimation of incomplete data. It greatly simplifies the likelihood function equation by assuming the existence of latent variable, while maximum likelihood estimation is a commonly used parameter estimation method, and the EM algorithm makes its application more extensive. This paper takes clustering algorithm as the main research object, introduces the basic idea of maximum likelihood estimation, describes the basic theory of EM algorithm, and realizes EM algorithm. The experimental results show that compared with the traditional clustering algorithm, the EM algorithm has better convergence and clustering ability.\",\"PeriodicalId\":11066,\"journal\":{\"name\":\"DEStech Transactions on Computer Science and Engineering\",\"volume\":\"30 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DEStech Transactions on Computer Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12783/DTCSE/CCNT2020/35417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DEStech Transactions on Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12783/DTCSE/CCNT2020/35417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research and Implementation of EM Clustering Algorithm Based on Latent Variable Mining
Clustering analysis is one of the hot research fields in data mining. EM algorithm is an effective method to realize maximum likelihood estimation, which is mainly used for parameter estimation of incomplete data. It greatly simplifies the likelihood function equation by assuming the existence of latent variable, while maximum likelihood estimation is a commonly used parameter estimation method, and the EM algorithm makes its application more extensive. This paper takes clustering algorithm as the main research object, introduces the basic idea of maximum likelihood estimation, describes the basic theory of EM algorithm, and realizes EM algorithm. The experimental results show that compared with the traditional clustering algorithm, the EM algorithm has better convergence and clustering ability.