{"title":"有限平移尺度Dirichlet混合模型的模型选择与估计","authors":"Rua Alsuroji, Nuha Zamzami, N. Bouguila","doi":"10.1109/ICMLA.2018.00112","DOIUrl":null,"url":null,"abstract":"This paper proposes an unsupervised learning algorithm for a finite mixture model of shifted-scaled Dirichlet distributions. Maximum likelihood and Newton raphson approaches are used for parameters estimation. In this research work, we address the flexibility challenge of the Dirichlet distribution by having another set of parameters for the location (beside the Scale parameter) that add functional probability models. This paper evaluates the capability of the discussed model to perform the categorization using both synthetic and real data related to the medical science to help in selecting wart treatment method, in the business field to detect the reasons behind employees' absenteeism, and the writer identification application to define the author of off-line handwritten documents. We also compare the model performance against scaled Dirichlet, the classic Dirichlet, and Gaussian mixture models. Finally, experimental results are presented on the selected datasets. Besides, we apply the minimum message length to determine the optimal number of the components found within each dataset.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"9 1","pages":"707-713"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Model Selection and Estimation of a Finite Shifted-Scaled Dirichlet Mixture Model\",\"authors\":\"Rua Alsuroji, Nuha Zamzami, N. Bouguila\",\"doi\":\"10.1109/ICMLA.2018.00112\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an unsupervised learning algorithm for a finite mixture model of shifted-scaled Dirichlet distributions. Maximum likelihood and Newton raphson approaches are used for parameters estimation. In this research work, we address the flexibility challenge of the Dirichlet distribution by having another set of parameters for the location (beside the Scale parameter) that add functional probability models. This paper evaluates the capability of the discussed model to perform the categorization using both synthetic and real data related to the medical science to help in selecting wart treatment method, in the business field to detect the reasons behind employees' absenteeism, and the writer identification application to define the author of off-line handwritten documents. We also compare the model performance against scaled Dirichlet, the classic Dirichlet, and Gaussian mixture models. Finally, experimental results are presented on the selected datasets. Besides, we apply the minimum message length to determine the optimal number of the components found within each dataset.\",\"PeriodicalId\":6533,\"journal\":{\"name\":\"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"volume\":\"9 1\",\"pages\":\"707-713\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2018.00112\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2018.00112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Model Selection and Estimation of a Finite Shifted-Scaled Dirichlet Mixture Model
This paper proposes an unsupervised learning algorithm for a finite mixture model of shifted-scaled Dirichlet distributions. Maximum likelihood and Newton raphson approaches are used for parameters estimation. In this research work, we address the flexibility challenge of the Dirichlet distribution by having another set of parameters for the location (beside the Scale parameter) that add functional probability models. This paper evaluates the capability of the discussed model to perform the categorization using both synthetic and real data related to the medical science to help in selecting wart treatment method, in the business field to detect the reasons behind employees' absenteeism, and the writer identification application to define the author of off-line handwritten documents. We also compare the model performance against scaled Dirichlet, the classic Dirichlet, and Gaussian mixture models. Finally, experimental results are presented on the selected datasets. Besides, we apply the minimum message length to determine the optimal number of the components found within each dataset.