{"title":"Football and the dark side of cluster analysis","authors":"C. Hennig, Serhat Emre Akhanli","doi":"10.20347/WIAS.REPORT.29","DOIUrl":"https://doi.org/10.20347/WIAS.REPORT.29","url":null,"abstract":"In cluster analysis, decisions on data preprocessing such as how to select, transform, and standardise variables and how to aggregate information from continuous, count and categorical variables cannot be made in a supervised manner, i.e., based on prediction of a response variable. Statisticians often attempt to make such decisions in an automated way by optimising certain objective functions of the data anyway, but this usually ignores the fact that in cluster analysis these decisions determine the meaning of the resulting clustering. We argue that the decisions should be made based on the aim and intended interpretation of the clustering and the meaning of the variables. The rationale is that preprocessing should be done in such a way that the resulting distances, as used by the clustering method, match as well as possible the \"interpretative distances\" between objects as determined by the meaning of the variables and objects. Such \"interpretative distances\" are usually not precisely specified and involve a certain amount of subjectivity. We will use ongoing work on clustering football players based on performance data to illustrate how such decisions can be made, how much of an impact they can have, how the data can still help with them and to highlight some issues with the approach.","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124555925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Functional data analysis for optimizing strategies of cash flow management","authors":"F. Salvo, M. Chiodi, P. Patricola","doi":"10.1007/978-3-319-55723-6_17","DOIUrl":"https://doi.org/10.1007/978-3-319-55723-6_17","url":null,"abstract":"","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"34 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114039349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Large Covariance Matrix Estimation by Composite Minimization","authors":"M. Farné","doi":"10.6092/UNIBO/AMSDOTTORATO/7250","DOIUrl":"https://doi.org/10.6092/UNIBO/AMSDOTTORATO/7250","url":null,"abstract":"A method to regularize large-dimensional covariance matrices under the assumption of approximate factor model will be presented. Existing methods perform estimation by recovering principal components and sparsifying the residual covariance matrix. In our setting this task is achieved recovering the low rank plus sparse decomposition by least squares minimization under nuclear norm plus $l_1$ norm penalization. In the literature, the best known algorithm to solve this problem is soft thresholding plus singular value thresholding and consistency of estimators is derived under specific assumptions on the eigenvalues of the low rank component matrix. In this paper consistency of the proposed estimator will be derived under the pervasive condition, providing the identification of low rank and sparse spaces by introducing the unshrinking of estimated eigenvalues. Algorithm derivation and convergence analysis are provided, and the new procedure is compared with the existing ones under the same assumptions. The performance of our minimizer is described in a wide simulation study, where various low rank plus sparse settings are simulated according to different parameter values.","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125291246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Justyna Brzezinska-Grabowska, Aneta Rybicka, A. Sagan
{"title":"IRT-based conjoint analysis in the optimization of banking products","authors":"Justyna Brzezinska-Grabowska, Aneta Rybicka, A. Sagan","doi":"10.15611/EKT.2017.3.04","DOIUrl":"https://doi.org/10.15611/EKT.2017.3.04","url":null,"abstract":"Conjoint measurement and analysis have common underlying psychometric and statistical assumption concerning axioms of additivity and two-way frame of reference in preference measurement. However, whereas the former concept is widely used in fundamental measurement of subject $times$ object dominance structures as in IRT and Rasch measurement models, the latter is utilized in a broad familiy of object $times$ object dominance structures in both compositional (i.e. Thurestone case III and V) as well as decompositional (classical conjoint experiments and BTL/alpha simulation) preference measurement models. These two traditions are rarely combined in one measurement model and research design that integrates subject $times$ object $times$ object measurement (Neubauer 2001). The aim of the paper is to adopt and compare three types of preference measurement models in the area of banking products in Poland:1/ paired-comparisons and rating scale conjoint experiment,2/ IRT-based conjoint (Rasch and Birnbaum politomous models) and 3/ compositional Thurstone III/V models (Bockenholt 2006). Part-worth utilities are used for product optimization and comparison across the estimated models.","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130854892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction for chance and correction for maximum value","authors":"M. Warrens","doi":"10.14355/JMMF.2013.0204.01","DOIUrl":"https://doi.org/10.14355/JMMF.2013.0204.01","url":null,"abstract":"In data analysis and classification association coefficients are used to quantify the association in contingency tables. Various coefficients are chance-corrected, that is, they have value zero under statistical independence. Examples are the phi coefficient, Cohen's kappa, and the adjusted Rand index. Other coefficients are corrected for maximum value. Correction for chance and correction for maximum value are studied as functions on a space of association coefficients for contingency tables. Both functions are idempotent, and the two functions commute under composition. Furthermore, the composed function maps a coefficient and all its linear transformations given the marginal totals to the same coefficient. The algebraic structure of the two functions, the their composition, and the identity function, turns out to be an idempotent commutative monoid.","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129505817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Pawełek, K. Galuszka, Jadwiga Kostrzewska, M. Kostrzewski
{"title":"Classification Methods in the Research on the Financial Standing of Construction Enterprises After Bankruptcy in Poland","authors":"B. Pawełek, K. Galuszka, Jadwiga Kostrzewska, M. Kostrzewski","doi":"10.1007/978-3-319-55723-6_3","DOIUrl":"https://doi.org/10.1007/978-3-319-55723-6_3","url":null,"abstract":"","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126889027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cause-Related Marketing: A Qualitative and Quantitative Analysis on Pinkwashing","authors":"G. Schoier, P. D. Luca","doi":"10.1007/978-3-319-55723-6_25","DOIUrl":"https://doi.org/10.1007/978-3-319-55723-6_25","url":null,"abstract":"","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128735818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting the Evolution of a Constrained Network: A Beta Regression Model","authors":"L. Stracqualursi, P. Agati","doi":"10.1007/978-3-319-55723-6_26","DOIUrl":"https://doi.org/10.1007/978-3-319-55723-6_26","url":null,"abstract":"","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133034439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Missing Data Imputation and Its Effect on the Accuracy of Classification","authors":"L. Hunt","doi":"10.1007/978-3-319-55723-6_1","DOIUrl":"https://doi.org/10.1007/978-3-319-55723-6_1","url":null,"abstract":"","PeriodicalId":330529,"journal":{"name":"International Federation of Classification Societies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130516974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}