Data depth for mixed-type data through MDS. An application to biological age imputation
Authors: Ignacio Cascos, Aurea Grané, Jingye Qian
Journal: Socio-economic Planning Sciences, volume 98, Article 102140
Publication date: 2024-12-24
DOI: 10.1016/j.seps.2024.102140
URL: https://www.sciencedirect.com/science/article/pii/S0038012124003409
Citations: 0
Abstract
For a mixed-type dataset, we propose a new procedure to assess how well an observation represents the central tendency of the data. We then apply this technique to evaluate the functional condition of a human organism in terms of its biological age, which is based on biomarkers, medical conditions, life habits, and sociodemographic variables. These records are of mixed type because they consist of both numerical and categorical variables. To evaluate the centrality of an observation in a mixed-type dataset, we obtain a Multidimensional Scaling (MDS) representation and apply a classical notion of multivariate data depth in an appropriate space. The biological age of an individual is then defined as the age that would make the individual as deep as possible with respect to a sample of individuals of similar age, with all of the individual's other features left unchanged.
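The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the dissimilarity, the depth function, or how new points are placed in the MDS space, so everything specific here is an assumption made for illustration: Gower's dissimilarity for the mixed-type records, classical (Torgerson) MDS, Mahalanobis depth as the "classical notion" of depth, and a simple re-embedding of the individual together with each reference group. The function names, the `window` parameter, and the synthetic data are hypothetical and are not taken from the paper.

```python
import numpy as np

def gower_distance(num, cat, num_range):
    """Gower dissimilarity: range-scaled absolute differences on numeric
    columns, simple mismatch on categorical columns, averaged over columns."""
    d_num = np.abs(num[:, None, :] - num[None, :, :]) / num_range
    d_cat = (cat[:, None, :] != cat[None, :, :]).astype(float)
    p = num.shape[1] + cat.shape[1]
    return (d_num.sum(axis=2) + d_cat.sum(axis=2)) / p

def classical_mds(dist, k=2):
    """Classical MDS: double-center the squared distances and keep the
    k leading eigenvectors, scaled by the square roots of their eigenvalues."""
    n = dist.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ (dist ** 2) @ J
    vals, vecs = np.linalg.eigh(B)
    idx = np.argsort(vals)[::-1][:k]
    return vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0.0, None))

def mahalanobis_depth(point, cloud):
    """Mahalanobis depth of `point` with respect to the rows of `cloud`."""
    diff = point - cloud.mean(axis=0)
    cov = np.atleast_2d(np.cov(cloud, rowvar=False))
    return 1.0 / (1.0 + diff @ np.linalg.pinv(cov) @ diff)

def biological_age(ages, num, cat, person_num, person_cat,
                   candidates, window=5.0, k=2):
    """Return the candidate age (and its depth) at which the individual is
    deepest among sample members whose age is within `window` of it."""
    full = np.column_stack([ages, num])        # age is treated as a numeric feature
    num_range = np.ptp(full, axis=0)
    num_range[num_range == 0] = 1.0            # guard against constant columns
    best_age, best_depth = None, -np.inf
    for a in candidates:
        mask = np.abs(ages - a) <= window      # reference group of similar age
        if mask.sum() < k + 2:
            continue
        # Only the age changes; all other features of the individual are kept.
        all_num = np.vstack([full[mask], np.concatenate([[a], person_num])])
        all_cat = np.vstack([cat[mask], person_cat])
        coords = classical_mds(gower_distance(all_num, all_cat, num_range), k)
        depth = mahalanobis_depth(coords[-1], coords[:-1])
        if depth > best_depth:
            best_age, best_depth = a, depth
    return best_age, best_depth

# Synthetic illustration (not data from the paper): two biomarkers that
# drift with age, plus one binary categorical variable.
rng = np.random.default_rng(0)
n = 400
ages = rng.uniform(30, 80, n)
num = np.column_stack([ages + rng.normal(0, 2, n),
                       200 - 1.5 * ages + rng.normal(0, 3, n)])
cat = rng.integers(0, 2, size=(n, 1))

# An individual whose biomarkers look typical of someone aged about 60.
est_age, est_depth = biological_age(
    ages, num, cat,
    person_num=np.array([60.0, 110.0]), person_cat=np.array([0]),
    candidates=np.arange(40, 75, 5))
print(est_age, est_depth)
```

Re-embedding the individual with each reference group keeps the sketch short, but it lets the new point slightly influence the configuration; an out-of-sample projection would avoid that and may be closer to what a full implementation needs.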
About the journal:
Studies directed toward the more effective utilization of existing resources, e.g. mathematical programming models of health care delivery systems with relevance to more effective program design; systems analysis of fire outbreaks and its relevance to the location of fire stations; statistical analysis of the efficiency of a developing country economy or industry.
Studies relating to the interaction of various segments of society and technology, e.g. the effects of government health policies on the utilization and design of hospital facilities; the relationship between housing density and the demands on public transportation or other service facilities; patterns and implications of urban development and air or water pollution.
Studies devoted to the anticipation of and response to future needs for social, health and other human services, e.g. the relationship between industrial growth and the development of educational resources in affected areas; investigation of future demands for maternal and child health resources in a developing country; design of effective recycling in an urban setting.