{"title":"来自CAB数据集的健康数据的分析和可视化","authors":"Y. Bhavsar, Mitaxi Mehta","doi":"10.1109/ICAMMAET.2017.8186717","DOIUrl":null,"url":null,"abstract":"Many datasets contain variables that take binary values. Often one would like to subset the data set according to the value of such a binary variables and compare and contrast the statistical parameters for such subsets. We have written an R code to analyze such datasets and create plots to give comparison of mean of variables for such binary partitions. We show the result of this analysis for the CAB database which has health data from several Indian states. The data contains survey from the year 2014 with total 53 health indicators, covering 8 states and with total data 13.8 MB. We also show the state-wise means of several partitioned and normalized variables using a single plot. Two binary variables have been used, a demographic one (rural/urban) and the gender (male/female), to partition and compare the database.","PeriodicalId":425974,"journal":{"name":"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis and visualization of health data from the CAB dataset\",\"authors\":\"Y. Bhavsar, Mitaxi Mehta\",\"doi\":\"10.1109/ICAMMAET.2017.8186717\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many datasets contain variables that take binary values. Often one would like to subset the data set according to the value of such a binary variables and compare and contrast the statistical parameters for such subsets. We have written an R code to analyze such datasets and create plots to give comparison of mean of variables for such binary partitions. We show the result of this analysis for the CAB database which has health data from several Indian states. The data contains survey from the year 2014 with total 53 health indicators, covering 8 states and with total data 13.8 MB. We also show the state-wise means of several partitioned and normalized variables using a single plot. Two binary variables have been used, a demographic one (rural/urban) and the gender (male/female), to partition and compare the database.\",\"PeriodicalId\":425974,\"journal\":{\"name\":\"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAMMAET.2017.8186717\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAMMAET.2017.8186717","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis and visualization of health data from the CAB dataset
Many datasets contain variables that take binary values. Often one would like to subset the data set according to the value of such a binary variables and compare and contrast the statistical parameters for such subsets. We have written an R code to analyze such datasets and create plots to give comparison of mean of variables for such binary partitions. We show the result of this analysis for the CAB database which has health data from several Indian states. The data contains survey from the year 2014 with total 53 health indicators, covering 8 states and with total data 13.8 MB. We also show the state-wise means of several partitioned and normalized variables using a single plot. Two binary variables have been used, a demographic one (rural/urban) and the gender (male/female), to partition and compare the database.