A topological approach of multiple correspondence analysis

Q4 Mathematics
Rafik Abdesselam
{"title":"A topological approach of multiple correspondence analysis","authors":"Rafik Abdesselam","doi":"10.1080/23737484.2020.1830733","DOIUrl":null,"url":null,"abstract":"ABSTRACT Topological multiple correspondence analysis (TMCA) studies a group of categorical variables defined on the same set of individuals. It is a topological method of data analysis that consists of exploring, analyzing, and representing the associations between several qualitative variables in the context of multiple correspondence analysis (MCA). It compares and classifies proximity measures to select the best one according to the data under consideration, then analyzes, interprets, and visualizes with graphic representations, the possible associations between several categorical variables relating to the known problem of MCA. Based on the notion of neighborhood graphs, some of these proximity measures are more-or-less equivalent. A topological equivalence index between two measures is defined and statistically tested according to the degree of description of the associations between the modalities of these qualitative variables. We compare proximity measures and propose a topological criterion for choosing the best association measure, adapted to the data considered, from among some of the most widely used proximity measures for categorical data. The principle of the proposed approach is illustrated using a real dataset with conventional proximity measures for binary variables from the literature. The first step is to find the proximity measure that can best be adapted to the data; the second step is to use this measure to perform the TMCA.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"1 1","pages":"429 - 447"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics Case Studies Data Analysis and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/23737484.2020.1830733","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 1

Abstract

ABSTRACT Topological multiple correspondence analysis (TMCA) studies a group of categorical variables defined on the same set of individuals. It is a topological method of data analysis that consists of exploring, analyzing, and representing the associations between several qualitative variables in the context of multiple correspondence analysis (MCA). It compares and classifies proximity measures to select the best one according to the data under consideration, then analyzes, interprets, and visualizes with graphic representations, the possible associations between several categorical variables relating to the known problem of MCA. Based on the notion of neighborhood graphs, some of these proximity measures are more-or-less equivalent. A topological equivalence index between two measures is defined and statistically tested according to the degree of description of the associations between the modalities of these qualitative variables. We compare proximity measures and propose a topological criterion for choosing the best association measure, adapted to the data considered, from among some of the most widely used proximity measures for categorical data. The principle of the proposed approach is illustrated using a real dataset with conventional proximity measures for binary variables from the literature. The first step is to find the proximity measure that can best be adapted to the data; the second step is to use this measure to perform the TMCA.
多对应分析的拓扑方法
拓扑多重对应分析(TMCA)研究定义在同一个体集合上的一组分类变量。它是一种数据分析的拓扑方法,包括在多重对应分析(MCA)的背景下探索、分析和表示几个定性变量之间的关联。它根据所考虑的数据对接近度量进行比较和分类,以选择最佳的接近度量,然后分析、解释和可视化与已知MCA问题相关的几个分类变量之间的可能关联。基于邻域图的概念,这些接近度量中的一些或多或少是等价的。根据这些定性变量的模态之间的关联的描述程度,定义了两个度量之间的拓扑等价指数并进行了统计检验。我们比较了接近度量,并提出了一个拓扑标准,用于从分类数据中最广泛使用的接近度量中选择适合所考虑数据的最佳关联度量。所提出的方法的原理是用一个真实的数据集来说明的,该数据集具有文献中二元变量的传统接近度量。第一步是找到最适合数据的接近度量;第二步是使用这一措施来执行TMCA。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
29
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信