Viktorie Nesrstová, Ines Wilms, Karel Hron, Peter Filzmoser
{"title":"用稀疏主成分分析识别组合数据中的重要成对对数。","authors":"Viktorie Nesrstová, Ines Wilms, Karel Hron, Peter Filzmoser","doi":"10.1007/s11004-024-10159-0","DOIUrl":null,"url":null,"abstract":"<p><p>Compositional data are characterized by the fact that their elemental information is contained in simple pairwise logratios of the parts that constitute the composition. While pairwise logratios are typically easy to interpret, the number of possible pairs to consider quickly becomes too large even for medium-sized compositions, which may hinder interpretability in further multivariate analysis. Sparse methods can therefore be useful for identifying a few important pairwise logratios (and parts contained in them) from the total candidate set. To this end, we propose a procedure based on the construction of all possible pairwise logratios and employ sparse principal component analysis to identify important pairwise logratios. The performance of the procedure is demonstrated with both simulated and real-world data. In our empirical analysis, we propose three visual tools showing (i) the balance between sparsity and explained variability, (ii) the stability of the pairwise logratios, and (iii) the importance of the original compositional parts to aid practitioners in their model interpretation.</p>","PeriodicalId":51117,"journal":{"name":"Mathematical Geosciences","volume":"57 2","pages":"333-358"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11805788/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identifying Important Pairwise Logratios in Compositional Data with Sparse Principal Component Analysis.\",\"authors\":\"Viktorie Nesrstová, Ines Wilms, Karel Hron, Peter Filzmoser\",\"doi\":\"10.1007/s11004-024-10159-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Compositional data are characterized by the fact that their elemental information is contained in simple pairwise logratios of the parts that constitute the composition. While pairwise logratios are typically easy to interpret, the number of possible pairs to consider quickly becomes too large even for medium-sized compositions, which may hinder interpretability in further multivariate analysis. Sparse methods can therefore be useful for identifying a few important pairwise logratios (and parts contained in them) from the total candidate set. To this end, we propose a procedure based on the construction of all possible pairwise logratios and employ sparse principal component analysis to identify important pairwise logratios. The performance of the procedure is demonstrated with both simulated and real-world data. In our empirical analysis, we propose three visual tools showing (i) the balance between sparsity and explained variability, (ii) the stability of the pairwise logratios, and (iii) the importance of the original compositional parts to aid practitioners in their model interpretation.</p>\",\"PeriodicalId\":51117,\"journal\":{\"name\":\"Mathematical Geosciences\",\"volume\":\"57 2\",\"pages\":\"333-358\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11805788/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Geosciences\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s11004-024-10159-0\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/10/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Geosciences","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11004-024-10159-0","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
Identifying Important Pairwise Logratios in Compositional Data with Sparse Principal Component Analysis.
Compositional data are characterized by the fact that their elemental information is contained in simple pairwise logratios of the parts that constitute the composition. While pairwise logratios are typically easy to interpret, the number of possible pairs to consider quickly becomes too large even for medium-sized compositions, which may hinder interpretability in further multivariate analysis. Sparse methods can therefore be useful for identifying a few important pairwise logratios (and parts contained in them) from the total candidate set. To this end, we propose a procedure based on the construction of all possible pairwise logratios and employ sparse principal component analysis to identify important pairwise logratios. The performance of the procedure is demonstrated with both simulated and real-world data. In our empirical analysis, we propose three visual tools showing (i) the balance between sparsity and explained variability, (ii) the stability of the pairwise logratios, and (iii) the importance of the original compositional parts to aid practitioners in their model interpretation.
期刊介绍:
Mathematical Geosciences (formerly Mathematical Geology) publishes original, high-quality, interdisciplinary papers in geomathematics focusing on quantitative methods and studies of the Earth, its natural resources and the environment. This international publication is the official journal of the IAMG. Mathematical Geosciences is an essential reference for researchers and practitioners of geomathematics who develop and apply quantitative models to earth science and geo-engineering problems.