{"title":"基于流形学习的星系形态分类","authors":"Vasyl Semenov , Vitalii Tymchyshyn , Volodymyr Bezguba , Maksym Tsizh , Andrii Khlevniuk","doi":"10.1016/j.ascom.2025.100963","DOIUrl":null,"url":null,"abstract":"<div><div>We address the problem of morphological classification of galaxies from the Galaxy Zoo DECaLS dataset using classical machine learning techniques. Our approach employs a dimensionality reduction method followed by a classical classifier to categorize galaxies based on shape (cigar/in-between/ round; edge-on/face-on) and texture (smooth/featured). We evaluate various dimensionality reduction methods, including Locally Linear Embedding (LLE), Isomap, Uniform Manifold Approximation and Projection (UMAP), t-SNE, and Principal Component Analysis (PCA). Our results demonstrate that most classical classifiers achieve their highest performance when combined with LLE, attaining accuracy comparable to that of simple neural networks. Moreover, in the case of shape classification, the three-dimensional representation remains interpretable, in contrast to the commonly observed loss of interpretability following nonlinear transformations. Additionally, we explore dimensionality reduction followed by k-means clustering to assess whether the data exhibits a natural tendency toward a specific number of clusters. We evaluate clustering performance using silhouette, elbow, Dunn, and Davies–Bouldin scores. While the Davies–Bouldin score indicates a slight preference for four clusters — closely aligning with classifications made by human astronomers — the other metrics do not support a distinct clustering structure.</div></div>","PeriodicalId":48757,"journal":{"name":"Astronomy and Computing","volume":"52 ","pages":"Article 100963"},"PeriodicalIF":1.9000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Galaxy morphological classification with manifold learning\",\"authors\":\"Vasyl Semenov , Vitalii Tymchyshyn , Volodymyr Bezguba , Maksym Tsizh , Andrii Khlevniuk\",\"doi\":\"10.1016/j.ascom.2025.100963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We address the problem of morphological classification of galaxies from the Galaxy Zoo DECaLS dataset using classical machine learning techniques. Our approach employs a dimensionality reduction method followed by a classical classifier to categorize galaxies based on shape (cigar/in-between/ round; edge-on/face-on) and texture (smooth/featured). We evaluate various dimensionality reduction methods, including Locally Linear Embedding (LLE), Isomap, Uniform Manifold Approximation and Projection (UMAP), t-SNE, and Principal Component Analysis (PCA). Our results demonstrate that most classical classifiers achieve their highest performance when combined with LLE, attaining accuracy comparable to that of simple neural networks. Moreover, in the case of shape classification, the three-dimensional representation remains interpretable, in contrast to the commonly observed loss of interpretability following nonlinear transformations. Additionally, we explore dimensionality reduction followed by k-means clustering to assess whether the data exhibits a natural tendency toward a specific number of clusters. We evaluate clustering performance using silhouette, elbow, Dunn, and Davies–Bouldin scores. While the Davies–Bouldin score indicates a slight preference for four clusters — closely aligning with classifications made by human astronomers — the other metrics do not support a distinct clustering structure.</div></div>\",\"PeriodicalId\":48757,\"journal\":{\"name\":\"Astronomy and Computing\",\"volume\":\"52 \",\"pages\":\"Article 100963\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Astronomy and Computing\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2213133725000368\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astronomy and Computing","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2213133725000368","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
Galaxy morphological classification with manifold learning
We address the problem of morphological classification of galaxies from the Galaxy Zoo DECaLS dataset using classical machine learning techniques. Our approach employs a dimensionality reduction method followed by a classical classifier to categorize galaxies based on shape (cigar/in-between/ round; edge-on/face-on) and texture (smooth/featured). We evaluate various dimensionality reduction methods, including Locally Linear Embedding (LLE), Isomap, Uniform Manifold Approximation and Projection (UMAP), t-SNE, and Principal Component Analysis (PCA). Our results demonstrate that most classical classifiers achieve their highest performance when combined with LLE, attaining accuracy comparable to that of simple neural networks. Moreover, in the case of shape classification, the three-dimensional representation remains interpretable, in contrast to the commonly observed loss of interpretability following nonlinear transformations. Additionally, we explore dimensionality reduction followed by k-means clustering to assess whether the data exhibits a natural tendency toward a specific number of clusters. We evaluate clustering performance using silhouette, elbow, Dunn, and Davies–Bouldin scores. While the Davies–Bouldin score indicates a slight preference for four clusters — closely aligning with classifications made by human astronomers — the other metrics do not support a distinct clustering structure.
Astronomy and ComputingASTRONOMY & ASTROPHYSICSCOMPUTER SCIENCE,-COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.10
自引率
8.00%
发文量
67
期刊介绍:
Astronomy and Computing is a peer-reviewed journal that focuses on the broad area between astronomy, computer science and information technology. The journal aims to publish the work of scientists and (software) engineers in all aspects of astronomical computing, including the collection, analysis, reduction, visualisation, preservation and dissemination of data, and the development of astronomical software and simulations. The journal covers applications for academic computer science techniques to astronomy, as well as novel applications of information technologies within astronomy.