Title: Comparison and Analysis of Different Machine Learning Methods on Asteroid Diameter Predictions Based on the NASA Small Celestial Bodies Database
Authors: B. E. Duisek, D. D. Sarsembin, K. A. Abdurazak
Journal: Ķazaķstan-Britan tehnikalyķ universitetìnìņ habaršysy
Publication date: 2023-09-30
Publication type: Journal Article
DOI: 10.55452/1998-6688-2023-20-3-7-16 (https://doi.org/10.55452/1998-6688-2023-20-3-7-16)
Citation count: 0
Abstract
The NASA database of small celestial bodies, provided by the Jet Propulsion Laboratory, collects information about asteroids and comets: their observable and derivable parameters, including physical ones, their classification, and data on the number and duration of observations. Many of these celestial bodies have incomplete descriptions of their properties, which makes it difficult to predict their behavior and potential interaction with other objects in space, including man-made ones. This study addresses part of this problem by predicting asteroid diameters from the NASA database using machine learning methods applied to preprocessed data from the source. Several of the most commonly used algorithms for such prediction models were selected: KNN, linear regression, random forest, decision trees, and gradient boosting. The algorithms were evaluated on diameter prediction accuracy, the speed of training and prediction, and mean squared error. The study helps to choose the most suitable approach for predicting this property of asteroids, describes the data preprocessing that yields the best model performance, and analyzes the correlations between the properties of these celestial bodies.
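The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: it assumes scikit-learn and NumPy, and it replaces the NASA data with synthetic samples generated by a hypothetical helper, `make_synthetic_asteroids`, which loosely follows the standard relation between absolute magnitude, albedo, and diameter. The feature choice, hyperparameters, and the log-diameter target are all assumptions made for illustration. Each of the five model families is scored on R², mean squared error, and training and prediction time.

```python
import time

import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.tree import DecisionTreeRegressor


def make_synthetic_asteroids(n_samples=2000, seed=0):
    """Hypothetical stand-in data; this is NOT the NASA database."""
    rng = np.random.default_rng(seed)
    abs_magnitude = rng.uniform(10.0, 20.0, n_samples)  # absolute magnitude H
    albedo = rng.uniform(0.03, 0.5, n_samples)          # geometric albedo
    # Standard relation D [km] = 1329 / sqrt(albedo) * 10^(-H / 5), with noise.
    diameter = 1329.0 / np.sqrt(albedo) * 10.0 ** (-abs_magnitude / 5.0)
    diameter *= rng.lognormal(0.0, 0.05, n_samples)
    X = np.column_stack([abs_magnitude, albedo])
    y = np.log10(diameter)  # assumption: predict log-diameter
    return X, y


def compare_models(X, y, seed=0):
    """Fit each regressor and record R^2, MSE, and fit/predict wall time."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed
    )
    # Hyperparameters are illustrative defaults, not the paper's settings.
    models = {
        "KNN": KNeighborsRegressor(n_neighbors=5),
        "Linear regression": LinearRegression(),
        "Decision tree": DecisionTreeRegressor(random_state=seed),
        "Random forest": RandomForestRegressor(n_estimators=50, random_state=seed),
        "Gradient boosting": GradientBoostingRegressor(random_state=seed),
    }
    results = {}
    for name, model in models.items():
        start = time.perf_counter()
        model.fit(X_train, y_train)
        fit_seconds = time.perf_counter() - start

        start = time.perf_counter()
        predictions = model.predict(X_test)
        predict_seconds = time.perf_counter() - start

        results[name] = {
            "r2": r2_score(y_test, predictions),
            "mse": mean_squared_error(y_test, predictions),
            "fit_s": fit_seconds,
            "predict_s": predict_seconds,
        }
    return results


if __name__ == "__main__":
    X, y = make_synthetic_asteroids()
    for name, row in compare_models(X, y).items():
        print(
            f"{name:18s} R2={row['r2']:.3f} MSE={row['mse']:.4f} "
            f"fit={row['fit_s']:.3f}s predict={row['predict_s']:.3f}s"
        )
```

On real data, the synthetic generator would be replaced by the preprocessed database columns. The timings depend on hardware and sample size, so they are only meaningful relative to one another within a single run.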