基于NASA小天体数据库的不同机器学习方法在小行星直径预测中的比较分析

B. E. Duisek, D. D. Sarsembin, K. A. Abdurazak
{"title":"基于NASA小天体数据库的不同机器学习方法在小行星直径预测中的比较分析","authors":"B. E. Duisek, D. D. Sarsembin, K. A. Abdurazak","doi":"10.55452/1998-6688-2023-20-3-7-16","DOIUrl":null,"url":null,"abstract":"The database of small celestial bodies NASA is provided by the Jet Propulsion Laboratory and represents the collected information about asteroids and comets, describing their parameters available for observation and determination, including physical ones, as well as their classification and data on the number and duration of observation. Many of these celestial techs have an incomplete description of their properties, which makes it difficult to predict their behavior and potential interaction with other objects in space, including man-made ones. This study proposes a solution to a certain part of the problems of asteroid exploration by finding a prediction of the diameter of asteroids based on information from the NASA database and the results of machine learning methods on processed data from the source. For this research, some of the most commonly used algorithms for implementing such prediction models have been selected, such as KNN, linear regression, random forest, decision trees, and gradient boosting. Applied machine learning algorithms were evaluated based on the results of diameter prediction accuracy, speed of training and prediction process, and square mean error rates. The study will help to choose the most optimal approach for predicting this feature of asteroids, describe the process of data pre-processing, while achieving the best performance of the model, and analyze the correlations between the properties of these celestial bodies.","PeriodicalId":473042,"journal":{"name":"Ķazaķstan-Britan tehnikalyķ universitetìnìņ habaršysy","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"COMPARISON AND ANALYSIS OF DIFFERENT MACHINE LEARNING METHODS ON ASTEROID DIAMETER PREDICTIONS BASED ON THE NASA SMALL CELESTIAL BODIES DATABASE\",\"authors\":\"B. E. Duisek, D. D. Sarsembin, K. A. Abdurazak\",\"doi\":\"10.55452/1998-6688-2023-20-3-7-16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The database of small celestial bodies NASA is provided by the Jet Propulsion Laboratory and represents the collected information about asteroids and comets, describing their parameters available for observation and determination, including physical ones, as well as their classification and data on the number and duration of observation. Many of these celestial techs have an incomplete description of their properties, which makes it difficult to predict their behavior and potential interaction with other objects in space, including man-made ones. This study proposes a solution to a certain part of the problems of asteroid exploration by finding a prediction of the diameter of asteroids based on information from the NASA database and the results of machine learning methods on processed data from the source. For this research, some of the most commonly used algorithms for implementing such prediction models have been selected, such as KNN, linear regression, random forest, decision trees, and gradient boosting. Applied machine learning algorithms were evaluated based on the results of diameter prediction accuracy, speed of training and prediction process, and square mean error rates. The study will help to choose the most optimal approach for predicting this feature of asteroids, describe the process of data pre-processing, while achieving the best performance of the model, and analyze the correlations between the properties of these celestial bodies.\",\"PeriodicalId\":473042,\"journal\":{\"name\":\"Ķazaķstan-Britan tehnikalyķ universitetìnìņ habaršysy\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ķazaķstan-Britan tehnikalyķ universitetìnìņ habaršysy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.55452/1998-6688-2023-20-3-7-16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ķazaķstan-Britan tehnikalyķ universitetìnìņ habaršysy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55452/1998-6688-2023-20-3-7-16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

美国航空航天局的小天体数据库是由喷气推进实验室提供的,它代表了收集到的关于小行星和彗星的信息,描述了它们可用于观测和确定的参数,包括物理参数,以及它们的分类和观测次数和持续时间的数据。许多这些天体技术对其特性的描述不完整,这使得很难预测它们的行为以及与太空中其他物体(包括人造物体)的潜在相互作用。本研究基于NASA数据库的信息和机器学习方法对源处理数据的结果,找到了一种小行星直径的预测方法,从而解决了小行星探测的一部分问题。在本研究中,选择了一些最常用的算法来实现这些预测模型,如KNN、线性回归、随机森林、决策树和梯度增强。根据直径预测精度、训练和预测过程的速度以及平均错误率的结果对应用的机器学习算法进行评估。该研究将有助于选择最优的方法来预测小行星的这一特征,描述数据预处理的过程,同时实现模型的最佳性能,并分析这些天体属性之间的相关性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
COMPARISON AND ANALYSIS OF DIFFERENT MACHINE LEARNING METHODS ON ASTEROID DIAMETER PREDICTIONS BASED ON THE NASA SMALL CELESTIAL BODIES DATABASE
The database of small celestial bodies NASA is provided by the Jet Propulsion Laboratory and represents the collected information about asteroids and comets, describing their parameters available for observation and determination, including physical ones, as well as their classification and data on the number and duration of observation. Many of these celestial techs have an incomplete description of their properties, which makes it difficult to predict their behavior and potential interaction with other objects in space, including man-made ones. This study proposes a solution to a certain part of the problems of asteroid exploration by finding a prediction of the diameter of asteroids based on information from the NASA database and the results of machine learning methods on processed data from the source. For this research, some of the most commonly used algorithms for implementing such prediction models have been selected, such as KNN, linear regression, random forest, decision trees, and gradient boosting. Applied machine learning algorithms were evaluated based on the results of diameter prediction accuracy, speed of training and prediction process, and square mean error rates. The study will help to choose the most optimal approach for predicting this feature of asteroids, describe the process of data pre-processing, while achieving the best performance of the model, and analyze the correlations between the properties of these celestial bodies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信