{"title":"Machine Learning Detection of Collision-Risk Asteroids","authors":"Ö. Eskicioğlu, A. Işık, Onur Sevli","doi":"10.31202/ecjse.1135651","DOIUrl":null,"url":null,"abstract":"Asteroids have attracted people's attention from the past to the present. It has a wide place in the beliefs and cultures of ancient civilizations. The sense of discovery and curiosity of human beings causes an increase in their interest in these objects. With the technology coming to a certain level, the detection, diagnosis and materials of asteroids can be found clearly. The route and collision effects of these objects require constant observation. In our study, asteroids that are likely to hit the Earth have been classified using an asteroid data set in Kaggle and the source of which is NASA-JPL. The dataset contains 4687 asteroid data. Pre-processing steps such as filling in missing data, anomaly detection and normalization were applied on the data. Then, with the help of correlation, 19 features were determined from the dataset for dangerous situations. Asteroid classification was made by using Decision Tree with features, Naive Bayes, Logistic Regression, Random Forest, Support Vector Machines, K-Nearest Neighbor, Xgboost and Adaboost machine learning algorithms. With the artificial neural network with different number of neurons and layers, the data were trained and compared with classification algorithms. As a result of the comparison, the highest performance was achieved with the AdaBoost algorithm with 99.80%. Hyperparameter optimization was performed using the grid-search method in all the classification algorithms that were run. Thus, a method that requires continuous observation and enables the processing of large amounts of data in a more efficient way has been proposed.","PeriodicalId":11622,"journal":{"name":"El-Cezeri Fen ve Mühendislik Dergisi","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"El-Cezeri Fen ve Mühendislik Dergisi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31202/ecjse.1135651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Asteroids have attracted people's attention from the past to the present. It has a wide place in the beliefs and cultures of ancient civilizations. The sense of discovery and curiosity of human beings causes an increase in their interest in these objects. With the technology coming to a certain level, the detection, diagnosis and materials of asteroids can be found clearly. The route and collision effects of these objects require constant observation. In our study, asteroids that are likely to hit the Earth have been classified using an asteroid data set in Kaggle and the source of which is NASA-JPL. The dataset contains 4687 asteroid data. Pre-processing steps such as filling in missing data, anomaly detection and normalization were applied on the data. Then, with the help of correlation, 19 features were determined from the dataset for dangerous situations. Asteroid classification was made by using Decision Tree with features, Naive Bayes, Logistic Regression, Random Forest, Support Vector Machines, K-Nearest Neighbor, Xgboost and Adaboost machine learning algorithms. With the artificial neural network with different number of neurons and layers, the data were trained and compared with classification algorithms. As a result of the comparison, the highest performance was achieved with the AdaBoost algorithm with 99.80%. Hyperparameter optimization was performed using the grid-search method in all the classification algorithms that were run. Thus, a method that requires continuous observation and enables the processing of large amounts of data in a more efficient way has been proposed.
小行星从过去到现在都吸引着人们的注意。它在古代文明的信仰和文化中占有广泛的地位。人类的发现意识和好奇心使他们对这些物品的兴趣增加。随着技术发展到一定水平,小行星的探测、诊断和材料都可以清晰地找到。这些物体的路径和碰撞效果需要持续观察。在我们的研究中,可能撞击地球的小行星已经使用Kaggle的小行星数据集进行了分类,这些数据的来源是NASA-JPL。该数据集包含4687颗小行星的数据。对数据进行了缺失数据填充、异常检测和归一化等预处理。然后,在相关性的帮助下,从数据集中确定了危险情况的19个特征。采用Decision Tree with feature、Naive Bayes、Logistic Regression、Random Forest、Support Vector Machines、K-Nearest Neighbor、Xgboost和Adaboost机器学习算法对小行星进行分类。使用不同神经元数和层数的人工神经网络对数据进行训练,并与分类算法进行比较。经过比较,AdaBoost算法达到了99.80%的最高性能。在运行的所有分类算法中,使用网格搜索方法进行超参数优化。因此,提出了一种需要连续观测并能够以更有效的方式处理大量数据的方法。