Generalized Time-Series Analysis for In Situ Spacecraft Observations: Anomaly Detection and Data Prioritization Using Principal Components Analysis and Unsupervised Clustering
Matthew G. Finley, Miguel Martinez-Ledesma, William R. Paterson, Matthew R. Argall, David M. Miles, John C. Dorelli, Eftyhia Zesta
{"title":"Generalized Time-Series Analysis for In Situ Spacecraft Observations: Anomaly Detection and Data Prioritization Using Principal Components Analysis and Unsupervised Clustering","authors":"Matthew G. Finley, Miguel Martinez-Ledesma, William R. Paterson, Matthew R. Argall, David M. Miles, John C. Dorelli, Eftyhia Zesta","doi":"10.1029/2024EA003753","DOIUrl":null,"url":null,"abstract":"<p>In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near-Earth space and beyond. However, on-orbit telemetry constraints can severely limit the capability of spacecraft to transmit high-cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist-In-The-Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high-cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low-cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One-Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e-POP/Swarm-Echo and MMS missions.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"11 9","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA003753","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024EA003753","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near-Earth space and beyond. However, on-orbit telemetry constraints can severely limit the capability of spacecraft to transmit high-cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist-In-The-Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high-cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low-cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One-Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e-POP/Swarm-Echo and MMS missions.
期刊介绍:
Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.