Matthew G. Finley, Miguel Martinez-Ledesma, William R. Paterson, Matthew R. Argall, David M. Miles, John C. Dorelli, Eftyhia Zesta
{"title":"航天器现场观测的广义时间序列分析:利用主成分分析和无监督聚类进行异常检测和数据优先排序","authors":"Matthew G. Finley, Miguel Martinez-Ledesma, William R. Paterson, Matthew R. Argall, David M. Miles, John C. Dorelli, Eftyhia Zesta","doi":"10.1029/2024EA003753","DOIUrl":null,"url":null,"abstract":"<p>In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near-Earth space and beyond. However, on-orbit telemetry constraints can severely limit the capability of spacecraft to transmit high-cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist-In-The-Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high-cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low-cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One-Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e-POP/Swarm-Echo and MMS missions.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"11 9","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA003753","citationCount":"0","resultStr":"{\"title\":\"Generalized Time-Series Analysis for In Situ Spacecraft Observations: Anomaly Detection and Data Prioritization Using Principal Components Analysis and Unsupervised Clustering\",\"authors\":\"Matthew G. Finley, Miguel Martinez-Ledesma, William R. Paterson, Matthew R. Argall, David M. Miles, John C. Dorelli, Eftyhia Zesta\",\"doi\":\"10.1029/2024EA003753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near-Earth space and beyond. However, on-orbit telemetry constraints can severely limit the capability of spacecraft to transmit high-cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist-In-The-Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high-cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low-cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One-Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e-POP/Swarm-Echo and MMS missions.</p>\",\"PeriodicalId\":54286,\"journal\":{\"name\":\"Earth and Space Science\",\"volume\":\"11 9\",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA003753\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Earth and Space Science\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1029/2024EA003753\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024EA003753","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
Generalized Time-Series Analysis for In Situ Spacecraft Observations: Anomaly Detection and Data Prioritization Using Principal Components Analysis and Unsupervised Clustering
In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near-Earth space and beyond. However, on-orbit telemetry constraints can severely limit the capability of spacecraft to transmit high-cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist-In-The-Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high-cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low-cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One-Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e-POP/Swarm-Echo and MMS missions.
期刊介绍:
Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.