Victoria Da Poian, E. I. Lyness, J. Y. Qi, I. Shah, G. Lipstein, P. D. Archer, L. Chou, C. Freissinet, C. Malespin, A. McAdam, C. A. Knudson, B. P. Theiling, S. M. H”orst
{"title":"利用开放科学机器学习挑战数据受限的行星任务仪器","authors":"Victoria Da Poian, E. I. Lyness, J. Y. Qi, I. Shah, G. Lipstein, P. D. Archer, L. Chou, C. Freissinet, C. Malespin, A. McAdam, C. A. Knudson, B. P. Theiling, S. M. H”orst","doi":"10.1093/rasti/rzae009","DOIUrl":null,"url":null,"abstract":"\n We set up two open-science machine learning (ML) challenges focusing on building models to automatically analyze mass spectrometry (MS) data for Mars exploration. ML challenges provide an excellent way to engage a diverse set of experts with benchmark training data, explore a wide range of ML and data science approaches, and identify promising models based on empirical results, as well as to get independent external analyses to compare to those of the internal team. These two challenges were proof-of-concept projects to analyze the feasibility of combining data collected from different instruments in a single ML application. We selected mass spectrometry data from 1) commercial instruments and 2) the Sample Analysis at Mars (SAM, an instrument suite that includes a mass spectrometer subsystem onboard the Curiosity rover) testbed. These challenges, organized with DrivenData, gathered more than 1,150 unique participants from all over the world, and obtained more than 600 solutions contributing powerful models to the analysis of rock and soil samples relevant to planetary science using various mass spectrometry datasets. These two challenges demonstrated the suitability and value of multiple ML approaches to classifying planetary analog datasets from both commercial and flight-like instruments.\n We present the processes from the problem identification, challenge setups, and challenge results that gathered creative and diverse solutions from worldwide participants, in some cases with no backgrounds in mass spectrometry. We also present the potential and limitations of these solutions for ML application in future planetary missions. Our longer-term goal is to deploy these powerful methods onboard the spacecraft to autonomously guide space operations and reduce ground-in-the-loop reliance.","PeriodicalId":367327,"journal":{"name":"RAS Techniques and Instruments","volume":"62 S1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging open science machine learning challenges for data constrained planetary mission instruments\",\"authors\":\"Victoria Da Poian, E. I. Lyness, J. Y. Qi, I. Shah, G. Lipstein, P. D. Archer, L. Chou, C. Freissinet, C. Malespin, A. McAdam, C. A. Knudson, B. P. Theiling, S. M. H”orst\",\"doi\":\"10.1093/rasti/rzae009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n We set up two open-science machine learning (ML) challenges focusing on building models to automatically analyze mass spectrometry (MS) data for Mars exploration. ML challenges provide an excellent way to engage a diverse set of experts with benchmark training data, explore a wide range of ML and data science approaches, and identify promising models based on empirical results, as well as to get independent external analyses to compare to those of the internal team. These two challenges were proof-of-concept projects to analyze the feasibility of combining data collected from different instruments in a single ML application. We selected mass spectrometry data from 1) commercial instruments and 2) the Sample Analysis at Mars (SAM, an instrument suite that includes a mass spectrometer subsystem onboard the Curiosity rover) testbed. These challenges, organized with DrivenData, gathered more than 1,150 unique participants from all over the world, and obtained more than 600 solutions contributing powerful models to the analysis of rock and soil samples relevant to planetary science using various mass spectrometry datasets. These two challenges demonstrated the suitability and value of multiple ML approaches to classifying planetary analog datasets from both commercial and flight-like instruments.\\n We present the processes from the problem identification, challenge setups, and challenge results that gathered creative and diverse solutions from worldwide participants, in some cases with no backgrounds in mass spectrometry. We also present the potential and limitations of these solutions for ML application in future planetary missions. Our longer-term goal is to deploy these powerful methods onboard the spacecraft to autonomously guide space operations and reduce ground-in-the-loop reliance.\",\"PeriodicalId\":367327,\"journal\":{\"name\":\"RAS Techniques and Instruments\",\"volume\":\"62 S1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RAS Techniques and Instruments\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/rasti/rzae009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RAS Techniques and Instruments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/rasti/rzae009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
我们设立了两个开放科学机器学习(ML)挑战赛,重点是为火星探测建立自动分析质谱(MS)数据的模型。机器学习挑战赛提供了一种极好的方式,让不同的专家利用基准训练数据参与其中,探索各种机器学习和数据科学方法,并根据经验结果确定有前途的模型,同时获得独立的外部分析结果,以便与内部团队的分析结果进行比较。这两项挑战是概念验证项目,旨在分析在单一 ML 应用程序中结合从不同仪器收集的数据的可行性。我们选择的质谱数据来自:1)商业仪器;2)火星样本分析(SAM,一种包括好奇号漫游车搭载的质谱仪子系统在内的仪器套件)试验台。与DrivenData共同组织的这些挑战赛聚集了来自世界各地的1150多名参与者,并获得了600多个解决方案,这些解决方案为利用各种质谱数据集分析与行星科学相关的岩石和土壤样本提供了强大的模型。这两项挑战展示了多种 ML 方法的适用性和价值,可用于对商用仪器和飞行类仪器的行星模拟数据集进行分类。我们介绍了从问题识别、挑战设置到挑战结果的过程,这些过程汇集了来自世界各地参与者的创造性和多样化的解决方案,在某些情况下,这些参与者并没有质谱分析的背景。我们还介绍了这些解决方案在未来行星任务中应用于质谱分析的潜力和局限性。我们的长期目标是在航天器上部署这些强大的方法,以自主指导太空操作,减少对地面的依赖。
Leveraging open science machine learning challenges for data constrained planetary mission instruments
We set up two open-science machine learning (ML) challenges focusing on building models to automatically analyze mass spectrometry (MS) data for Mars exploration. ML challenges provide an excellent way to engage a diverse set of experts with benchmark training data, explore a wide range of ML and data science approaches, and identify promising models based on empirical results, as well as to get independent external analyses to compare to those of the internal team. These two challenges were proof-of-concept projects to analyze the feasibility of combining data collected from different instruments in a single ML application. We selected mass spectrometry data from 1) commercial instruments and 2) the Sample Analysis at Mars (SAM, an instrument suite that includes a mass spectrometer subsystem onboard the Curiosity rover) testbed. These challenges, organized with DrivenData, gathered more than 1,150 unique participants from all over the world, and obtained more than 600 solutions contributing powerful models to the analysis of rock and soil samples relevant to planetary science using various mass spectrometry datasets. These two challenges demonstrated the suitability and value of multiple ML approaches to classifying planetary analog datasets from both commercial and flight-like instruments.
We present the processes from the problem identification, challenge setups, and challenge results that gathered creative and diverse solutions from worldwide participants, in some cases with no backgrounds in mass spectrometry. We also present the potential and limitations of these solutions for ML application in future planetary missions. Our longer-term goal is to deploy these powerful methods onboard the spacecraft to autonomously guide space operations and reduce ground-in-the-loop reliance.