Xiaorou Zheng , Jianxin Jia , Shoubin Dong , Yawei Wang , Runuo Lu , Yuwei Chen , Yueming Wang
{"title":"机器学习算法的训练和推理时间效率评估框架:高光谱图像分类的案例研究","authors":"Xiaorou Zheng , Jianxin Jia , Shoubin Dong , Yawei Wang , Runuo Lu , Yuwei Chen , Yueming Wang","doi":"10.1016/j.jag.2025.104591","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing complexity and scale of remote sensing datasets, coupled with the challenges of accurately estimating algorithmic time efficiency, often lead to significant resource waste or even failure when using machine learning algorithms in urgent or resource-constrained scenarios. Accurate time efficiency estimation is critical for deploying effective algorithms, yet it remains challenging due to the many factors influencing computational performance. Traditional methods of evaluating time efficiency often neglect the effects of core model parameters and complex data scales in spectral and temporal dimensions. In addition, inference time, an essential factor in real-world applications, is often overlooked. To address these limitations, we propose the Time Efficiency Assessment Framework (TEAF), a novel method for evaluating the time efficiency of machine learning algorithms. Through mathematical reasoning, TEAF models the training and inference time as functions (<span><math><mi>ψ</mi></math></span>) of complex data scales and core model parameters. The strong linear correlation between <span><math><mi>ψ</mi></math></span> and the actual runtime allows TEAF to accurately predict the time and cost of machine learning tasks with a low computational overhead before algorithm execution. To validate this framework, we derived TEAF formulations for five classical machine learning algorithms and tested them on state-of-the-art hyperspectral image datasets and Sentinel-2 multispectral datasets. The results demonstrated that TEAF could accurately predict both training and inference time for various algorithms, with a strong linear correlation between <span><math><mi>ψ</mi></math></span> and actual runtime (<span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>></mo><mn>0</mn><mo>.</mo><mn>942</mn></mrow></math></span>). This study offers a practical solution to the challenges posed by the increasing volume and complexity of data in remote sensing image processing. The code is available at <span><span>https://github.com/SCUT-CCNL/TEAF</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"141 ","pages":"Article 104591"},"PeriodicalIF":7.6000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Training and inference Time Efficiency Assessment Framework for machine learning algorithms: A case study for hyperspectral image classification\",\"authors\":\"Xiaorou Zheng , Jianxin Jia , Shoubin Dong , Yawei Wang , Runuo Lu , Yuwei Chen , Yueming Wang\",\"doi\":\"10.1016/j.jag.2025.104591\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing complexity and scale of remote sensing datasets, coupled with the challenges of accurately estimating algorithmic time efficiency, often lead to significant resource waste or even failure when using machine learning algorithms in urgent or resource-constrained scenarios. Accurate time efficiency estimation is critical for deploying effective algorithms, yet it remains challenging due to the many factors influencing computational performance. Traditional methods of evaluating time efficiency often neglect the effects of core model parameters and complex data scales in spectral and temporal dimensions. In addition, inference time, an essential factor in real-world applications, is often overlooked. To address these limitations, we propose the Time Efficiency Assessment Framework (TEAF), a novel method for evaluating the time efficiency of machine learning algorithms. Through mathematical reasoning, TEAF models the training and inference time as functions (<span><math><mi>ψ</mi></math></span>) of complex data scales and core model parameters. The strong linear correlation between <span><math><mi>ψ</mi></math></span> and the actual runtime allows TEAF to accurately predict the time and cost of machine learning tasks with a low computational overhead before algorithm execution. To validate this framework, we derived TEAF formulations for five classical machine learning algorithms and tested them on state-of-the-art hyperspectral image datasets and Sentinel-2 multispectral datasets. The results demonstrated that TEAF could accurately predict both training and inference time for various algorithms, with a strong linear correlation between <span><math><mi>ψ</mi></math></span> and actual runtime (<span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>></mo><mn>0</mn><mo>.</mo><mn>942</mn></mrow></math></span>). This study offers a practical solution to the challenges posed by the increasing volume and complexity of data in remote sensing image processing. The code is available at <span><span>https://github.com/SCUT-CCNL/TEAF</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"141 \",\"pages\":\"Article 104591\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843225002389\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225002389","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
Training and inference Time Efficiency Assessment Framework for machine learning algorithms: A case study for hyperspectral image classification
The increasing complexity and scale of remote sensing datasets, coupled with the challenges of accurately estimating algorithmic time efficiency, often lead to significant resource waste or even failure when using machine learning algorithms in urgent or resource-constrained scenarios. Accurate time efficiency estimation is critical for deploying effective algorithms, yet it remains challenging due to the many factors influencing computational performance. Traditional methods of evaluating time efficiency often neglect the effects of core model parameters and complex data scales in spectral and temporal dimensions. In addition, inference time, an essential factor in real-world applications, is often overlooked. To address these limitations, we propose the Time Efficiency Assessment Framework (TEAF), a novel method for evaluating the time efficiency of machine learning algorithms. Through mathematical reasoning, TEAF models the training and inference time as functions () of complex data scales and core model parameters. The strong linear correlation between and the actual runtime allows TEAF to accurately predict the time and cost of machine learning tasks with a low computational overhead before algorithm execution. To validate this framework, we derived TEAF formulations for five classical machine learning algorithms and tested them on state-of-the-art hyperspectral image datasets and Sentinel-2 multispectral datasets. The results demonstrated that TEAF could accurately predict both training and inference time for various algorithms, with a strong linear correlation between and actual runtime (). This study offers a practical solution to the challenges posed by the increasing volume and complexity of data in remote sensing image processing. The code is available at https://github.com/SCUT-CCNL/TEAF.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.