Performance Evaluation最新文献

筛选
英文 中文
Strategic pricing and ranking in recommendation systems with seller competition 考虑卖家竞争的推荐系统策略定价与排名
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-13 DOI: 10.1016/j.peva.2025.102518
Tushar Shankar Walunj , Veeraruna Kavitha , Jayakrishnan Nair , Priyank Agarwal
{"title":"Strategic pricing and ranking in recommendation systems with seller competition","authors":"Tushar Shankar Walunj ,&nbsp;Veeraruna Kavitha ,&nbsp;Jayakrishnan Nair ,&nbsp;Priyank Agarwal","doi":"10.1016/j.peva.2025.102518","DOIUrl":"10.1016/j.peva.2025.102518","url":null,"abstract":"<div><div>We study a recommendation system where sellers compete for visibility by strategically offering commissions to a platform that optimally curates a ranked menu of items and their respective prices for each customer. Customers interact sequentially with the menu following a cascade click model, and their purchase decisions are influenced by price sensitivity and positions of various items in the menu. We model the seller-platform interaction as a Stackelberg game with sellers as leaders and consider two different games depending on whether the prices are set by the platform or prefixed by the sellers.</div><div>It is complicated to find the optimal policy of the platform in complete generality; hence, we solve the problem in an important asymptotic regime. In fact, both the games coincide in this regime, obtained by decreasing the customer exploration rates <span><math><mi>γ</mi></math></span> to zero (in this regime, the customers explore fewer items). Through simulations, we illustrate that the limit game well approximates the original game(s) even for exploration probabilities as high as 0.4 (the differences are around 2.54%). Further, the second game (where the sellers prefix the prices) coincides with the approximate game for all values of <span><math><mi>γ</mi></math></span>.</div><div>The core contribution of this paper lies in characterizing the equilibrium structure of the limit game. We show that when sellers are of different strengths, the standard Nash equilibrium does not exist due to discontinuities in utilities. We instead establish the existence of a novel equilibrium solution, namely ‘<span><math><mi>μ</mi></math></span>-connected equilibrium cycle’ (<span><math><mi>μ</mi></math></span>-EC), which captures oscillatory strategic responses at the equilibrium. Unlike the (pure) Nash equilibrium, which defines a fixed point of mutual best responses, this is a set-valued solution concept of connected components. This novel equilibrium concept identifies a Cartesian product set of connected action profiles in the continuous action space that satisfies four important properties: stability against external deviations, no external chains, instability against internal deviations, and minimality. We extend a recently introduced solution concept <em>equilibrium cycle</em> to include stability against measure-zero violations and avoid some topological difficulties to propose <span><math><mi>μ</mi></math></span>-EC.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102518"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145516997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian optimization for dynamic pricing and learning 动态定价与学习的贝叶斯优化
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-12 DOI: 10.1016/j.peva.2025.102519
Anush Anand, Pranav Agrawal, Tejas Bodas
{"title":"Bayesian optimization for dynamic pricing and learning","authors":"Anush Anand,&nbsp;Pranav Agrawal,&nbsp;Tejas Bodas","doi":"10.1016/j.peva.2025.102519","DOIUrl":"10.1016/j.peva.2025.102519","url":null,"abstract":"<div><div>Dynamic pricing is the practice of adjusting the selling price of a product to maximize a firm’s revenue by responding to market demand. The literature typically distinguishes between two settings: infinite inventory, where the firm has unlimited stock and time to sell, and finite inventory, where both inventory and selling horizon are limited. In both cases, the central challenge lies in the fact that the demand function — how sales respond to price — is unknown and must be learned from data. Traditional approaches often assume a specific parametric form for the demand function, enabling the use of reinforcement learning (RL) to identify near-optimal pricing strategies. However, such assumptions may not hold in real-world scenarios, limiting the applicability of these methods.</div><div>In this work, we propose a Gaussian Process (GP) based nonparametric approach to dynamic pricing that avoids restrictive modeling assumptions. We treat the demand function as a black-box function of the price and develop pricing algorithms based on Bayesian Optimization (BO)—a sample-efficient method for optimizing unknown functions. We present BO-based algorithms tailored for both infinite and finite inventory settings and provide regret guarantees for both regimes, thereby quantifying the learning efficiency of our methods. Through extensive experiments, we demonstrate that our BO-based methods outperform several state-of-the-art RL algorithms in terms of revenue, while requiring fewer assumptions and offering greater robustness. This highlights Bayesian Optimization as a powerful and practical tool for dynamic pricing in complex, uncertain environments.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102519"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145568452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing approximations in the ASIP tandem queue 比较ASIP串联队列中的近似
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-04 DOI: 10.1016/j.peva.2025.102523
Wesley Geelen , Maria Vlasiou , Yaron Yeger
{"title":"Comparing approximations in the ASIP tandem queue","authors":"Wesley Geelen ,&nbsp;Maria Vlasiou ,&nbsp;Yaron Yeger","doi":"10.1016/j.peva.2025.102523","DOIUrl":"10.1016/j.peva.2025.102523","url":null,"abstract":"<div><div>The Asymmetric Inclusion Process (ASIP) models unidirectional transport with particle clustering, yet remains analytically intractable for systems beyond small sizes. To address this, we develop two approximation methods: the replica mean-field (RMF) limit, providing a first-order approximation, and the power series algorithm (PSA), a numerical scheme based on traffic intensity expansions. We evaluate these approximations against Monte Carlo simulations for general systems and prior exact results for homogeneous ASIP systems. Both methods yield accurate estimates, with PSA closely matching simulations for both homogeneous and heterogeneous systems, while RMF performing well for early sites but being slightly impacted downstream or as load increases. These approximations offer practical and computationally efficient alternatives to simulation, enabling detailed performance analysis of ASIP tandem queues where exact solutions are unavailable.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102523"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145466620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing resource allocation for geographically-distributed inference by large language models 基于大型语言模型的地理分布推理资源优化分配
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-08 DOI: 10.1016/j.peva.2025.102527
Tingyang Sun , Ting He , Bo Ji , Parimal Parag
{"title":"Optimizing resource allocation for geographically-distributed inference by large language models","authors":"Tingyang Sun ,&nbsp;Ting He ,&nbsp;Bo Ji ,&nbsp;Parimal Parag","doi":"10.1016/j.peva.2025.102527","DOIUrl":"10.1016/j.peva.2025.102527","url":null,"abstract":"<div><div>Large language models (LLMs) have demonstrated extraordinary performance in many artificial intelligence (AI) tasks but are expensive to use, even after training, due to their requirement of high-end GPUs. Recently, a distributed system called PETALS was developed to lower the barrier for deploying LLMs by splitting the model blocks across multiple servers with low-end GPUs distributed over the Internet, which was much faster than swapping the model parameters between the GPU memory and other cheaper but slower local storage media. However, the performance of such a distributed system critically depends on the resource allocation, and how to do so optimally remains unknown. In this work, we present the first systematic study of the resource allocation problem in distributed LLM inference, with focus on two important decisions: block placement and request routing. Our main results include: (i) experimentally validated performance models that can predict the inference performance under given block placement and request routing decisions, (ii) a formulation of the offline optimization of block placement and request routing as a mixed integer linear programming (MILP) problem together with the NP-hardness proof and a polynomial-complexity algorithm with guaranteed performance, and (iii) an adaptation of the offline algorithm for the online setting with the same performance guarantee under bounded load. Through both experiments and experimentally-validated simulations, we have verified that the proposed solution can substantially reduce the inference time compared to the state-of-the-art solution in diverse settings with geographically-distributed servers. As a byproduct, we have also developed a light-weighted CPU-only simulator capable of predicting the performance of distributed LLM inference on GPU servers, which can evaluate large deployments and facilitate future research for researchers with limited GPU access.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102527"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145517000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing asymptotically optimal policies for continuous-time weakly coupled MDPs 连续时间弱耦合mdp的渐近最优策略设计
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-05 DOI: 10.1016/j.peva.2025.102528
Matthieu Perbal, Balakrishna Prabhu, Ina Maria Verloop
{"title":"Designing asymptotically optimal policies for continuous-time weakly coupled MDPs","authors":"Matthieu Perbal,&nbsp;Balakrishna Prabhu,&nbsp;Ina Maria Verloop","doi":"10.1016/j.peva.2025.102528","DOIUrl":"10.1016/j.peva.2025.102528","url":null,"abstract":"<div><div>We study the continuous-time Weakly Coupled Markov Decision Process (WCMDP), a class of decision problems involving multiple interacting Markov processes (or “arms”) subject to shared resource constraints. We present a general framework for policy design using a combination of an underlying Markov process and a sequence of mappings. Our main theoretical result establishes sufficient conditions on the Markov process and mapping defining the policy, such that it is asymptotically optimal as the number of arms grows.</div><div>We construct both deterministic and randomized policies based on a solution to a linear program (LP). These policies initially assign actions to arms — either proportionally (deterministic) or randomly — based on conditional measures derived from the LP. As this initial allocation may violate feasibility constraints, we introduce a mapping to enforce the resource constraints are satisfied. Finally, we numerically evaluate and compare the performance of our proposed policies, both deterministic and randomized, under different choices of mappings.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102528"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145517052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User equilibria in heterogeneous discriminatory processor sharing queues 异构歧视性处理器共享队列中的用户均衡
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-09-24 DOI: 10.1016/j.peva.2025.102510
Dieter Fiems , Balakrishna J. Prabhu
{"title":"User equilibria in heterogeneous discriminatory processor sharing queues","authors":"Dieter Fiems ,&nbsp;Balakrishna J. Prabhu","doi":"10.1016/j.peva.2025.102510","DOIUrl":"10.1016/j.peva.2025.102510","url":null,"abstract":"<div><div>We consider a strategic routing game for a two-class discriminatory processor-sharing queue with an additional cost for joining the premium class. We show that, depending on the specific parameters of the system, various equilibria can coexist, including equilibria where the queueing system is not ergodic for the equilibrium traffic split. We also investigate how the server can select the priority of the classes and the fees charged to the customers to maximise its revenue. We then investigate learning strategies that converge to particular equilibria. Finally, we study how the elasticity of the traffic demand affects the equilibrium solutions.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102510"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The last, the least, and the urgent: Fluid modeling and performance equivalence for scheduling policies in partial service queues with abandonment 最后,最少,也是最紧迫的:放弃部分服务队列中调度策略的流体建模和性能等效
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-08 DOI: 10.1016/j.peva.2025.102517
Andres Ferragut, Diego Goldsztajn, Fernando Paganini
{"title":"The last, the least, and the urgent: Fluid modeling and performance equivalence for scheduling policies in partial service queues with abandonment","authors":"Andres Ferragut,&nbsp;Diego Goldsztajn,&nbsp;Fernando Paganini","doi":"10.1016/j.peva.2025.102517","DOIUrl":"10.1016/j.peva.2025.102517","url":null,"abstract":"<div><div>In several queueing systems, arriving tasks or customers have both service and timing requirements, the latter expressed as a deadline for the task to be served. These systems with customer abandonment have a long and rich history in queueing theory, and have several applications in task scheduling in computer systems, operations research problems, etc. A common feature in all of these works is that they deal with customers reneging from the system only while in the queue, and not during service. However, in several applications, customers may also leave during service, and the partial work performed by the system during their stay is still useful.</div><div>In this paper we analyze these partial service queues with abandonment in a many-server setting, characterizing the equilibrium performance of several policies in terms of the amount of service attained by tasks. For this purpose, we develop fluid models with two-dimensional independent variables, corresponding to service and sojourn times, which take the form of partial differential equations expressed in weak form. These fluid models allow us to consider general and possibly correlated service and timing requirements, as well as a wide range of service disciplines. In particular, we focus on Earliest-Deadline-First, Least-Attained-Service and Last-Come-First-Served, and establish that all three policies have the same equilibrium performance, even though the latter two do not need any information about deadlines. This striking property means that designers may avoid the difficult job of estimating deadlines without incurring a performance penalty. The fluid model conclusions are validated by extensive numerical experiments.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102517"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145568453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASIP tandem queues with Lévy input and consumption 带有lims输入和消耗的ASIP串联队列
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-10-27 DOI: 10.1016/j.peva.2025.102513
Onno Boxma , Offer Kella , Jacques Resing
{"title":"ASIP tandem queues with Lévy input and consumption","authors":"Onno Boxma ,&nbsp;Offer Kella ,&nbsp;Jacques Resing","doi":"10.1016/j.peva.2025.102513","DOIUrl":"10.1016/j.peva.2025.102513","url":null,"abstract":"<div><div>We consider an ASIP (asymmetric inclusion process) tandem queue, in which the first queue receives a fluid input according to a nondecreasing Lévy process. Each queue has a gate that opens after independent, exponentially distributed periods for an infinitesimal amount of time, allowing the queue content to move to the next queue. In addition, again at independent exponentially distributed instants, a fixed fraction of a queue content is removed from the system.</div><div>For this model, restricting ourselves to steady state, we obtain the following results. (i) We derive the buffer content distribution of the first queue. (ii) For the 2-queue model, we obtain relatively simple explicit expressions for the Laplace transform of the joint buffer content in several special cases. (iii) Asymptotic results are obtained for the 2-queue model when the above-mentioned buffer content removal process approaches a shot-noise process. (iv) For the general <span><math><mi>n</mi></math></span>-queue case, we show how all moments of the buffer contents at all queues can be obtained. (v) For the general <span><math><mi>n</mi></math></span>-queue case, we sketch an approximation method that allows one in principle to derive tractable expressions for the Laplace transform of the buffer content at each queue, with exact mean buffer contents at all queues.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102513"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145417138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Switching constrained OCO with predictions and feedback delays 具有预测和反馈延迟的切换约束OCO
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-05 DOI: 10.1016/j.peva.2025.102524
Weici Pan, Zhenhua Liu
{"title":"Switching constrained OCO with predictions and feedback delays","authors":"Weici Pan,&nbsp;Zhenhua Liu","doi":"10.1016/j.peva.2025.102524","DOIUrl":"10.1016/j.peva.2025.102524","url":null,"abstract":"<div><div>We examine Online Convex Optimization (OCO) problems with feedback delay and a strict limit on decision switching, which exists in applications such as smart grid and learning. Existing algorithms developed for traditional OCO struggle in this setting, often violating switching constraints or incurring high regrets, as evidenced by simulations. In this paper, we establish a new algorithm, Follow-the-Maximally-Coupled-Latest-Leader (FMCLL), achieving a near-optimal regret of <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mi>T</mi><mo>/</mo><mi>S</mi><mo>)</mo></mrow></mrow></math></span> for such problems with delayed feedbacks and a bound of <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mi>T</mi><mo>/</mo><mi>S</mi><mo>−</mo><mi>τ</mi><mo>)</mo></mrow></mrow></math></span> for problems with predictions of <span><math><mi>τ</mi></math></span> rounds, even though the player is only allowed to move at most <span><math><mi>S</mi></math></span> times in expectation across <span><math><mi>T</mi></math></span> rounds. FMCLL meets performance bounds in scenarios with delays and predictions by using maximal coupling sampling to inform algorithm design for switching-constrained problems. To better apply our framework to practical applications, we also extend the algorithm and results to the bandit feedback setting. Simulations demonstrate FMCLL’s superiority over traditional Gradient Descent or Follow-the-Leader algorithms, excelling under adversarial or stochastic losses and reducing constraint violations.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102524"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145517001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLMEmu: A lightweight performance emulator for high-fidelity distributed LLM training LLMEmu:用于高保真分布式LLM训练的轻量级性能模拟器
IF 0.8 4区 计算机科学
Performance Evaluation Pub Date : 2025-11-01 Epub Date: 2025-11-12 DOI: 10.1016/j.peva.2025.102526
Siyuan Yang , Enda Yu , Pingjing Lu , Dezun Dong
{"title":"LLMEmu: A lightweight performance emulator for high-fidelity distributed LLM training","authors":"Siyuan Yang ,&nbsp;Enda Yu ,&nbsp;Pingjing Lu ,&nbsp;Dezun Dong","doi":"10.1016/j.peva.2025.102526","DOIUrl":"10.1016/j.peva.2025.102526","url":null,"abstract":"<div><div>The prohibitive cost of training trillion-parameter large language models (LLMs) necessitates low-cost emulation tools for distributed system optimization. In modern large-scale clusters, communication often becomes the primary bottleneck to scalability. However, existing emulators, such as vTrain and ASTRA-Sim, overlook dynamic network factors that significantly impact performance at scale, resulting in limited emulation accuracy. This work offers an efficient and reliable tool for training system optimization and parallel strategy exploration, considerably lowering the barrier to large-scale AI research. We present LLMEmu, a distributed training emulator that combines real kernel profiling and actual communication execution. First, computation is profiled through real CUDA kernel traces on GPU nodes to construct an operator-level latency lookup table, enabling GPU-like execution on CPU clusters. Second, inter-node communication is executed using communication library primitives (e.g., AllReduce, Send/Recv), triggered by communication anchors embedded in the execution graph, and implemented using a pluggable communication backend. LLMEmu can seamlessly model hybrid parallelism strategies and supports multiple collective algorithms. Its lightweight design incorporates gradient bucketing with latency reuse to minimize overhead while maintaining extensibility to various network interconnects. The effectiveness of LLMEmu is validated through its performance results, demonstrating an average prediction error of only 2.17% on 24-GPU clusters, which outperforms vTrain by 21.09%, and confirming its scalability in modeling training cost distributions across 128-node CPU emulations under varying network conditions.</div></div>","PeriodicalId":19964,"journal":{"name":"Performance Evaluation","volume":"170 ","pages":"Article 102526"},"PeriodicalIF":0.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145516998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书