Development of a Software Suite for Testing Server Hardware

IF 0.4 Q4 PHYSICS, PARTICLES & FIELDS
E. Tsamtsurov, N. Balashov, K. Lukyanov
{"title":"Development of a Software Suite for Testing Server Hardware","authors":"E. Tsamtsurov,&nbsp;N. Balashov,&nbsp;K. Lukyanov","doi":"10.1134/S1547477125700980","DOIUrl":null,"url":null,"abstract":"<p>Testing of server equipment prior to its operation is crucial for ensuring reliable and smooth operation of systems at the Multifunctional Information and Computation Complex of the Joint Institute for Nuclear Research. The main purpose of testing is to identify hidden defects that may arise under critical loads on the equipment. There are various empirical methods described in production standards used to detect equipment failures. The paper presents an automated system for testing server equipment, including automation of system installation, launching tests, and collecting test logs. In the current implementation of the system, testing is carried out using the method of Highly Accelerated Stress Screening (HASS). A key part of the system is the monitoring subsystem required for collecting and analyzing temperature data from the tested components. Temperature metrics analysis during the testing phase allows to determine the duration of testing with a given accuracy. In addition to the monitoring tools such as Node Exporter, Prometheus, Prometheus Gateway and Grafana, the system uses Stress-ng to load the equipment with synthetic tests. All of these subsystems are freely distributed, and the proposed system can be easily implemented for similar testing in comparable infrastructures.</p>","PeriodicalId":730,"journal":{"name":"Physics of Particles and Nuclei Letters","volume":"22 5","pages":"1015 - 1018"},"PeriodicalIF":0.4000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics of Particles and Nuclei Letters","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1134/S1547477125700980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSICS, PARTICLES & FIELDS","Score":null,"Total":0}
引用次数: 0

Abstract

Testing of server equipment prior to its operation is crucial for ensuring reliable and smooth operation of systems at the Multifunctional Information and Computation Complex of the Joint Institute for Nuclear Research. The main purpose of testing is to identify hidden defects that may arise under critical loads on the equipment. There are various empirical methods described in production standards used to detect equipment failures. The paper presents an automated system for testing server equipment, including automation of system installation, launching tests, and collecting test logs. In the current implementation of the system, testing is carried out using the method of Highly Accelerated Stress Screening (HASS). A key part of the system is the monitoring subsystem required for collecting and analyzing temperature data from the tested components. Temperature metrics analysis during the testing phase allows to determine the duration of testing with a given accuracy. In addition to the monitoring tools such as Node Exporter, Prometheus, Prometheus Gateway and Grafana, the system uses Stress-ng to load the equipment with synthetic tests. All of these subsystems are freely distributed, and the proposed system can be easily implemented for similar testing in comparable infrastructures.

Abstract Image

服务器硬件测试软件套件的开发
在运行前对服务器设备进行测试对于确保联合核研究所多功能信息和计算中心系统的可靠和平稳运行至关重要。测试的主要目的是识别在设备的临界负载下可能出现的隐藏缺陷。在生产标准中描述了用于检测设备故障的各种经验方法。介绍了一种用于服务器设备测试的自动化系统,包括系统安装自动化、测试启动自动化和测试日志采集自动化。在目前的系统实施中,使用高加速应力筛选(HASS)方法进行测试。该系统的关键部分是监测子系统,用于采集和分析被测部件的温度数据。测试阶段的温度度量分析允许以给定的精度确定测试的持续时间。除了使用Node出口商、Prometheus、Prometheus Gateway和Grafana等监控工具外,该系统还使用Stress-ng来加载设备进行综合测试。所有这些子系统都是自由分布的,并且所提出的系统可以很容易地在类似的基础设施中实现类似的测试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Physics of Particles and Nuclei Letters
Physics of Particles and Nuclei Letters PHYSICS, PARTICLES & FIELDS-
CiteScore
0.80
自引率
20.00%
发文量
108
期刊介绍: The journal Physics of Particles and Nuclei Letters, brief name Particles and Nuclei Letters, publishes the articles with results of the original theoretical, experimental, scientific-technical, methodological and applied research. Subject matter of articles covers: theoretical physics, elementary particle physics, relativistic nuclear physics, nuclear physics and related problems in other branches of physics, neutron physics, condensed matter physics, physics and engineering at low temperatures, physics and engineering of accelerators, physical experimental instruments and methods, physical computation experiments, applied research in these branches of physics and radiology, ecology and nuclear medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信