ETRI Journal最新文献

筛选
英文 中文
2024 Reviewer List
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2025-02-27 DOI: 10.4218/etr2.70008
{"title":"2024 Reviewer List","authors":"","doi":"10.4218/etr2.70008","DOIUrl":"https://doi.org/10.4218/etr2.70008","url":null,"abstract":"<p>A, Ashwini, Vel Tech Rangarajan Dr Sagunthala R&D Institute of Science and Technology</p><p>A, Revathi, SASTRA Deemed University</p><p>A, UMAMAGESWARI, SRM University - Ramapuram Campus</p><p>Abd El-Hafeez, Tarek, Minia University</p><p>Abd Rahman, Mohd Amiruddin, Universiti Putra Malaysia</p><p>Abdi, Asad, University of Derby</p><p>Abdullah, Hadeel, University of Technology</p><p>Abebe, Abiy, Addis Ababa Institute of Technology</p><p>Adewunmi, Mary, National Center for Technology Management</p><p>Afify, Heba M., Higher Inst. of Engineering in Shorouk Academy</p><p>Ahmad, Mushtaq, Nanjing University of Aeronautics and Astronautics</p><p>Ahmed, Suhaib, Baba Ghulam Shah Badshah University</p><p>Ahn, Sungsoo, Gyeongsang National University</p><p>Akbar, Son, Universitas Ahmad Dahlan</p><p>Akhriza, Tubagus, Kampus STIMATA</p><p>Akoushideh, Alireza, Technical and Vocational University</p><p>Al-Araji, Ahmed S., University of technology - Iraq</p><p>Al-Azzoni, Issam, Al Ain University</p><p>Alfaverh, Fayiz, University of Hertfordshire</p><p>alghanimi, abdulhameed, Middle Technical Univ.</p><p>Ali, Dia M, Ninevah University</p><p>ali, Tariq, PMAS Arid Agriculture university</p><p>Alikhani, Nasim,</p><p>Al-Kaltakchi, Musab T. S., Mustansiriyah University</p><p>Al-kaltakchi, Musab, Mustansiriyah University</p><p>Alkinoon, Mohammed, University of Central Florida</p><p>Al-masni, Mohammed A., Sejong University</p><p>Al-Sakkaf, Ahmed Gaafar, Universidad Carlos III de Madrid Escuela Politécnica Superior</p><p>Ansarian, Sasan,</p><p>Arora, Shashank, SUNY</p><p>Asgher, Umer, National University of Sciences and Technology</p><p>Ashraf, Umer, NIT Srinagar</p><p>atashbar, mahmoud, Azarbaijan Shahid Madani University,</p><p>Atrey, Pradeep, State University of New York</p><p>Azim, Rezaul, University of Chittagong</p><p>B, Srinivas, Maharaj Vijayaram Gajapathi Ram College of Engineering</p><p>Baek, Donghyun, Chung-Ang University</p><p>Baek, Hoki, Kyungpook National University</p><p>Balbinot, Alexandre, Universidade Federal do Rio Grande do Sul</p><p>BANDI, SUDHEER, Panimalar Engineering College</p><p>Baranwal, Alok, NIT-Durgapur</p><p>Baydargil, Husnu Baris, Institute for Basic Science</p><p>Beniwal, Ruby, Jaypee Institute of Information Technology</p><p>Benrabah, Abdeldjabar,</p><p>Bhattacharya, Ratnadeep, The George Washington University</p><p>Bhowmik, Showmik, Ghani Khan Choudhury Institute of Engineering and Technology</p><p>Bonthagorla, Praveen Kumar, National Institute of Technology Goa</p><p>Byun, Gangil, UNIST</p><p>Byun, Hayoung, Myongji University</p><p>C, Arunkumar Madhuvappan, Vinayaka Mission's Kirupananda Variyar Engineering College</p><p>Callou, G., UFRPE</p><p>Cammarasana, Simone, CNR-IMATI</p><p>Castillo-Soria, Francisco, Universidad Autónoma de San Luis Potosí</p><p>Ceberio, Josu, University of the Basque Country</p><p>Cha, Ho-Young, Hongik University</p><p>Chabir, Karim, ENIG</p><p>Chaudhary, Girdhari, Jeonbuk National University</p><p>Che, Ren","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 1","pages":"167-171"},"PeriodicalIF":1.3,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etr2.70008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143497340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of IPv6 routing attacks using ANN and a novel IoT dataset 利用 ANN 和新型物联网数据集检测 IPv6 路由攻击
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2025-02-11 DOI: 10.4218/etrij.2023-0506
Murat Emeç
{"title":"Detection of IPv6 routing attacks using ANN and a novel IoT dataset","authors":"Murat Emeç","doi":"10.4218/etrij.2023-0506","DOIUrl":"https://doi.org/10.4218/etrij.2023-0506","url":null,"abstract":"<p>The Internet of Things (IoT) is an intelligent network paradigm created by interconnected device networks. Although the importance of IoT systems has increased in various applications, the increasing number of connected devices has made security even more critical. This study presents the ROUT-4-2023 dataset, which represents a step toward the security of IoT networks. This dataset simulates potential attacks on RPL-based IoT networks and provides a new platform for researchers in this field. Using artificial intelligence and machine-learning techniques, a performance evaluation was performed on four different artificial neural network models (convolutional neural network, deep neural network, multilayer perceptron structure, and routing attack detection-fed forward neural network [RaD-FFNN]). The results show that the RaD-FFNN model has high accuracy, precision, and retrieval rates, indicating that it can be used as an effective tool for the security of IoT networks. This study contributes to the protection of IoT networks from potential attacks by presenting ROUT-4-2023 and RaD-FFNN models, which will lead to further research on IoT security.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"350-361"},"PeriodicalIF":1.3,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0506","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Peak-to-average power ratio reduction of orthogonal frequency division multiplexing signals using improved salp swarm optimization-based partial transmit sequence model
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2025-01-21 DOI: 10.4218/etrij.2023-0347
Vandana Tripathi, Prabhat Patel, Prashant Kumar Jain, Shailja Shukla
{"title":"Peak-to-average power ratio reduction of orthogonal frequency division multiplexing signals using improved salp swarm optimization-based partial transmit sequence model","authors":"Vandana Tripathi,&nbsp;Prabhat Patel,&nbsp;Prashant Kumar Jain,&nbsp;Shailja Shukla","doi":"10.4218/etrij.2023-0347","DOIUrl":"https://doi.org/10.4218/etrij.2023-0347","url":null,"abstract":"<p>Several peak-to-average power ratio (PAPR) reduction methods have been used in orthogonal frequency division multiplexing (OFDM) applications. Among the available methods, partial transmit sequence (PTS) is an efficient PAPR reduction method but can be computationally expensive while determining optimal phase factors (OPFs). Therefore, an optimization algorithm, namely, the improved salp swarm optimization algorithm (ISSA), is incorporated with the PTS to reduce the PAPR of the OFDM signals with limited computational cost. The ISSA includes a dynamic weight element and Lévy flight process to improve the global exploration ability of the optimization algorithm and to control the global and local search ability of the population with a better convergence rate. Three evaluation measures, namely, the complementary cumulative distribution function (CCDF), bit error rate (BER), and symbol error rate (SER), demonstrate the efficacy of the PTS-ISSA model, which achieves a lower PAPR of 3.47 dB and is superior to other optimization algorithms using the PTS method.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"256-269"},"PeriodicalIF":1.3,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0347","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143836441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A graph neural network model application in point cloud structure for prolonged sitting detection system based on smartphone sensor data
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2025-01-21 DOI: 10.4218/etrij.2023-0190
Mardi Hardjianto, Jazi Eko Istiyanto, A. Min Tjoa, Arfa Shaha Syahrulfath, Satriawan Rasyid Purnama, Rifda Hakima Sari, Zaidan Hakim, M. Ridho Fuadin, Nias Ananto
{"title":"A graph neural network model application in point cloud structure for prolonged sitting detection system based on smartphone sensor data","authors":"Mardi Hardjianto,&nbsp;Jazi Eko Istiyanto,&nbsp;A. Min Tjoa,&nbsp;Arfa Shaha Syahrulfath,&nbsp;Satriawan Rasyid Purnama,&nbsp;Rifda Hakima Sari,&nbsp;Zaidan Hakim,&nbsp;M. Ridho Fuadin,&nbsp;Nias Ananto","doi":"10.4218/etrij.2023-0190","DOIUrl":"https://doi.org/10.4218/etrij.2023-0190","url":null,"abstract":"<p>The prolonged sitting inherent in modern work and study environments poses significant health risks, necessitating effective monitoring solutions. Traditional human activity recognition systems often fall short in these contexts owing to their reliance on structured data, which may fail to capture the complexity of human movements or accommodate the often incomplete or unstructured nature of healthcare data. To address this gap, our study introduces a novel application of graph neural networks (GNNs) for detecting prolonged sitting periods using point cloud data from smartphone sensors. Unlike conventional methods, our GNN model excels at processing the unordered, three-dimensional structure of sensor data, enabling more accurate classification of sedentary activities. The effectiveness of our approach is demonstrated by its superior ability to identify sitting, standing, and walking activities—critical for assessing health risks associated with prolonged sitting. By providing real-time activity recognition, our model offers a promising tool for healthcare professionals to mitigate the adverse effects of sedentary behavior.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"290-302"},"PeriodicalIF":1.3,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0190","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143836442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance analysis of wireless-powered cell-free massive multiple-input multiple-output system with spatial correlation in Internet of Things network 物联网网络中具有空间相关性的无线供电无蜂窝大规模多输入多输出系统的性能分析
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2025-01-05 DOI: 10.4218/etrij.2023-0216
Haiyan Wang, Xinmin Li, Yuan Fang, Xiaoqiang Zhang
{"title":"Performance analysis of wireless-powered cell-free massive multiple-input multiple-output system with spatial correlation in Internet of Things network","authors":"Haiyan Wang,&nbsp;Xinmin Li,&nbsp;Yuan Fang,&nbsp;Xiaoqiang Zhang","doi":"10.4218/etrij.2023-0216","DOIUrl":"https://doi.org/10.4218/etrij.2023-0216","url":null,"abstract":"<p>The massive multiple-input multiple-output (mMIMO) approach is promising for the Internet of Things (IoT) owing to its massive connectivity and high data rate. We introduce a wireless-powered cell-free mMIMO system, in which ground IoT devices transmit pilot and uplink information by harvesting downlink power from multiantenna access points. Considering the spatial correlation, we derive closed-form expressions for the average harvested power with a nonlinear energy-harvesting model per IoT device and achievable data rate according to the random matrix theory. The analytical expressions show that spatial correlation has a negative effect on the data rate owing to the increasing interference power. In contrast, the average received power improves with increasing spatial correlation. Simulation results demonstrate that the derived analytical expressions are consistent with results from the Monte Carlo method.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"208-215"},"PeriodicalIF":1.3,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0216","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network function parallelism configuration with segment routing over IPv6 based on deep reinforcement learning 基于深度强化学习的 IPv6 分段路由网络功能并行性配置
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2024-12-11 DOI: 10.4218/etrij.2023-0511
Seokwon Jang, Namseok Ko, Yeunwoong Kyung, Haneul Ko, Jaewook Lee, Sangheon Pack
{"title":"Network function parallelism configuration with segment routing over IPv6 based on deep reinforcement learning","authors":"Seokwon Jang,&nbsp;Namseok Ko,&nbsp;Yeunwoong Kyung,&nbsp;Haneul Ko,&nbsp;Jaewook Lee,&nbsp;Sangheon Pack","doi":"10.4218/etrij.2023-0511","DOIUrl":"https://doi.org/10.4218/etrij.2023-0511","url":null,"abstract":"<p>Network function parallelism (NFP) has gained attention for processing packets in parallel through service functions arranged in the required service function chain. While parallel processing efficiently reduces the service function chaining (SFC) completion time, it incurs a higher network overhead (e.g., network congestion) to replicate various packets for processing. To reduce the SFC completion time while maintaining a low network overhead, we propose a deep-reinforcement-learning-based NFP algorithm (DeepNFP) that provides an SFC processing policy to determine the sequential or parallel processing of every service function. In DeepNFP, deep reinforcement learning captures the network dynamics and service function conditions and iteratively finds the SFC processing policy in the network environment. Furthermore, an SFC data plane protocol based on segment routing over IPv6 configures and operates the policy in the SFC data network. Evaluation results show that DeepNFP can achieve 46% of the SFC completion time and 66% of the network overhead compared with conventional SFC and NFP, respectively.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 2","pages":"278-289"},"PeriodicalIF":1.3,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0511","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators” 更正“NEST-C:一个用于具有人工智能加速器的异构计算系统的深度学习编译器框架”
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2024-12-08 DOI: 10.4218/etr2.12748
Jeman Park, Misun Yu, Jinse Kwon, Junmo Park, Jemin Lee, Yongin Kwon
{"title":"Correction to “NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators”","authors":"Jeman Park,&nbsp;Misun Yu,&nbsp;Jinse Kwon,&nbsp;Junmo Park,&nbsp;Jemin Lee,&nbsp;Yongin Kwon","doi":"10.4218/etr2.12748","DOIUrl":"https://doi.org/10.4218/etr2.12748","url":null,"abstract":"<p>NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators</p><p>https://doi.org/10.4218/etrij.2024-0139</p><p>ETRI Journal, Volume 46, Issue 5, October 2024, pp. 851–864.</p><p>In the article entitled “NEST-C: A deep learning compiler framework for heterogeneous computing systems with artificial intelligence accelerators,” the authors would like to correct the funding information of their article. It should be written as follows:</p><p><b>Funding information</b> This study is supported by a grant from the Institute of Information &amp; Communications Technology Planning &amp; Evaluation (IITP), funded by the Korean government (MSIT) (No. RS-2023-00277060, Development of OpenEdge AI SoC hardware and software platform and No. 2018-0-00769, Neuromorphic Computing Software Platform for Artificial Intelligence Systems).</p><p>The authors would like to apologize for the inconvenience caused.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"1126"},"PeriodicalIF":1.3,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etr2.12748","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “Low-complexity patch projection method for efficient and lightweight point-cloud compression” 对 "用于高效、轻量级点云压缩的低复杂度补丁投影法 "的更正
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2024-11-26 DOI: 10.4218/etr2.12746
{"title":"Correction to “Low-complexity patch projection method for efficient and lightweight point-cloud compression”","authors":"","doi":"10.4218/etr2.12746","DOIUrl":"https://doi.org/10.4218/etr2.12746","url":null,"abstract":"<p><b>Sungryeul Rhyu</b> | <b>Junsik Kim | Gwang Hoon Park | Kyuheon Kim</b></p><p>Low-complexity patch projection method for efficient and lightweight point-cloud compression</p><p>https://doi.org/10.4218/etrij.2023-0242</p><p><i>ETRI Journal</i>, Volume 46, Issue 4, August 2024, pp. 683–696.</p><p>In the article entitled “Low-complexity patch projection method for efficient and lightweight point-cloud compression”, the authors would like to correct the funding information of their article. It should be written as follows:</p><p><b>Funding information</b></p><p>This study was supported by the Information Technology Research Center of the Ministry of Science and ICT, Korea (grant number: IITP-2024-2021-0-02046) and the Institute of Information &amp; Communications Technology Planning &amp; Evaluation, Korea (grant number: RS-2023-00227431, Development of 3D space digital media standard technology).</p><p>The authors would like to apologize for the inconvenience caused.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 6","pages":"1125"},"PeriodicalIF":1.3,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etr2.12746","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142862152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SNN eXpress: Streamlining Low-Power AI-SoC Development With Unsigned Weight Accumulation Spiking Neural Network SNN eXpress:利用无符号权值累积尖峰神经网络简化低功耗 AI-SoC 开发
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2024-10-28 DOI: 10.4218/etrij.2024-0114
Hyeonguk Jang, Kyuseung Han, Kwang-Il Oh, Sukho Lee, Jae-Jin Lee, Woojoo Lee
{"title":"SNN eXpress: Streamlining Low-Power AI-SoC Development With Unsigned Weight Accumulation Spiking Neural Network","authors":"Hyeonguk Jang,&nbsp;Kyuseung Han,&nbsp;Kwang-Il Oh,&nbsp;Sukho Lee,&nbsp;Jae-Jin Lee,&nbsp;Woojoo Lee","doi":"10.4218/etrij.2024-0114","DOIUrl":"https://doi.org/10.4218/etrij.2024-0114","url":null,"abstract":"<p>SoCs with analog-circuit-based unsigned weight-accumulating spiking neural networks (UWA-SNNs) are a highly promising solution for achieving low-power AI-SoCs. This paper addresses the challenges that must be overcome to realize the potential of UWA-SNNs in low-power AI-SoCs: (i) the absence of UWA-SNN learning methods and the lack of an environment for developing applications based on trained SNN models and (ii) the inherent issue of testing and validating applications on the system being nearly impractical until the final chip is fabricated owing to the mixed-signal circuit implementation of UWA-SNN-based SoCs. This paper argues that, by integrating the proposed solutions, the development of an EDA tool that enables the easy and rapid development of UWA-SNN-based SoCs is feasible, and demonstrates this through the development of the SNN eXpress (SNX) tool. The developed SNX automates the generation of RTL code, FPGA prototypes, and a software development kit tailored for UWA-SNN-based application development. Comprehensive details of SNX development and the performance evaluation and verification results of two AI-SoCs developed using SNX are also presented.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"829-838"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0114","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PF-GEMV: Utilization maximizing architecture in fast matrix–vector multiplication for GPT-2 inference PF-GEMV:用于 GPT-2 推理的快速矩阵向量乘法中的利用率最大化架构
IF 1.3 4区 计算机科学
ETRI Journal Pub Date : 2024-10-28 DOI: 10.4218/etrij.2024-0111
Hyeji Kim, Yeongmin Lee, Chun-Gi Lyuh
{"title":"PF-GEMV: Utilization maximizing architecture in fast matrix–vector multiplication for GPT-2 inference","authors":"Hyeji Kim,&nbsp;Yeongmin Lee,&nbsp;Chun-Gi Lyuh","doi":"10.4218/etrij.2024-0111","DOIUrl":"https://doi.org/10.4218/etrij.2024-0111","url":null,"abstract":"<p>Owing to the widespread advancement of transformer-based artificial neural networks, artificial intelligence (AI) processors are now required to perform matrix–vector multiplication in addition to the conventional matrix–matrix multiplication. However, current AI processor architectures are optimized for general matrix–matrix multiplications (GEMMs), which causes significant throughput degradation when processing general matrix–vector multiplications (GEMVs). In this study, we proposed a port-folding GEMV (PF-GEMV) scheme employing multiformat and low-precision techniques while reusing an outer product-based processor optimized for conventional GEMM operations. This approach achieves 93.7% utilization in GEMV operations with an 8-bit format on an 8 \u0000<span></span><math>\u0000 <mo>×</mo></math> 8 processor, thus resulting in a 7.5 \u0000<span></span><math>\u0000 <mo>×</mo></math> increase in throughput compared with that of the original scheme. Furthermore, when applied to the matrix operation of the GPT-2 large model, an increase in speed by 7 \u0000<span></span><math>\u0000 <mo>×</mo></math> is achieved in single-batch inferences.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"46 5","pages":"817-828"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2024-0111","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142525397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信