Dejan Dodić, Dušan Regodić
MB University International Review, published 2024-07-25
DOI: 10.61837/mbuir020124174d
ANALYSIS OF THE EFFICIENCY OF GPT-2 MODEL APPLICATION WITH ADAPTED TRANSFER LEARNING ON VARIOUS HARDWARE ARCHITECTURES
This paper analyses the efficiency of implementing the GPT-2 model, one of the advanced artificial-intelligence models for text generation, through adapted transfer learning, with particular focus on the utilization of various GPU architectures. The primary goal of this research is to examine the impact of adapted transfer learning on the performance of the GPT-2 model across various GPU architectures, assessing how different GPU capabilities enhance or constrain the model's efficiency. The work relies on an experimental method to evaluate and compare the model's performance in terms of accuracy, processing speed, and energy efficiency on each of the tested platforms. Special attention is given to analysing how characteristics of the hardware architectures, such as processing power and memory capacity, affect the efficiency of the transfer-learning process. This study provides important insights into the potential for optimizing the GPT-2 model for specific hardware platforms, which is crucial for its application in a wide range of real-world scenarios. The results offer valuable information for researchers in artificial intelligence and machine learning, providing a foundation for further development and improvement of AI technologies.
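The cross-platform evaluation described above (comparing processing speed across GPU architectures) can be illustrated with a minimal benchmarking harness. This is a hedged sketch, not the authors' code: the `generate` callable is a hypothetical stand-in for a real GPT-2 inference call (for example, via the Hugging Face `transformers` library), and in a real experiment the energy-efficiency figure would come from a GPU power-monitoring tool such as `nvidia-smi` rather than from Python alone.

```python
import time

def benchmark(generate, prompt, n_runs=5):
    """Time a text-generation callable and report throughput.

    `generate` is assumed to return the number of tokens it produced;
    in an actual experiment it would wrap GPT-2 inference on the GPU
    under test, so tokens/second becomes comparable across platforms.
    """
    total_tokens = 0
    start = time.perf_counter()
    for _ in range(n_runs):
        total_tokens += generate(prompt)
    elapsed = time.perf_counter() - start
    return {
        "tokens": total_tokens,
        "seconds": elapsed,
        "tokens_per_sec": total_tokens / elapsed,
    }

# Dummy stand-in for GPT-2 inference: pretend each call emits 50 tokens.
def dummy_generate(prompt):
    time.sleep(0.01)  # simulate compute latency
    return 50

result = benchmark(dummy_generate, "Once upon a time")
print(f"{result['tokens']} tokens in {result['seconds']:.3f}s")
```

Running the same harness with the same prompt set on each GPU architecture would yield directly comparable tokens-per-second figures, one axis of the efficiency comparison the abstract describes.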