Detection and classification of ChatGPT-generated content using deep transformer models
Mahdi Maktabdar Oghaz, Lakshmi Babu Saheer, Kshipra Dhame, Gayathri Singaram
Frontiers in Artificial Intelligence, vol. 8, article 1458707, published 2025-04-04
DOI: https://doi.org/10.3389/frai.2025.1458707
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12006062/pdf/
Citations: 0
Abstract
Introduction: The rapid advancement of AI, particularly artificial neural networks, has led to revolutionary breakthroughs and applications such as text-generation tools and chatbots. However, this powerful technology also introduces potential for misuse and societal harm, including privacy violations, misinformation, and challenges to integrity and originality in academia. Several studies have attempted to distinguish and classify AI-generated text from human-authored work, but their performance remains questionable, particularly for content produced by large language models such as ChatGPT.
Methods: To address this issue, we compiled a dataset consisting of both human-written and AI-generated (ChatGPT) content. This dataset was then used to train and evaluate a range of machine learning and deep learning models under various training conditions. We assessed the efficacy of different models in detecting and classifying AI-generated content, with a particular focus on transformer-based architectures.
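The abstract does not include the authors' code, but the transformer-based setup it describes can be sketched with the Hugging Face transformers library. The sketch below is a hypothetical, heavily reduced configuration (tiny hidden size, random weights, dummy token ids) meant only to show the shape of a RoBERTa-style binary classifier for human vs. AI-generated text; the paper presumably fine-tunes full pretrained weights on its compiled dataset.

```python
import torch
from transformers import RobertaConfig, RobertaForSequenceClassification

# Hypothetical sketch, not the paper's code: a tiny RoBERTa-style
# sequence classifier with two labels (0 = human-written, 1 = AI-generated).
# Sizes are shrunk for illustration; real use would load pretrained weights
# (e.g. RobertaForSequenceClassification.from_pretrained) and fine-tune them.
config = RobertaConfig(
    vocab_size=50265,        # RoBERTa's standard vocabulary size
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=128,
    num_labels=2,
)
model = RobertaForSequenceClassification(config)

# Dummy batch of token ids standing in for tokenized passages.
input_ids = torch.randint(0, config.vocab_size, (4, 16))
attention_mask = torch.ones_like(input_ids)

with torch.no_grad():
    logits = model(input_ids=input_ids, attention_mask=attention_mask).logits

print(logits.shape)  # one score per class for each of the 4 examples
```

In a real fine-tuning run, these logits would feed a cross-entropy loss against human/AI labels, with the usual train/validation split over the compiled dataset.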
Results: Experimental results demonstrate that the proposed RoBERTa-based custom deep learning model achieved an F1-score of 0.992 and an accuracy of 0.991, followed by DistilBERT, which yielded an F1-score of 0.988 and an accuracy of 0.988. These results indicate exceptional performance in detecting and classifying AI-generated content.
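For reference, the reported accuracy and F1-score follow the standard confusion-matrix definitions. The snippet below computes both from hypothetical counts (the abstract does not give the actual confusion matrix); it illustrates the metric definitions only, not the paper's exact results.

```python
# Standard metric definitions on illustrative, made-up confusion-matrix
# counts (tp/fp/fn/tn are NOT taken from the paper).
def accuracy_f1(tp, fp, fn, tn):
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1

# Example: 1000 test passages, a handful of misclassifications.
acc, f1 = accuracy_f1(tp=495, fp=4, fn=5, tn=496)
print(round(acc, 3), round(f1, 3))  # → 0.991 0.991
```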
Discussion: Our findings establish a robust baseline for the detection and classification of AI-generated textual content. This work marks a significant step toward mitigating the potential misuse of AI-powered text generation tools by providing a reliable approach for distinguishing between human and AI-generated text. Future research could explore the generalizability of these models across different AI-generated content sources and address evolving challenges in AI text detection.