Detection and Classification of ChatGPT Generated Contents Using Deep Transformer Models

Sushma D S, Pooja C N, Varsha H S, Yasir Hussain, P Yashash
{"title":"Detection and Classification of ChatGPT Generated Contents Using Deep Transformer Models","authors":"Sushma D S, Pooja C N, Varsha H S, Yasir Hussain, P Yashash","doi":"10.47392/irjaeh.2024.0193","DOIUrl":null,"url":null,"abstract":"AI advancements, particularly in neural networks, have brought about groundbreaking tools like text generators and chatbots. While these technologies offer tremendous benefits, they also pose serious risks such as privacy breaches, spread of misinformation, and challenges to academic integrity. Previous efforts to distinguish between human and AI-generated text have been limited, especially with models like ChatGPT. To tackle this, we created a dataset containing both human and ChatGPT-generated text, using it to train and test various machine and deep learning models. Your results, particularly the high F1-score and accuracy achieved by the RoBERTa-based custom deep learning model and Distil BERT, indicate promising progress in this area. By establishing a robust baseline for detecting and classifying AI-generated content, your work contributes significantly to mitigating potential misuse of AI-powered text generation tools.","PeriodicalId":517766,"journal":{"name":"International Research Journal on Advanced Engineering Hub (IRJAEH)","volume":"18 10","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Research Journal on Advanced Engineering Hub (IRJAEH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47392/irjaeh.2024.0193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

AI advancements, particularly in neural networks, have brought about groundbreaking tools like text generators and chatbots. While these technologies offer tremendous benefits, they also pose serious risks such as privacy breaches, spread of misinformation, and challenges to academic integrity. Previous efforts to distinguish between human and AI-generated text have been limited, especially with models like ChatGPT. To tackle this, we created a dataset containing both human and ChatGPT-generated text, using it to train and test various machine and deep learning models. Your results, particularly the high F1-score and accuracy achieved by the RoBERTa-based custom deep learning model and Distil BERT, indicate promising progress in this area. By establishing a robust baseline for detecting and classifying AI-generated content, your work contributes significantly to mitigating potential misuse of AI-powered text generation tools.
使用深度变换器模型检测和分类 ChatGPT 生成的内容
人工智能的进步,尤其是神经网络的进步,带来了文本生成器和聊天机器人等突破性工具。这些技术在带来巨大好处的同时,也带来了严重的风险,如隐私泄露、错误信息传播以及对学术诚信的挑战。以往区分人类文本和人工智能生成文本的努力非常有限,尤其是像 ChatGPT 这样的模型。为了解决这个问题,我们创建了一个包含人类文本和 ChatGPT 生成文本的数据集,用它来训练和测试各种机器学习和深度学习模型。您的结果,尤其是基于 RoBERTa 的定制深度学习模型和 Distil BERT 所取得的高 F1 分数和准确率,表明我们在这一领域取得了可喜的进展。通过建立检测和分类人工智能生成内容的稳健基线,你们的工作为减少人工智能驱动的文本生成工具的潜在滥用做出了重大贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信