研究出版物中人工智能生成文本的检测框架

Paria Sarzaeim, Aarya Mayurpalsingh Doshi, Qusay H. Mahmoud
{"title":"研究出版物中人工智能生成文本的检测框架","authors":"Paria Sarzaeim, Aarya Mayurpalsingh Doshi, Qusay H. Mahmoud","doi":"10.58190/icat.2023.28","DOIUrl":null,"url":null,"abstract":"The use of generative artificial intelligence is becoming increasingly prevalent in creating content in various formats such as text, video, and image. However, there is a need to distinguish between content that has been generated by humans and content that has been generated by AI as misuse of these technologies can raise scientific and social challenges. Moreover, there are concerns about the reliability and comprehensiveness of the content generated by AI without human validation. This paper presents a framework for AI-generated text. The prototype implementation of the proposed approach is to train a model using predefined datasets and deploy this model on a cloud-based service to predict whether a text was created by a human or AI. This approach is specifically focused on assessing the accuracy of scientific writings and research papers rather than general text. The proposed framework is compared with recently developed tools such as OpenAI Text Classifier, ZeroGPT, and Turnitin. The results show that training a text classifier can be highly useful in detecting whether a text is written by a human or AI. The source code and dataset are made open source so others can experiment with the prototype implementation and use it for future research.","PeriodicalId":20592,"journal":{"name":"PROCEEDINGS OF THE III INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES IN MATERIALS SCIENCE, MECHANICAL AND AUTOMATION ENGINEERING: MIP: Engineering-III – 2021","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Framework for Detecting AI-Generated Text in Research Publications\",\"authors\":\"Paria Sarzaeim, Aarya Mayurpalsingh Doshi, Qusay H. Mahmoud\",\"doi\":\"10.58190/icat.2023.28\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of generative artificial intelligence is becoming increasingly prevalent in creating content in various formats such as text, video, and image. However, there is a need to distinguish between content that has been generated by humans and content that has been generated by AI as misuse of these technologies can raise scientific and social challenges. Moreover, there are concerns about the reliability and comprehensiveness of the content generated by AI without human validation. This paper presents a framework for AI-generated text. The prototype implementation of the proposed approach is to train a model using predefined datasets and deploy this model on a cloud-based service to predict whether a text was created by a human or AI. This approach is specifically focused on assessing the accuracy of scientific writings and research papers rather than general text. The proposed framework is compared with recently developed tools such as OpenAI Text Classifier, ZeroGPT, and Turnitin. The results show that training a text classifier can be highly useful in detecting whether a text is written by a human or AI. The source code and dataset are made open source so others can experiment with the prototype implementation and use it for future research.\",\"PeriodicalId\":20592,\"journal\":{\"name\":\"PROCEEDINGS OF THE III INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES IN MATERIALS SCIENCE, MECHANICAL AND AUTOMATION ENGINEERING: MIP: Engineering-III – 2021\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PROCEEDINGS OF THE III INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES IN MATERIALS SCIENCE, MECHANICAL AND AUTOMATION ENGINEERING: MIP: Engineering-III – 2021\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.58190/icat.2023.28\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PROCEEDINGS OF THE III INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES IN MATERIALS SCIENCE, MECHANICAL AND AUTOMATION ENGINEERING: MIP: Engineering-III – 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.58190/icat.2023.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

生成式人工智能的使用在创建各种格式的内容(如文本、视频和图像)方面变得越来越普遍。然而,有必要区分人类生成的内容和人工智能生成的内容,因为滥用这些技术可能会带来科学和社会挑战。此外,在没有人工验证的情况下,人工智能生成的内容的可靠性和全面性也令人担忧。本文提出了一个人工智能生成文本的框架。提出的方法的原型实现是使用预定义的数据集训练模型,并将该模型部署在基于云的服务上,以预测文本是由人类还是人工智能创建的。这种方法特别侧重于评估科学著作和研究论文的准确性,而不是一般文本。该框架与最近开发的工具(如OpenAI文本分类器、ZeroGPT和Turnitin)进行了比较。结果表明,训练文本分类器对于检测文本是由人类还是人工智能编写的非常有用。源代码和数据集都是开源的,因此其他人可以尝试原型实现并将其用于未来的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Framework for Detecting AI-Generated Text in Research Publications
The use of generative artificial intelligence is becoming increasingly prevalent in creating content in various formats such as text, video, and image. However, there is a need to distinguish between content that has been generated by humans and content that has been generated by AI as misuse of these technologies can raise scientific and social challenges. Moreover, there are concerns about the reliability and comprehensiveness of the content generated by AI without human validation. This paper presents a framework for AI-generated text. The prototype implementation of the proposed approach is to train a model using predefined datasets and deploy this model on a cloud-based service to predict whether a text was created by a human or AI. This approach is specifically focused on assessing the accuracy of scientific writings and research papers rather than general text. The proposed framework is compared with recently developed tools such as OpenAI Text Classifier, ZeroGPT, and Turnitin. The results show that training a text classifier can be highly useful in detecting whether a text is written by a human or AI. The source code and dataset are made open source so others can experiment with the prototype implementation and use it for future research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信