{"title":"Classification of human-written and AI-generated sentences using a hybrid CNN-GRU model optimized by the spotted hyena algorithm","authors":"Mahmoud Ragab , Ehab Bahaudien Ashary , Faris Kateb , Abeer Hakeem , Rayan Mosli , Nasser N. Albogami , Sameer Nooh","doi":"10.1016/j.aej.2025.04.071","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid advancement of artificial intelligence (AI) in generating human-like text poses significant challenges in distinguishing between human-written and AI-generated content. Recent advancements in natural language generation have significantly enhanced the quality and variety of AI-generated text, making it almost indistinguishable from human-written content. ChatGPT, a popular AI model, belongs to the generative pre-trained transformer family. While human content is created with a clear intent to convey meaning, AI-generated text aims to replicate human-like language. Classifying human-written and AI-generated sentences is crucial for addressing issues like fake news, plagiarism, and spamming. AI text often follows repetitive patterns, while human writing is more creative and original, making detection significant for combating misinformation. Therefore, this study proposes to classify human-written and AI-generated sentences using a hybrid CNN-GRU model optimized by the Spotted Hyena Algorithm (CHWAIG-DLSHO) approach. The approach involves preprocessing text data through tokenization, lemmatization, and data splitting, followed by word embedding using Latent Dirichlet Allocation (LDA). A hybrid convolutional neural network (CNN) and gated recurrent unit (GRU) model is employed for sentence classification. The spotted hyena optimizer (SHO) model is utilized to fine-tune the hyperparameters of the CNN-GRU model, enhancing its performance. The analysis of the CHWAIG-DLSHO method takes place utilizing AI vs. human text dataset. The performance validation of the CHWAIG-DLSHO method portrayed a superior accuracy value of 99.17 % over existing techniques.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"126 ","pages":"Pages 116-130"},"PeriodicalIF":6.2000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825005666","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid advancement of artificial intelligence (AI) in generating human-like text poses significant challenges in distinguishing between human-written and AI-generated content. Recent advancements in natural language generation have significantly enhanced the quality and variety of AI-generated text, making it almost indistinguishable from human-written content. ChatGPT, a popular AI model, belongs to the generative pre-trained transformer family. While human content is created with a clear intent to convey meaning, AI-generated text aims to replicate human-like language. Classifying human-written and AI-generated sentences is crucial for addressing issues like fake news, plagiarism, and spamming. AI text often follows repetitive patterns, while human writing is more creative and original, making detection significant for combating misinformation. Therefore, this study proposes to classify human-written and AI-generated sentences using a hybrid CNN-GRU model optimized by the Spotted Hyena Algorithm (CHWAIG-DLSHO) approach. The approach involves preprocessing text data through tokenization, lemmatization, and data splitting, followed by word embedding using Latent Dirichlet Allocation (LDA). A hybrid convolutional neural network (CNN) and gated recurrent unit (GRU) model is employed for sentence classification. The spotted hyena optimizer (SHO) model is utilized to fine-tune the hyperparameters of the CNN-GRU model, enhancing its performance. The analysis of the CHWAIG-DLSHO method takes place utilizing AI vs. human text dataset. The performance validation of the CHWAIG-DLSHO method portrayed a superior accuracy value of 99.17 % over existing techniques.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering