Tanzila Kehkashan, Raja Adil Riaz, Ahmad Sami Al-Shamayleh, Adnan Akhunzada, Noman Ali, Muhammad Hamza, Faheem Akbar
{"title":"AI-generated text detection: A comprehensive review of methods, datasets, and applications","authors":"Tanzila Kehkashan, Raja Adil Riaz, Ahmad Sami Al-Shamayleh, Adnan Akhunzada, Noman Ali, Muhammad Hamza, Faheem Akbar","doi":"10.1016/j.cosrev.2025.100793","DOIUrl":null,"url":null,"abstract":"This review examines the rapidly evolving field of AI-generated text detection, which has gained critical importance following the widespread deployment of advanced large language models like ChatGPT. We analyze the technical foundations, methodological approaches, evaluation frameworks, and practical applications of detection technologies designed to distinguish between human and machine-authored content. The paper synthesizes current knowledge across key dimensions: detection techniques ranging from statistical approaches to neural architectures, datasets and their limitations, performance metrics and evaluation challenges, real-world implementations across educational, publishing, and legal domains, and emerging research directions. Our analysis reveals significant challenges, including the inherent adversarial nature of detection, cross-domain generalization difficulties, and fairness concerns regarding certain writer populations. We identify promising trends toward multi-scale analysis, human-AI collaborative frameworks, and complementary provenance-based approaches. The review concludes that effective detection remains feasible but requires combining multiple approaches, domain-specific customization, and attention to ethical implications. This comprehensive examination serves as a resource for researchers, practitioners, and policymakers navigating the complex technical and societal dimensions of AI text detection in an era of increasingly sophisticated generative AI systems.","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":"33 1","pages":"100793"},"PeriodicalIF":12.7000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.cosrev.2025.100793","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This review examines the rapidly evolving field of AI-generated text detection, which has gained critical importance following the widespread deployment of advanced large language models like ChatGPT. We analyze the technical foundations, methodological approaches, evaluation frameworks, and practical applications of detection technologies designed to distinguish between human and machine-authored content. The paper synthesizes current knowledge across key dimensions: detection techniques ranging from statistical approaches to neural architectures, datasets and their limitations, performance metrics and evaluation challenges, real-world implementations across educational, publishing, and legal domains, and emerging research directions. Our analysis reveals significant challenges, including the inherent adversarial nature of detection, cross-domain generalization difficulties, and fairness concerns regarding certain writer populations. We identify promising trends toward multi-scale analysis, human-AI collaborative frameworks, and complementary provenance-based approaches. The review concludes that effective detection remains feasible but requires combining multiple approaches, domain-specific customization, and attention to ethical implications. This comprehensive examination serves as a resource for researchers, practitioners, and policymakers navigating the complex technical and societal dimensions of AI text detection in an era of increasingly sophisticated generative AI systems.
期刊介绍:
Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.