{"title":"Quantitative Analysis of AI-Generated Texts in Academic Research: A Study of AI Presence in Arxiv Submissions using AI Detection Tool","authors":"Arslan Akram","doi":"arxiv-2403.13812","DOIUrl":null,"url":null,"abstract":"Many people are interested in ChatGPT since it has become a prominent AIGC\nmodel that provides high-quality responses in various contexts, such as\nsoftware development and maintenance. Misuse of ChatGPT might cause significant\nissues, particularly in public safety and education, despite its immense\npotential. The majority of researchers choose to publish their work on Arxiv.\nThe effectiveness and originality of future work depend on the ability to\ndetect AI components in such contributions. To address this need, this study\nwill analyze a method that can see purposely manufactured content that academic\norganizations use to post on Arxiv. For this study, a dataset was created using\nphysics, mathematics, and computer science articles. Using the newly built\ndataset, the following step is to put originality.ai through its paces. The\nstatistical analysis shows that Originality.ai is very accurate, with a rate of\n98%.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"87 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.13812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many people are interested in ChatGPT since it has become a prominent AIGC
model that provides high-quality responses in various contexts, such as
software development and maintenance. Misuse of ChatGPT might cause significant
issues, particularly in public safety and education, despite its immense
potential. The majority of researchers choose to publish their work on Arxiv.
The effectiveness and originality of future work depend on the ability to
detect AI components in such contributions. To address this need, this study
will analyze a method that can see purposely manufactured content that academic
organizations use to post on Arxiv. For this study, a dataset was created using
physics, mathematics, and computer science articles. Using the newly built
dataset, the following step is to put originality.ai through its paces. The
statistical analysis shows that Originality.ai is very accurate, with a rate of
98%.