{"title":"Advanced computational methods for news classification: A study in neural networks and CNN integrated with GPT","authors":"Fahim Sufi","doi":"10.1016/j.ject.2024.09.001","DOIUrl":null,"url":null,"abstract":"<div><div>In an era inundated with vast amounts of information, the imperative for efficient news classification is paramount. This research explores the sophisticated integration of neural networks and convolutional neural networks (CNN) with Generative Pre-trained Transformers (GPT) to enhance the precision and efficacy of news categorization. The rapid digital dissemination of news necessitates advanced computational methodologies capable of accurate classification and event prediction that include finance and economic events. Leveraging recent advancements in machine learning and natural language processing (NLP), this study utilizes large language models (LLMs) such as GPT and BERT, known for their exceptional comprehension and generation of human-like text. Over 232 days, our methodology classified 33,979 news articles into Education & Learning, Health & Medicine, and Science & Technology, with further subcategorization into 32 distinct subcategories. For evaluation, a sample of 5000 articles was assessed using metrics such as True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN), Precision, Recall, and F1-Score. In comparison with the existing studies, the proposed method achieving significantly higher with average scores of 0.986 (Precision), 0.987 (Recall), and 0.987 (F1-Score). This research offers substantial practical contributions, providing detailed insights into news source contributions, effective anomaly detection, and predictive trend analysis using neural networks. The theoretical contributions are profound, demonstrating the mathematical integration of GPT with CNNs and recurrent neural networks. This integration advances computational news classification and exemplifies how sophisticated mathematical frameworks enhance large-scale text data analysis, marking a pivotal advancement in applying advanced computational methods in real-world scenarios.</div></div>","PeriodicalId":100776,"journal":{"name":"Journal of Economy and Technology","volume":"3 ","pages":"Pages 264-281"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Economy and Technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949948824000404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In an era inundated with vast amounts of information, the imperative for efficient news classification is paramount. This research explores the sophisticated integration of neural networks and convolutional neural networks (CNN) with Generative Pre-trained Transformers (GPT) to enhance the precision and efficacy of news categorization. The rapid digital dissemination of news necessitates advanced computational methodologies capable of accurate classification and event prediction that include finance and economic events. Leveraging recent advancements in machine learning and natural language processing (NLP), this study utilizes large language models (LLMs) such as GPT and BERT, known for their exceptional comprehension and generation of human-like text. Over 232 days, our methodology classified 33,979 news articles into Education & Learning, Health & Medicine, and Science & Technology, with further subcategorization into 32 distinct subcategories. For evaluation, a sample of 5000 articles was assessed using metrics such as True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN), Precision, Recall, and F1-Score. In comparison with the existing studies, the proposed method achieving significantly higher with average scores of 0.986 (Precision), 0.987 (Recall), and 0.987 (F1-Score). This research offers substantial practical contributions, providing detailed insights into news source contributions, effective anomaly detection, and predictive trend analysis using neural networks. The theoretical contributions are profound, demonstrating the mathematical integration of GPT with CNNs and recurrent neural networks. This integration advances computational news classification and exemplifies how sophisticated mathematical frameworks enhance large-scale text data analysis, marking a pivotal advancement in applying advanced computational methods in real-world scenarios.