Harnessing Deep Convolutional Neural Networks Detecting Synthetic Cannabinoids: A Hybrid Learning Strategy for Handling Class Imbalances in Limited Datasets

IF 2.1 Q2 ENGINEERING, MULTIDISCIPLINARY

Inventions Pub Date : 2023-10-16 DOI:10.3390/inventions8050129

Catalina Mercedes Burlacu, Adrian Constantin Burlacu, Mirela Praisler, Cristina Paraschiv

{"title":"Harnessing Deep Convolutional Neural Networks Detecting Synthetic Cannabinoids: A Hybrid Learning Strategy for Handling Class Imbalances in Limited Datasets","authors":"Catalina Mercedes Burlacu, Adrian Constantin Burlacu, Mirela Praisler, Cristina Paraschiv","doi":"10.3390/inventions8050129","DOIUrl":null,"url":null,"abstract":"The aim of this research was to develop and deploy efficient deep convolutional neural network (DCNN) frameworks for detecting and discriminating between various categories of designer drugs. These are of particular relevance in forensic contexts, aiding efforts to prevent and counter drug use and trafficking and supporting associated legal investigations. Our multinomial classification architectures, based on Attenuated Total Reflectance Fourier-Transform Infrared (ATR-FTIR) spectra, are primarily tailored to accurately identify synthetic cannabinoids. Within the scope of our dataset, they also adeptly detect other forensically significant drugs and misused prescription medications. The artificial intelligence (AI) models we developed use two platforms: our custom-designed, pre-trained Convolutional Autoencoder (CAE) and a structure derived from the Vision Transformer Trained on ImageNet Competition Data (ViT-B/32) model. In order to compare and refine our models, various loss functions (cross-entropy and focal loss) and optimization algorithms (Adaptive Moment Estimation, Stochastic Gradient Descent, Sign Stochastic Gradient Descent, and Root Mean Square Propagation) were tested and evaluated at differing learning rates. This study shows that innovative transfer learning methods, which integrate both unsupervised and supervised techniques with spectroscopic data pre-processing (ATR correction, normalization, smoothing) and present significant benefits. Their effectiveness in training AI systems on limited, imbalanced datasets is particularly notable. The strategic deployment of CAEs, complemented by data augmentation and synthetic sample generation using the Synthetic Minority Oversampling Technique (SMOTE) and class weights, effectively address the challenges posed by such datasets. The robustness and adaptability of our DCNN models are discussed, emphasizing their reliability and portability for real-world applications. Beyond their primary forensic utility, these systems demonstrate versatility, making them suitable for broader computer vision tasks, notably image classification and object detection.","PeriodicalId":14564,"journal":{"name":"Inventions","volume":"36 1","pages":"0"},"PeriodicalIF":2.1000,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inventions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/inventions8050129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

The aim of this research was to develop and deploy efficient deep convolutional neural network (DCNN) frameworks for detecting and discriminating between various categories of designer drugs. These are of particular relevance in forensic contexts, aiding efforts to prevent and counter drug use and trafficking and supporting associated legal investigations. Our multinomial classification architectures, based on Attenuated Total Reflectance Fourier-Transform Infrared (ATR-FTIR) spectra, are primarily tailored to accurately identify synthetic cannabinoids. Within the scope of our dataset, they also adeptly detect other forensically significant drugs and misused prescription medications. The artificial intelligence (AI) models we developed use two platforms: our custom-designed, pre-trained Convolutional Autoencoder (CAE) and a structure derived from the Vision Transformer Trained on ImageNet Competition Data (ViT-B/32) model. In order to compare and refine our models, various loss functions (cross-entropy and focal loss) and optimization algorithms (Adaptive Moment Estimation, Stochastic Gradient Descent, Sign Stochastic Gradient Descent, and Root Mean Square Propagation) were tested and evaluated at differing learning rates. This study shows that innovative transfer learning methods, which integrate both unsupervised and supervised techniques with spectroscopic data pre-processing (ATR correction, normalization, smoothing) and present significant benefits. Their effectiveness in training AI systems on limited, imbalanced datasets is particularly notable. The strategic deployment of CAEs, complemented by data augmentation and synthetic sample generation using the Synthetic Minority Oversampling Technique (SMOTE) and class weights, effectively address the challenges posed by such datasets. The robustness and adaptability of our DCNN models are discussed, emphasizing their reliability and portability for real-world applications. Beyond their primary forensic utility, these systems demonstrate versatility, making them suitable for broader computer vision tasks, notably image classification and object detection.

查看原文本刊更多论文

利用深度卷积神经网络检测合成大麻素:在有限数据集中处理类失衡的混合学习策略

本研究的目的是开发和部署有效的深度卷积神经网络(DCNN)框架，用于检测和区分各种类别的设计药物。这在法医方面特别重要，有助于预防和打击毒品使用和贩运的努力，并支持相关的法律调查。我们基于衰减全反射傅里叶变换红外(ATR-FTIR)光谱的多项分类架构主要用于准确识别合成大麻素。在我们的数据集范围内，他们还熟练地检测到其他具有法医意义的药物和滥用的处方药。我们开发的人工智能(AI)模型使用两个平台:我们定制设计的预训练卷积自编码器(CAE)和源自ImageNet竞争数据(vitb /32)模型上训练的视觉转换器的结构。为了比较和完善我们的模型，我们在不同的学习率下测试和评估了各种损失函数(交叉熵和焦点损失)和优化算法(自适应矩估计、随机梯度下降、符号随机梯度下降和均方根传播)。该研究表明，将无监督和有监督技术与光谱数据预处理(ATR校正、归一化、平滑)相结合的创新迁移学习方法具有显著的优势。它们在有限的、不平衡的数据集上训练人工智能系统的有效性尤其显著。cae的战略部署，辅以数据增强和使用合成少数派过采样技术(SMOTE)和类权生成的合成样本，有效地解决了此类数据集带来的挑战。讨论了我们的DCNN模型的鲁棒性和适应性，强调了它们在实际应用中的可靠性和可移植性。除了其主要的法医用途外，这些系统还展示了多功能性，使其适用于更广泛的计算机视觉任务，特别是图像分类和目标检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊