利用生物启发计算提高深度学习模型的分类性能

2021 Thirteenth International Conference on Contemporary Computing (IC3-2021) Pub Date : 2021-08-05 DOI:10.1145/3474124.3474174

Vaishali Baviskar, M. Verma, Pradeep Chatterjee

{"title":"利用生物启发计算提高深度学习模型的分类性能","authors":"Vaishali Baviskar, M. Verma, Pradeep Chatterjee","doi":"10.1145/3474124.3474174","DOIUrl":null,"url":null,"abstract":"Deep learning models have paved the way towards generating high-efficiency classification systems for multiple applications. These applications include lung disease classification, electrocardiogram classification, electroencephalogram classification; forest cover classification, etc. All these applications rely on efficient feature selection capabilities of deep learning models. Models like convolutional neural network (CNN), recurrent neural networks (RNNs), long-short-term-memory (LSTM) etc. are used for this purpose. These models tend to evaluate all possible feature combinations via iterative window-based feature processing. Thereby trying to cover indefinite number of feature combinations in order to classify a definite number of features into a definite number of classes. All these models have a stopping-criteria, which depends upon the error rate difference of previous current iteration. If the error rate is less than a particular threshold, and number of iterations are above a certain predefined value, then training of these networks is stopped. This property of deep learning models limits their real-time performance, because training stops even if the accuracy is lower than expected. The reason for this low accuracy is high dimensionality of search space, due to which selection of the most optimum features is skipped. In order to reduce the probability of such conditions, this text proposes a bio-inspired Genetic Algorithm model for accuracy-based feature selection. The selected features are given to different deep learning models like LSTM RNN, and their internal performance is evaluated. Here, heart failure disease dataset from kaggle is used, and it is observed that due to pre-feature selection process, overall accuracy of these models is improved by 10, while precision, recall fMeasure scores are improved by 15 for heart disease data sets. The specificity and sensitivity performance is improved by 20 when compared with RNN and LSTM models individually.","PeriodicalId":144611,"journal":{"name":"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving Classification Performance of Deep Learning Models Using Bio-Inspired Computing\",\"authors\":\"Vaishali Baviskar, M. Verma, Pradeep Chatterjee\",\"doi\":\"10.1145/3474124.3474174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning models have paved the way towards generating high-efficiency classification systems for multiple applications. These applications include lung disease classification, electrocardiogram classification, electroencephalogram classification; forest cover classification, etc. All these applications rely on efficient feature selection capabilities of deep learning models. Models like convolutional neural network (CNN), recurrent neural networks (RNNs), long-short-term-memory (LSTM) etc. are used for this purpose. These models tend to evaluate all possible feature combinations via iterative window-based feature processing. Thereby trying to cover indefinite number of feature combinations in order to classify a definite number of features into a definite number of classes. All these models have a stopping-criteria, which depends upon the error rate difference of previous current iteration. If the error rate is less than a particular threshold, and number of iterations are above a certain predefined value, then training of these networks is stopped. This property of deep learning models limits their real-time performance, because training stops even if the accuracy is lower than expected. The reason for this low accuracy is high dimensionality of search space, due to which selection of the most optimum features is skipped. In order to reduce the probability of such conditions, this text proposes a bio-inspired Genetic Algorithm model for accuracy-based feature selection. The selected features are given to different deep learning models like LSTM RNN, and their internal performance is evaluated. Here, heart failure disease dataset from kaggle is used, and it is observed that due to pre-feature selection process, overall accuracy of these models is improved by 10, while precision, recall fMeasure scores are improved by 15 for heart disease data sets. The specificity and sensitivity performance is improved by 20 when compared with RNN and LSTM models individually.\",\"PeriodicalId\":144611,\"journal\":{\"name\":\"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3474124.3474174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3474124.3474174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

深度学习模型为多种应用生成高效分类系统铺平了道路。这些应用包括肺病分类、心电图分类、脑电图分类;森林覆盖分类等。所有这些应用都依赖于深度学习模型的高效特征选择能力。卷积神经网络(CNN)、循环神经网络(RNNs)、长短期记忆(LSTM)等模型都用于此目的。这些模型倾向于通过迭代的基于窗口的特征处理来评估所有可能的特征组合。从而试图覆盖不确定数量的特征组合，以便将确定数量的特征分类为确定数量的类。所有这些模型都有一个停止准则，该准则取决于前一次迭代的错误率差。如果错误率小于特定阈值，并且迭代次数高于某个预定义值，则停止这些网络的训练。深度学习模型的这一特性限制了它们的实时性能，因为即使精度低于预期，训练也会停止。这种低准确率的原因是搜索空间的高维，因此跳过了最优特征的选择。为了降低这种情况的概率，本文提出了一种基于精度的特征选择的生物遗传算法模型。将选择的特征分配给不同的深度学习模型，如LSTM RNN，并对其内部性能进行评估。本文使用来自kaggle的心力衰竭疾病数据集，观察到由于预特征选择过程，这些模型的总体准确率提高了10，而心脏病数据集的精度，召回率和度量分数提高了15。与RNN和LSTM模型相比，该模型的特异性和灵敏度提高了20%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving Classification Performance of Deep Learning Models Using Bio-Inspired Computing

Deep learning models have paved the way towards generating high-efficiency classification systems for multiple applications. These applications include lung disease classification, electrocardiogram classification, electroencephalogram classification; forest cover classification, etc. All these applications rely on efficient feature selection capabilities of deep learning models. Models like convolutional neural network (CNN), recurrent neural networks (RNNs), long-short-term-memory (LSTM) etc. are used for this purpose. These models tend to evaluate all possible feature combinations via iterative window-based feature processing. Thereby trying to cover indefinite number of feature combinations in order to classify a definite number of features into a definite number of classes. All these models have a stopping-criteria, which depends upon the error rate difference of previous current iteration. If the error rate is less than a particular threshold, and number of iterations are above a certain predefined value, then training of these networks is stopped. This property of deep learning models limits their real-time performance, because training stops even if the accuracy is lower than expected. The reason for this low accuracy is high dimensionality of search space, due to which selection of the most optimum features is skipped. In order to reduce the probability of such conditions, this text proposes a bio-inspired Genetic Algorithm model for accuracy-based feature selection. The selected features are given to different deep learning models like LSTM RNN, and their internal performance is evaluated. Here, heart failure disease dataset from kaggle is used, and it is observed that due to pre-feature selection process, overall accuracy of these models is improved by 10, while precision, recall fMeasure scores are improved by 15 for heart disease data sets. The specificity and sensitivity performance is improved by 20 when compared with RNN and LSTM models individually.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)

自引率

0.00%

发文量