Large-scale text classification with deeper and wider convolution neural network

Int. J. Simul. Process. Model. Pub Date : 2020-04-21 DOI:10.1504/ijspm.2020.10028731

Min Huang, Wei Huang

引用次数: 1

Abstract

The dominant approaches for most natural language processing (NLP) tasks like text classification are recurrent neural networks (RNNs) and convolutional neural networks (CNNs). These architectures are usually shallow and only have one or two layers, which cannot easily extract inner patterns in natural language. Different from the original feature of image pixels with regularity, words and phrases are highly abstracted from human knowledge without direct correlation. Shallow models only capture the surface relation between them while deep models cannot directly apply them. Therefore, a shuffle convolution neural network (SCNN) is proposed to address the shallow learning problem by introducing wider inception cell and deeper residual connection. In the paper, the difficulty of applying deep models to NLP problems is overcome by tricks of shuffling channel input and reshaping the output dimension in the first layer. The results of the experiments carried out in this research work demonstrate that the proposed SCNN makes a great improvement in accuracy and efficiency compared to shallow models.

查看原文本刊更多论文

基于更深更广卷积神经网络的大规模文本分类

大多数自然语言处理(NLP)任务(如文本分类)的主要方法是循环神经网络(rnn)和卷积神经网络(cnn)。这些体系结构通常很浅，只有一层或两层，不容易提取自然语言的内部模式。不同于原始图像像素具有规律性的特征，词和短语是高度抽象的人类知识，没有直接的相关性。浅层模型只捕捉它们之间的表面关系，而深层模型不能直接应用它们。因此，提出了一种洗牌卷积神经网络(SCNN)，通过引入更宽的初始单元和更深的残差连接来解决浅层学习问题。在本文中，通过在第一层变换通道输入和重塑输出维度的技巧克服了将深度模型应用于NLP问题的困难。实验结果表明，与浅层模型相比，本文提出的SCNN在精度和效率方面都有很大提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Simul. Process. Model.

自引率

0.00%

发文量