{"title":"改进一元递归神经网络的性能及其在现实任务中的应用","authors":"Ivan Ivanov, L. Jing, R. Dangovski","doi":"10.1145/3274005.3274027","DOIUrl":null,"url":null,"abstract":"During a prolonged time execution, deep recurrent neural networks suffer from the so-called long-term dependency problem due to their recurrent connection. Although Long Short-Term Memory (LSTM) networks provide a temporary solution to this problem, they have inferior long-term memory capabilities which limit their applications. We use a recent approach for a recurrent neural network model implementing a unitary matrix in its recurrent connection to deal with long-term dependencies, without affecting its memory abilities. The model is capable of high technical results, but due to insufficient implementation does not achieve the expected performance. We optimize the implementation and architecture of the model, achieving time performance up to 5 times better than the original implementation. Additionally, we apply our improved model to three common real-life problems: the automatic text understanding task, the speech recognition task, and cryptoanalysis, and outperform the widely used LSTM model.","PeriodicalId":152033,"journal":{"name":"Proceedings of the 19th International Conference on Computer Systems and Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving the Performance of Unitary Recurrent Neural Networks and Their Application in Real-life Tasks\",\"authors\":\"Ivan Ivanov, L. Jing, R. Dangovski\",\"doi\":\"10.1145/3274005.3274027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"During a prolonged time execution, deep recurrent neural networks suffer from the so-called long-term dependency problem due to their recurrent connection. 
Although Long Short-Term Memory (LSTM) networks provide a temporary solution to this problem, they have inferior long-term memory capabilities which limit their applications. We use a recent approach for a recurrent neural network model implementing a unitary matrix in its recurrent connection to deal with long-term dependencies, without affecting its memory abilities. The model is capable of high technical results, but due to insufficient implementation does not achieve the expected performance. We optimize the implementation and architecture of the model, achieving time performance up to 5 times better than the original implementation. Additionally, we apply our improved model to three common real-life problems: the automatic text understanding task, the speech recognition task, and cryptoanalysis, and outperform the widely used LSTM model.\",\"PeriodicalId\":152033,\"journal\":{\"name\":\"Proceedings of the 19th International Conference on Computer Systems and Technologies\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 19th International Conference on Computer Systems and Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3274005.3274027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th International Conference on Computer Systems and 
Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3274005.3274027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving the Performance of Unitary Recurrent Neural Networks and Their Application in Real-life Tasks
When run over long input sequences, deep recurrent neural networks suffer from the so-called long-term dependency problem caused by their recurrent connections. Although Long Short-Term Memory (LSTM) networks mitigate this problem, their long-term memory capabilities remain limited, which restricts their applications. We use a recent approach in which the recurrent connection of a recurrent neural network is parameterized by a unitary matrix, handling long-term dependencies without degrading the model's memory abilities. The model is theoretically capable of strong results, but an inefficient implementation prevents it from reaching the expected performance. We optimize the model's implementation and architecture, achieving runtimes up to 5 times faster than the original implementation. Additionally, we apply the improved model to three common real-life problems: automatic text understanding, speech recognition, and cryptanalysis, and outperform the widely used LSTM model.
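The key idea behind the model is that a unitary recurrent matrix preserves the norm of the hidden state, so repeated application neither lets gradients vanish nor explode. The sketch below is not the paper's optimized architecture; it is a minimal illustration, assuming a generic unitary-RNN step of the form h_{t+1} = modReLU(U h_t + W x_t), with a unitary U sampled via QR decomposition. The function names (`random_unitary`, `unitary_rnn_step`) and the use of modReLU as the nonlinearity are illustrative assumptions, not taken from the paper's implementation.

```python
import numpy as np

def random_unitary(n, seed=0):
    # QR decomposition of a random complex matrix yields a unitary Q.
    rng = np.random.default_rng(seed)
    a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q, r = np.linalg.qr(a)
    # Multiply each column by a unit-modulus phase; Q stays unitary.
    d = np.diag(r)
    return q * (d / np.abs(d))

def modrelu(z, b):
    # modReLU: shifts the modulus by a bias b and clips at zero,
    # while preserving the phase of each complex entry.
    m = np.abs(z)
    return z * (np.maximum(m + b, 0.0) / (m + 1e-8))

def unitary_rnn_step(h, x, U, W, b):
    # One recurrent step: because U is unitary, the recurrent part
    # U @ h alone neither shrinks nor amplifies the hidden state.
    return modrelu(U @ h + W @ x, b)

if __name__ == "__main__":
    n = 8
    U = random_unitary(n)
    rng = np.random.default_rng(1)
    h = rng.normal(size=n) + 1j * rng.normal(size=n)
    # Norm preservation: ||U h|| == ||h|| up to floating-point error.
    print(np.linalg.norm(U @ h) - np.linalg.norm(h))
```

In practice, parameterizing and updating a full unitary matrix is expensive; the efficiency gains reported above come from a cheaper factored representation and implementation-level optimizations, which this sketch does not attempt to reproduce.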