Neural Network Methods for Natural Language Processing

Yoav Goldberg
{"title":"自然语言处理的神经网络方法","authors":"Yoav Goldberg","doi":"10.2200/S00762ED1V01Y201703HLT037","DOIUrl":null,"url":null,"abstract":"Neural networks are a family of powerful machine learning models. This book focuses on the application of neural network models to natural language data. The first half of the book (Parts I and II) covers the basics of supervised machine learning and feed-forward neural networks, the basics of working with machine learning over language data, and the use of vector-based rather than symbolic representations for words. It also covers the computation-graph abstraction, which allows to easily define and train arbitrary neural networks, and is the basis behind the design of contemporary neural network software libraries. The second part of the book (Parts III and IV) introduces more specialized neural network architectures, including 1D convolutional neural networks, recurrent neural networks, conditioned-generation models, and attention-based models. These architectures and techniques are the driving force behind state-of-the-art algorithms for machine translation, syntactic parsing, and many other applications. Finally, we also discuss tree-shaped networks, structured prediction, and the prospects of multi-task learning.","PeriodicalId":22125,"journal":{"name":"Synthesis Lectures on Human Language Technologies","volume":"26 1","pages":"1-309"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"547","resultStr":"{\"title\":\" Neural Network Methods for Natural Language Processing\",\"authors\":\"Yoav Goldberg\",\"doi\":\"10.2200/S00762ED1V01Y201703HLT037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Neural networks are a family of powerful machine learning models. This book focuses on the application of neural network models to natural language data. The first half of the book (Parts I and II) covers the basics of supervised machine learning and feed-forward neural networks, the basics of working with machine learning over language data, and the use of vector-based rather than symbolic representations for words. It also covers the computation-graph abstraction, which allows to easily define and train arbitrary neural networks, and is the basis behind the design of contemporary neural network software libraries. The second part of the book (Parts III and IV) introduces more specialized neural network architectures, including 1D convolutional neural networks, recurrent neural networks, conditioned-generation models, and attention-based models. These architectures and techniques are the driving force behind state-of-the-art algorithms for machine translation, syntactic parsing, and many other applications. 
Finally, we also discuss tree-shaped networks, structured prediction, and the prospects of multi-task learning.\",\"PeriodicalId\":22125,\"journal\":{\"name\":\"Synthesis Lectures on Human Language Technologies\",\"volume\":\"26 1\",\"pages\":\"1-309\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"547\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Synthesis Lectures on Human Language Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2200/S00762ED1V01Y201703HLT037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Synthesis Lectures on Human Language Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2200/S00762ED1V01Y201703HLT037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 547

Abstract

Neural networks are a family of powerful machine learning models. This book focuses on the application of neural network models to natural language data. The first half of the book (Parts I and II) covers the basics of supervised machine learning and feed-forward neural networks, the basics of working with machine learning over language data, and the use of vector-based rather than symbolic representations for words. It also covers the computation-graph abstraction, which allows arbitrary neural networks to be easily defined and trained, and which underlies the design of contemporary neural network software libraries. The second half of the book (Parts III and IV) introduces more specialized neural network architectures, including 1D convolutional neural networks, recurrent neural networks, conditioned-generation models, and attention-based models. These architectures and techniques are the driving force behind state-of-the-art algorithms for machine translation, syntactic parsing, and many other applications. Finally, we discuss tree-shaped networks, structured prediction, and the prospects of multi-task learning.
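
To make the computation-graph abstraction mentioned above concrete, here is a minimal sketch, not taken from the book, of the themes of its first half: words mapped to vectors, averaged, and fed through a feed-forward network, with the graph built by the forward pass driving gradient-based training. It uses PyTorch as one library built on this abstraction; the vocabulary size, dimensions, toy sentence, and label are all invented for illustration.

```python
# A minimal sketch (not from the book) of vector-based word representations,
# a feed-forward network, and the computation-graph abstraction that lets
# automatic differentiation train it. All sizes and data are illustrative.
import torch
import torch.nn as nn

VOCAB_SIZE, EMB_DIM, HIDDEN, NUM_CLASSES = 100, 16, 32, 2

class BowClassifier(nn.Module):
    """Embed each word, average the vectors (a continuous-bag-of-words
    encoding), and pass the result through a one-hidden-layer network."""
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB_SIZE, EMB_DIM)   # word id -> vector
        self.ff = nn.Sequential(
            nn.Linear(EMB_DIM, HIDDEN),
            nn.Tanh(),
            nn.Linear(HIDDEN, NUM_CLASSES),
        )

    def forward(self, word_ids: torch.Tensor) -> torch.Tensor:
        return self.ff(self.emb(word_ids).mean(dim=0))

model = BowClassifier()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

# One toy "sentence" of word ids with a made-up label.
words = torch.tensor([4, 17, 23, 9])
label = torch.tensor(1)

for step in range(5):
    logits = model(words)                  # forward pass builds the graph
    loss = loss_fn(logits.unsqueeze(0), label.unsqueeze(0))
    optimizer.zero_grad()
    loss.backward()                        # gradients flow back through it
    optimizer.step()
    print(f"step {step}: loss {loss.item():.4f}")
```

Because the graph is rebuilt on every forward pass, the same training loop would serve a very different architecture unchanged, which is the flexibility the abstract credits to the abstraction.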