深度学习的选择性概述。

IF 3.9 1区数学 Q1 STATISTICS & PROBABILITY

Statistical Science Pub Date : 2021-05-01 Epub Date: 2020-04-19 DOI:10.1214/20-sts783

Jianqing Fan, Cong Ma, Yiqiao Zhong

{"title":"深度学习的选择性概述。","authors":"Jianqing Fan, Cong Ma, Yiqiao Zhong","doi":"10.1214/20-sts783","DOIUrl":null,"url":null,"abstract":"Deep learning has achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks have a long history, recent advances have greatly improved their performance in computer vision, natural language processing, etc. From the statistical and scientific perspective, it is natural to ask: What is deep learning? What are the new characteristics of deep learning, compared with classical methods? What are the theoretical foundations of deep learning? To answer these questions, we introduce common neural network models (e.g., convolutional neural nets, recurrent neural nets, generative adversarial nets) and training techniques (e.g., stochastic gradient descent, dropout, batch normalization) from a statistical point of view. Along the way, we highlight new characteristics of deep learning (including depth and over-parametrization) and explain their practical and theoretical benefits. We also sample recent results on theories of deep learning, many of which are only suggestive. While a complete understanding of deep learning remains elusive, we hope that our perspectives and discussions serve as a stimulus for new statistical research.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"36 2","pages":"264-290"},"PeriodicalIF":3.9000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300482/pdf/nihms-1639566.pdf","citationCount":"0","resultStr":"{\"title\":\"A selective overview of deep learning.\",\"authors\":\"Jianqing Fan, Cong Ma, Yiqiao Zhong\",\"doi\":\"10.1214/20-sts783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning has achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks have a long history, recent advances have greatly improved their performance in computer vision, natural language processing, etc. From the statistical and scientific perspective, it is natural to ask: What is deep learning? What are the new characteristics of deep learning, compared with classical methods? What are the theoretical foundations of deep learning? To answer these questions, we introduce common neural network models (e.g., convolutional neural nets, recurrent neural nets, generative adversarial nets) and training techniques (e.g., stochastic gradient descent, dropout, batch normalization) from a statistical point of view. Along the way, we highlight new characteristics of deep learning (including depth and over-parametrization) and explain their practical and theoretical benefits. We also sample recent results on theories of deep learning, many of which are only suggestive. While a complete understanding of deep learning remains elusive, we hope that our perspectives and discussions serve as a stimulus for new statistical research.\",\"PeriodicalId\":51172,\"journal\":{\"name\":\"Statistical Science\",\"volume\":\"36 2\",\"pages\":\"264-290\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2021-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300482/pdf/nihms-1639566.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Science\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1214/20-sts783\",\"RegionNum\":1,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/4/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Science","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1214/20-sts783","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/4/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

摘要

近年来，深度学习取得了巨大成功。简单地说，深度学习使用许多非线性函数的组合来模拟输入特征和标签之间的复杂依赖关系。虽然神经网络有着悠久的历史，但近年来的进步大大提高了其在计算机视觉、自然语言处理等方面的性能。从统计学和科学的角度看，我们自然会问：什么是深度学习？与经典方法相比，深度学习有哪些新特点？深度学习的理论基础是什么？为了回答这些问题，我们从统计学的角度介绍了常见的神经网络模型（如卷积神经网络、递归神经网络、生成对抗网络）和训练技术（如随机梯度下降、丢弃、批量归一化）。在此过程中，我们强调了深度学习的新特点（包括深度和过参数化），并解释了它们在实践和理论上的益处。我们还列举了有关深度学习理论的最新成果，其中许多成果都只是建议性的。虽然对深度学习的全面理解仍遥不可及，但我们希望我们的观点和讨论能对新的统计研究起到激励作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A selective overview of deep learning.

Deep learning has achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks have a long history, recent advances have greatly improved their performance in computer vision, natural language processing, etc. From the statistical and scientific perspective, it is natural to ask: What is deep learning? What are the new characteristics of deep learning, compared with classical methods? What are the theoretical foundations of deep learning? To answer these questions, we introduce common neural network models (e.g., convolutional neural nets, recurrent neural nets, generative adversarial nets) and training techniques (e.g., stochastic gradient descent, dropout, batch normalization) from a statistical point of view. Along the way, we highlight new characteristics of deep learning (including depth and over-parametrization) and explain their practical and theoretical benefits. We also sample recent results on theories of deep learning, many of which are only suggestive. While a complete understanding of deep learning remains elusive, we hope that our perspectives and discussions serve as a stimulus for new statistical research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Statistical Science 数学-统计学与概率论

CiteScore

6.50

自引率

1.80%

发文量

审稿时长

>12 weeks

期刊介绍： The central purpose of Statistical Science is to convey the richness, breadth and unity of the field by presenting the full range of contemporary statistical thought at a moderate technical level, accessible to the wide community of practitioners, researchers and students of statistics and probability.