Effective deep learning through bidirectional reading on masked language model

Hiroyuki Nishimoto
{"title":"Effective deep learning through bidirectional reading on masked language model","authors":"Hiroyuki Nishimoto","doi":"10.54941/ahfe1001178","DOIUrl":null,"url":null,"abstract":"Google BERT is a neural network that is good at natural language processing. It has two major strategies. One is “Masked language Model” to clear the word-level relationships, and the other is “Next Sentence Prediction” to clear sentence-level relationships. In the masked language model, with the task of masking some words in sentences, BERT learns to predict the original word from context. Some questions come to mind. Why BERT achieves effective learning by reading in two ways from fore and back? What is the difference between bidirectional reading? BERT learns to predict the original word using the surrounding words as context and to make two-way predictions by forward and backward readings in order to increase the precision. Besides, the bidirectional reading technique can be applied to scenario planning especially using back-casting from the future. This paper clarifies these mechanisms.","PeriodicalId":116806,"journal":{"name":"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Systems Engineering and Design (IHSED2021) Future Trends and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54941/ahfe1001178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Google BERT is a neural network that is good at natural language processing. It has two major strategies. One is “Masked language Model” to clear the word-level relationships, and the other is “Next Sentence Prediction” to clear sentence-level relationships. In the masked language model, with the task of masking some words in sentences, BERT learns to predict the original word from context. Some questions come to mind. Why BERT achieves effective learning by reading in two ways from fore and back? What is the difference between bidirectional reading? BERT learns to predict the original word using the surrounding words as context and to make two-way predictions by forward and backward readings in order to increase the precision. Besides, the bidirectional reading technique can be applied to scenario planning especially using back-casting from the future. This paper clarifies these mechanisms.
基于掩蔽语言模型的双向阅读深度学习
Google BERT是一个擅长自然语言处理的神经网络。它有两个主要策略。一种是“屏蔽语言模型”,用于清除词级关系;另一种是“下一句预测”,用于清除句子级关系。在掩蔽语言模型中,BERT以掩蔽句子中的某些单词为任务,学习从上下文中预测原单词。我想到了一些问题。为什么BERT通过前后两种方式的阅读来达到有效的学习?双向阅读的区别是什么?BERT学习使用周围的词作为上下文来预测原词,并通过向前和向后阅读进行双向预测,以提高精度。此外,双向阅读技术可以应用于情景规划,特别是使用从未来回溯的方法。本文阐明了这些机制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信