{"title":"Analysis of Neural Network Based Language Modeling","authors":"P. Karrupusamy","doi":"10.36548/jaicn.2020.1.006","DOIUrl":null,"url":null,"abstract":"The fundamental and core process of the natural language processing is the language modelling usually referred as the statistical language modelling. The language modelling is also considered to be vital in the processing the natural languages as the other chores such as the completion of sentences, recognition of speech automatically, translations of the statistical machines, and generation of text and so on. The success of the viable natural language processing totally relies on the quality of the modelling of the language. In the previous spans the research field such as the linguistics, psychology, speech recognition, data compression, neuroscience, machine translation etc. As the neural network are the very good choices for having a quality language modelling the paper presents the analysis of neural networks in the modelling of the language. Utilizing some of the dataset such as the Penn Tree bank, Billion Word Benchmark and the Wiki Test the neural network models are evaluated on the basis of the word error rate, perplexity and the bilingual evaluation under study scores to identify the optimal model.","PeriodicalId":330888,"journal":{"name":"March 2020","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"March 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36548/jaicn.2020.1.006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The fundamental and core process of the natural language processing is the language modelling usually referred as the statistical language modelling. The language modelling is also considered to be vital in the processing the natural languages as the other chores such as the completion of sentences, recognition of speech automatically, translations of the statistical machines, and generation of text and so on. The success of the viable natural language processing totally relies on the quality of the modelling of the language. In the previous spans the research field such as the linguistics, psychology, speech recognition, data compression, neuroscience, machine translation etc. As the neural network are the very good choices for having a quality language modelling the paper presents the analysis of neural networks in the modelling of the language. Utilizing some of the dataset such as the Penn Tree bank, Billion Word Benchmark and the Wiki Test the neural network models are evaluated on the basis of the word error rate, perplexity and the bilingual evaluation under study scores to identify the optimal model.
自然语言处理的基础和核心过程是语言建模,通常称为统计语言建模。语言建模在自然语言处理中也被认为是至关重要的,如句子的完成、语音的自动识别、统计机器的翻译和文本的生成等。可行的自然语言处理的成功完全依赖于语言建模的质量。在此之前的研究领域涵盖了语言学、心理学、语音识别、数据压缩、神经科学、机器翻译等。鉴于神经网络是进行高质量语言建模的良好选择,本文对神经网络在语言建模中的应用进行了分析。利用Penn Tree bank、Billion Word Benchmark和Wiki Test等数据集,根据单词错误率、困惑度和学习分数下的双语评价对神经网络模型进行评估,以确定最优模型。