德语方言识别的深层和浅层模型的结合。

Proceedings of the Sixth Workshop on Pub Date : 1900-01-01 DOI:10.18653/v1/W19-1413

Andrei M. Butnaru

{"title":"德语方言识别的深层和浅层模型的结合。","authors":"Andrei M. Butnaru","doi":"10.18653/v1/W19-1413","DOIUrl":null,"url":null,"abstract":"*This is a submission for the Third VarDial Evaluation Campaign* In this paper, we present a machine learning approach for the German Dialect Identification (GDI) Closed Shared Task of the DSL 2019 Challenge. The proposed approach combines deep and shallow models, by applying a voting scheme on the outputs resulted from a Character-level Convolutional Neural Networks (Char-CNN), a Long Short-Term Memory (LSTM) network, and a model based on String Kernels. The first model used is the Char-CNN model that merges multiple convolutions computed with kernels of different sizes. The second model is the LSTM network which applies a global max pooling over the returned sequences over time. Both models pass the activation maps to two fully-connected layers. The final model is based on String Kernels, computed on character p-grams extracted from speech transcripts. The model combines two blended kernel functions, one is the presence bits kernel, and the other is the intersection kernel. The empirical results obtained in the shared task prove that the approach can achieve good results. The system proposed in this paper obtained the fourth place with a macro-F1 score of 62.55%","PeriodicalId":344344,"journal":{"name":"Proceedings of the Sixth Workshop on","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"BAM: A combination of deep and shallow models for German Dialect Identification.\",\"authors\":\"Andrei M. Butnaru\",\"doi\":\"10.18653/v1/W19-1413\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"*This is a submission for the Third VarDial Evaluation Campaign* In this paper, we present a machine learning approach for the German Dialect Identification (GDI) Closed Shared Task of the DSL 2019 Challenge. The proposed approach combines deep and shallow models, by applying a voting scheme on the outputs resulted from a Character-level Convolutional Neural Networks (Char-CNN), a Long Short-Term Memory (LSTM) network, and a model based on String Kernels. The first model used is the Char-CNN model that merges multiple convolutions computed with kernels of different sizes. The second model is the LSTM network which applies a global max pooling over the returned sequences over time. Both models pass the activation maps to two fully-connected layers. The final model is based on String Kernels, computed on character p-grams extracted from speech transcripts. The model combines two blended kernel functions, one is the presence bits kernel, and the other is the intersection kernel. The empirical results obtained in the shared task prove that the approach can achieve good results. The system proposed in this paper obtained the fourth place with a macro-F1 score of 62.55%\",\"PeriodicalId\":344344,\"journal\":{\"name\":\"Proceedings of the Sixth Workshop on\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Sixth Workshop on\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18653/v1/W19-1413\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Sixth Workshop on","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/W19-1413","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在本文中，我们提出了一种用于DSL 2019挑战赛德语方言识别(GDI)封闭共享任务的机器学习方法。该方法通过对字符级卷积神经网络(Char-CNN)、长短期记忆(LSTM)网络和基于字符串核的模型的输出应用投票方案，将深度和浅层模型相结合。使用的第一个模型是Char-CNN模型，该模型合并了使用不同大小的核计算的多个卷积。第二个模型是LSTM网络，它随着时间的推移对返回的序列应用全局最大池化。两个模型都将激活映射传递给两个完全连接的层。最后一个模型是基于字符串核，计算从语音文本中提取的字符p图。该模型结合了两个混合核函数，一个是存在位核函数，另一个是交集核函数。在共享任务中获得的经验结果证明，该方法可以取得良好的效果。本文提出的系统以62.55%的宏观f1得分获得第四名

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

BAM: A combination of deep and shallow models for German Dialect Identification.

*This is a submission for the Third VarDial Evaluation Campaign* In this paper, we present a machine learning approach for the German Dialect Identification (GDI) Closed Shared Task of the DSL 2019 Challenge. The proposed approach combines deep and shallow models, by applying a voting scheme on the outputs resulted from a Character-level Convolutional Neural Networks (Char-CNN), a Long Short-Term Memory (LSTM) network, and a model based on String Kernels. The first model used is the Char-CNN model that merges multiple convolutions computed with kernels of different sizes. The second model is the LSTM network which applies a global max pooling over the returned sequences over time. Both models pass the activation maps to two fully-connected layers. The final model is based on String Kernels, computed on character p-grams extracted from speech transcripts. The model combines two blended kernel functions, one is the presence bits kernel, and the other is the intersection kernel. The empirical results obtained in the shared task prove that the approach can achieve good results. The system proposed in this paper obtained the fourth place with a macro-F1 score of 62.55%

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Sixth Workshop on

自引率

0.00%

发文量