基于语料库的大学生在线葡萄牙语词典设计方案

IF 0.2 0 LANGUAGE & LINGUISTICS
T. Kuhn
{"title":"基于语料库的大学生在线葡萄牙语词典设计方案","authors":"T. Kuhn","doi":"10.5334/JPL.209","DOIUrl":null,"url":null,"abstract":"The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at both speakers of Portuguese as a mother tongue and as an additional language and covering Brazilian and European Portuguese varieties. For that, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and import into dictionary writing system. As a method that had never been applied for lexicographical projects of the Portuguese language, it was necessary to experiment the approach for the first time. Thus, all the required pre-requisites were newly developed, namely, a corpus of academic texts, sketch grammar, GDEX configuration, and a specially-tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.","PeriodicalId":41871,"journal":{"name":"Journal of Portuguese Linguistics","volume":null,"pages":null},"PeriodicalIF":0.2000,"publicationDate":"2019-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students\",\"authors\":\"T. Kuhn\",\"doi\":\"10.5334/JPL.209\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at both speakers of Portuguese as a mother tongue and as an additional language and covering Brazilian and European Portuguese varieties. For that, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and import into dictionary writing system. As a method that had never been applied for lexicographical projects of the Portuguese language, it was necessary to experiment the approach for the first time. Thus, all the required pre-requisites were newly developed, namely, a corpus of academic texts, sketch grammar, GDEX configuration, and a specially-tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.\",\"PeriodicalId\":41871,\"journal\":{\"name\":\"Journal of Portuguese Linguistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.2000,\"publicationDate\":\"2019-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Portuguese Linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5334/JPL.209\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Portuguese Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/JPL.209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 4

摘要

这个博士项目的目标是为大学生设计一个在线语料库驱动的葡萄牙语词典(DOPU),针对以葡萄牙语为母语和作为附加语言的使用者,涵盖巴西和欧洲的葡萄牙语品种。为此,采用了高度创新的半自动化词典制作方法(Gantar, Kosem和Krek 2016),该方法涉及从语料库中自动提取数据并导入词典编写系统。作为一种从未应用于葡萄牙语词典编纂项目的方法,有必要对这种方法进行第一次试验。因此,所有必需的先决条件都是新开发的,即学术文本的语料库、草图语法、GDEX配置和用于自动提取数据的专门定制的过程。实验表明,该方法不仅可以成功地为DOPU的设计提供词汇内容,而且对其他葡语词典编纂项目也有一定的借鉴意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students
The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at both speakers of Portuguese as a mother tongue and as an additional language and covering Brazilian and European Portuguese varieties. For that, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and import into dictionary writing system. As a method that had never been applied for lexicographical projects of the Portuguese language, it was necessary to experiment the approach for the first time. Thus, all the required pre-requisites were newly developed, namely, a corpus of academic texts, sketch grammar, GDEX configuration, and a specially-tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Portuguese Linguistics
Journal of Portuguese Linguistics LANGUAGE & LINGUISTICS-
CiteScore
1.10
自引率
0.00%
发文量
11
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信