Linguamatica最新文献_第9页

Hacia una clasificación verbal automática para el español: estudio sobre la relevancia de los diferentes tipos y configuraciones de información sintáctico-semántica 西班牙语的自动言语分类:句法语义信息不同类型和配置的相关性研究

IF 0.6

Linguamatica Pub Date : 2015-07-31 DOI: 10.21814/LM.7.1.202

Lara Gil-Vallejo, I. Castellón, Marta Coll-Florit, J. Turmo

引用次数: 0

Geração de Linguagem Natural para Conversão de Dados em Texto - Aplicação a um Assistente de Medicação para o Português 生成用于数据转换为文本的自然语言-葡萄牙语药物助理的应用

IF 0.6

Linguamatica Pub Date : 2015-07-31 DOI: 10.21814/LM.7.1.206

J. C. Pereira, A. Teixeira

{"title":"Geração de Linguagem Natural para Conversão de Dados em Texto - Aplicação a um Assistente de Medicação para o Português","authors":"J. C. Pereira, A. Teixeira","doi":"10.21814/LM.7.1.206","DOIUrl":"https://doi.org/10.21814/LM.7.1.206","url":null,"abstract":"New equipments, such as smartphones and tablets, are changing human computer interaction. These devices present several challenges, especially due to their small screen and keyboard. In order to use text and voice in multimodal interaction, it is essential to deploy modules to translate the internal information of the applications into sentences or texts, in order to display it on screen or synthesize it. Also, these modules must generate phrases and texts in the user's native language; the development should not require considerable resources; and the outcome of the generation should achieve a good degree of variability. Our main objective is to propose, implement and evaluate a method of data conversion to Portuguese which can be developed with a minimum of time and knowledge, but without compromising the necessary variability and quality of what is generated. The developed system, for a Medication Assistant, is intended to create descriptions, in natural language, of medication to be taken. Motivated by recent results, we opted for an approach based on machine translation, with models trained on a small parallel corpus. For that, a new corpus was created. With it, two variants of the system were trained: phrase-based translation and syntax-based translation. The two variants were evaluated by automatic measurements -- BLEU and Meteor -- and by humans. The results showed that a phrase-based approach produced better results than a syntax-based one: human evaluators evaluated 60% of phrase-based responses as good, or very good, compared to only 46% of syntax-based responses. Considering the corpus size, we judge this value (60%) as good.","PeriodicalId":41819,"journal":{"name":"Linguamatica","volume":"7 1","pages":"3-21"},"PeriodicalIF":0.6,"publicationDate":"2015-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68371861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A arquitetura de um glossário terminológico Inglês-Português na área de Eletrotécnica 电气技术领域的英语-葡萄牙语术语表的架构

IF 0.6

Linguamatica Pub Date : 2015-07-31 DOI: 10.21814/LM.7.1.204

S. Fadanelli, M. J. B. Finatto

引用次数: 0

Uma Comparação Sistemática de Diferentes Abordagens para a Sumarização Automática Extrativa de Textos em Português 葡萄牙语文本自动提取摘要不同方法的系统比较

IF 0.6

Linguamatica Pub Date : 2015-07-31 DOI: 10.21814/LM.7.1.203

M. Costa, Bruno Martins

引用次数: 3

Extração de Relações utilizando Features Diferenciadas para Português 葡萄牙语中使用不同特征的关系提取

IF 0.6

Linguamatica Pub Date : 2014-12-26 DOI: 10.21814/LM.6.2.182

Erick Nilsen Pereira de Souza, Daniela Barreio Claro

{"title":"Extração de Relações utilizando Features Diferenciadas para Português","authors":"Erick Nilsen Pereira de Souza, Daniela Barreio Claro","doi":"10.21814/LM.6.2.182","DOIUrl":"https://doi.org/10.21814/LM.6.2.182","url":null,"abstract":"Relation Extraction (RE) is a task of Information Extraction (IE) responsible for the discovery of semantic relationships between concepts in unstructured text. When the extraction is not limited to a predefined set of relations, the task is called Open Relation Extraction, whose main challenge is to reduce the proportion of invalid extractions in the universe of relationships identified. Current methods based on a set of specific machine learning features eliminate much of the invalid extractions. However, these solutions have the disadvantage of being highly language-dependent. This dependence arises from the difficulty in finding the most representative set of features to the Open RE problem, considering the peculiarities of each language. In this context, the present work proposes to assess the difficulties of classification based on features in open relation extraction in Portuguese, aiming to base new solutions that can reduce language dependence in this task. The results indicate that many representative features in English can not be mapped directly to the Portuguese language with satisfactory merits of classification. Among the classification algorithms evaluated, J48 showed the best results with a F-measure value of 84.1%, followed by SVM (83.9%), Perceptron (82.0%) and Naive Bayes (79,9%).","PeriodicalId":41819,"journal":{"name":"Linguamatica","volume":"55 3 1","pages":"57-65"},"PeriodicalIF":0.6,"publicationDate":"2014-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68370924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Izen+aditz konbinazioen azterketa elebiduna, hizkuntza-aplikazio aurreratuei begira 名称+听力组合测试元素，检查高级语言应用程序

IF 0.6

Linguamatica Pub Date : 2014-12-26 DOI: 10.21814/LM.6.2.188

Uxoa Iñurrieta Urmeneta, I. Aduriz, A. D. D. Ilarraza, Gorka Labaka, K. Sarasola

引用次数: 1

O dicionario de sinónimos como recurso para a expansión de WordNet 同义词词典作为WordNet扩展的资源

IF 0.6

Linguamatica Pub Date : 2014-12-26 DOI: 10.21814/LM.6.2.183

Xavier Gómez Guinovart, Miguel Anxo Solla Portela

引用次数: 6

Projetos sobre Tradução Automática do Português no Laboratório de Sistemas de Língua Falada do INESC-ID INESC-ID口语系统实验室葡萄牙语机器翻译项目