Structured Text Generation for Spanish Freestyle Battles using Neural Networks

P. D. Bianco, I. Mindlin, L. Lanzarini, Franco Ronchetti, W. Hasperué, F. Quiroga
{"title":"Structured Text Generation for Spanish Freestyle Battles using Neural Networks","authors":"P. D. Bianco, I. Mindlin, L. Lanzarini, Franco Ronchetti, W. Hasperué, F. Quiroga","doi":"10.1109/CLEI53233.2021.9639929","DOIUrl":null,"url":null,"abstract":"As the presence of artificial intelligence has increased in a variety of different areas, the use of machine learning and deep learning techniques for creative purposes has also risen significantly in recent years. Works of this kind within the area of natural language processing (NLP) are typically neural models used for fiction or lyrics generation. Those works are in most cases in English and adapting them to other languages is not feasible. In this work, we develop a Spanish text generator system for the rap sub-genre known as freestyle. Freestyle songs present unique challenges for text generation given that performers compete with one another in a lyric improvisation contest. Given the low availability of freestyle text, especially in Spanish, we collected two separate datasets, one with freestyle lyrics and the other, larger, with rap lyrics, which are more readily available. The rap dataset can be used for pretraining, and the freestyle dataset for finetuning on the generation task. Furthermore, we design a neural network-based generation model that takes into account both the structure of freestyle and the low data availability. The model was able to generate realistic freestyle verses in Spanish.","PeriodicalId":6803,"journal":{"name":"2021 XLVII Latin American Computing Conference (CLEI)","volume":"71 1","pages":"1-9"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 XLVII Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI53233.2021.9639929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As the presence of artificial intelligence has increased in a variety of different areas, the use of machine learning and deep learning techniques for creative purposes has also risen significantly in recent years. Works of this kind within the area of natural language processing (NLP) are typically neural models used for fiction or lyrics generation. Those works are in most cases in English and adapting them to other languages is not feasible. In this work, we develop a Spanish text generator system for the rap sub-genre known as freestyle. Freestyle songs present unique challenges for text generation given that performers compete with one another in a lyric improvisation contest. Given the low availability of freestyle text, especially in Spanish, we collected two separate datasets, one with freestyle lyrics and the other, larger, with rap lyrics, which are more readily available. The rap dataset can be used for pretraining, and the freestyle dataset for finetuning on the generation task. Furthermore, we design a neural network-based generation model that takes into account both the structure of freestyle and the low data availability. The model was able to generate realistic freestyle verses in Spanish.
使用神经网络的结构化文本生成西班牙自由式战斗
随着人工智能在各种不同领域的出现,近年来,机器学习和深度学习技术用于创造性目的的使用也显著增加。这类在自然语言处理(NLP)领域的作品通常是用于小说或歌词生成的神经模型。这些作品大多是英文的,改编成其他语言是不可行的。在这项工作中,我们开发了一个西班牙语文本生成器系统,用于说唱子类型,即自由式。鉴于表演者在歌词即兴创作比赛中相互竞争,自由式歌曲对文本生成提出了独特的挑战。考虑到自由式文本的低可用性,特别是在西班牙语中,我们收集了两个独立的数据集,一个包含自由式歌词,另一个更大,包含说唱歌词,这更容易获得。rap数据集可用于预训练,freestyle数据集可用于对生成任务进行微调。此外,我们设计了一个基于神经网络的生成模型,该模型考虑了freestyle的结构和低数据可用性。该模型能够生成真实的西班牙语自由式诗歌。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信