Mark Fishel, Yota Georgakopoulou, Sergio Penkale, V. Petukhova, M. Rojc, M. Volk, Andy Way
{"title":"From Subtitles to Parallel Corpora","authors":"Mark Fishel, Yota Georgakopoulou, Sergio Penkale, V. Petukhova, M. Rojc, M. Volk, Andy Way","doi":"10.5167/UZH-63327","DOIUrl":null,"url":null,"abstract":"We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.","PeriodicalId":137211,"journal":{"name":"European Association for Machine Translation Conferences/Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Association for Machine Translation Conferences/Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5167/UZH-63327","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.