{"title":"在信息学的帮助下发现缩略语和首字母","authors":"Attila Imre","doi":"10.2478/ausp-2022-0025","DOIUrl":null,"url":null,"abstract":"Abstract The growing popularity of streaming services has led to innumerable audiovisual material available for the audience. As movies, documentaries, or TV shows are part of the entertainment industry, they aim at reaching viewers worldwide with the help of dubbed and subtitled versions. Our aim is to collect the acronyms used in the transcripts/subtitles of several American political TV shows (24, Designated Survivor, House of Cards, and The West Wing) and analyse their translated versions into Hungarian. However, the strenuous activity of opening each subtitle file one by one and browsing through them to spot and collect the acronyms and initialisms would result in countless mouse clicks. Hence, a specific software (SRT Manager) was designed to speed up the process. As the majority of definitions regarding acronyms and initialisms focus on the fact that they result from the combination of at least two capital letters, once the software gets the input (multiple subtitle files of entire seasons), it provides all the consecutive two- or more capital letter instances (with or without periods) found in the raw data, such as AA or A.A. Further statistical data (the source file of each instance, counting all unique values and numbering occurrences, and adding sample lines from the subtitle) also saves a lot of time and energy, as it can easily be exported to spreadsheet programs for further data analysis.","PeriodicalId":37574,"journal":{"name":"Acta Universitatis Sapientiae, Philologica","volume":"27 1","pages":"51 - 76"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spotting Acronyms and Initialisms with the Help of Informatics\",\"authors\":\"Attila Imre\",\"doi\":\"10.2478/ausp-2022-0025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract The growing popularity of streaming services has led to innumerable audiovisual material available for the audience. As movies, documentaries, or TV shows are part of the entertainment industry, they aim at reaching viewers worldwide with the help of dubbed and subtitled versions. Our aim is to collect the acronyms used in the transcripts/subtitles of several American political TV shows (24, Designated Survivor, House of Cards, and The West Wing) and analyse their translated versions into Hungarian. However, the strenuous activity of opening each subtitle file one by one and browsing through them to spot and collect the acronyms and initialisms would result in countless mouse clicks. Hence, a specific software (SRT Manager) was designed to speed up the process. As the majority of definitions regarding acronyms and initialisms focus on the fact that they result from the combination of at least two capital letters, once the software gets the input (multiple subtitle files of entire seasons), it provides all the consecutive two- or more capital letter instances (with or without periods) found in the raw data, such as AA or A.A. Further statistical data (the source file of each instance, counting all unique values and numbering occurrences, and adding sample lines from the subtitle) also saves a lot of time and energy, as it can easily be exported to spreadsheet programs for further data analysis.\",\"PeriodicalId\":37574,\"journal\":{\"name\":\"Acta Universitatis Sapientiae, Philologica\",\"volume\":\"27 1\",\"pages\":\"51 - 76\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acta Universitatis Sapientiae, Philologica\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/ausp-2022-0025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Arts and Humanities\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Universitatis Sapientiae, Philologica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ausp-2022-0025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Arts and Humanities","Score":null,"Total":0}
Spotting Acronyms and Initialisms with the Help of Informatics
Abstract The growing popularity of streaming services has led to innumerable audiovisual material available for the audience. As movies, documentaries, or TV shows are part of the entertainment industry, they aim at reaching viewers worldwide with the help of dubbed and subtitled versions. Our aim is to collect the acronyms used in the transcripts/subtitles of several American political TV shows (24, Designated Survivor, House of Cards, and The West Wing) and analyse their translated versions into Hungarian. However, the strenuous activity of opening each subtitle file one by one and browsing through them to spot and collect the acronyms and initialisms would result in countless mouse clicks. Hence, a specific software (SRT Manager) was designed to speed up the process. As the majority of definitions regarding acronyms and initialisms focus on the fact that they result from the combination of at least two capital letters, once the software gets the input (multiple subtitle files of entire seasons), it provides all the consecutive two- or more capital letter instances (with or without periods) found in the raw data, such as AA or A.A. Further statistical data (the source file of each instance, counting all unique values and numbering occurrences, and adding sample lines from the subtitle) also saves a lot of time and energy, as it can easily be exported to spreadsheet programs for further data analysis.
期刊介绍:
Series Philologica is published in cooperation with Sciendo by De Gruyter. Series Philologica publishes original, previously unpublished articles in the wide field of philological studies, and it is published in 3 issues a year (since 2014). The printed and online version of papers are identical.