电视剧作为新兴词汇的传播者:电视语料库中的非典型化表达

ICAME journal : computers in English linguistics Pub Date : 2023-05-01 DOI:10.2478/icame-2023-0004

Daniela Landert, Tanja Säily, Mika Hämäläinen

{"title":"电视剧作为新兴词汇的传播者:电视语料库中的非典型化表达","authors":"Daniela Landert, Tanja Säily, Mika Hämäläinen","doi":"10.2478/icame-2023-0004","DOIUrl":null,"url":null,"abstract":"Abstract This study presents a method for identifying words that appear in corpus data earlier than their first date of attestation in dictionaries. We demonstrate the application of this method based on a large diachronic corpus, the TV Corpus, and the Oxford English Dictionary (OED). Combining automatic extraction of candidate terms from the TV Corpus with comprehensive manual analysis and verification, the method identifies 32 words that were used in TV series before their first attestation in the OED. We present a detailed discussion of these words, analysing their distribution across decades and genres of the TV Corpus, their origins, semantic domains and word-formation processes. We also present extracts with their first uses in the TV Corpus and analyse how the words were presented to the large and anonymous mass audience. Our study shows that the method we present is suitable for identifying early attestations of words in large corpora, even though in the case of the TV Corpus, a great deal of manual analysis and verification is needed. In addition, we argue that TV series and other types of fictional texts are an important resource for studying the coinage and spread of terms, due to their function and the fact that they address a mass audience.","PeriodicalId":73271,"journal":{"name":"ICAME journal : computers in English linguistics","volume":"32 3 1","pages":"63 - 79"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"TV series as disseminators of emerging vocabulary: Non-codified expressions in the TV Corpus\",\"authors\":\"Daniela Landert, Tanja Säily, Mika Hämäläinen\",\"doi\":\"10.2478/icame-2023-0004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract This study presents a method for identifying words that appear in corpus data earlier than their first date of attestation in dictionaries. We demonstrate the application of this method based on a large diachronic corpus, the TV Corpus, and the Oxford English Dictionary (OED). Combining automatic extraction of candidate terms from the TV Corpus with comprehensive manual analysis and verification, the method identifies 32 words that were used in TV series before their first attestation in the OED. We present a detailed discussion of these words, analysing their distribution across decades and genres of the TV Corpus, their origins, semantic domains and word-formation processes. We also present extracts with their first uses in the TV Corpus and analyse how the words were presented to the large and anonymous mass audience. Our study shows that the method we present is suitable for identifying early attestations of words in large corpora, even though in the case of the TV Corpus, a great deal of manual analysis and verification is needed. In addition, we argue that TV series and other types of fictional texts are an important resource for studying the coinage and spread of terms, due to their function and the fact that they address a mass audience.\",\"PeriodicalId\":73271,\"journal\":{\"name\":\"ICAME journal : computers in English linguistics\",\"volume\":\"32 3 1\",\"pages\":\"63 - 79\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICAME journal : computers in English linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/icame-2023-0004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICAME journal : computers in English linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/icame-2023-0004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

摘要本研究提出了一种识别语料库数据中出现的早于词典首次证明日期的单词的方法。我们在一个大型历时语料库、电视语料库和牛津英语词典(OED)的基础上演示了这种方法的应用。该方法将自动从电视语料库中提取候选词与全面的人工分析和验证相结合，确定了32个在电视连续剧首次在牛津英语词典中得到证实之前使用过的词。我们对这些词进行了详细的讨论，分析了它们在电视语料库中几十年和体裁的分布、它们的起源、语义域和构词过程。我们还介绍了在电视语料库中首次使用的摘录，并分析了这些单词是如何呈现给大量匿名的大众观众的。我们的研究表明，我们提出的方法适用于识别大型语料库中的单词的早期证明，尽管在电视语料库的情况下，需要大量的人工分析和验证。此外，我们认为电视剧和其他类型的虚构文本是研究术语造词和传播的重要资源，因为它们的功能和面向大众的事实。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

TV series as disseminators of emerging vocabulary: Non-codified expressions in the TV Corpus

Abstract This study presents a method for identifying words that appear in corpus data earlier than their first date of attestation in dictionaries. We demonstrate the application of this method based on a large diachronic corpus, the TV Corpus, and the Oxford English Dictionary (OED). Combining automatic extraction of candidate terms from the TV Corpus with comprehensive manual analysis and verification, the method identifies 32 words that were used in TV series before their first attestation in the OED. We present a detailed discussion of these words, analysing their distribution across decades and genres of the TV Corpus, their origins, semantic domains and word-formation processes. We also present extracts with their first uses in the TV Corpus and analyse how the words were presented to the large and anonymous mass audience. Our study shows that the method we present is suitable for identifying early attestations of words in large corpora, even though in the case of the TV Corpus, a great deal of manual analysis and verification is needed. In addition, we argue that TV series and other types of fictional texts are an important resource for studying the coinage and spread of terms, due to their function and the fact that they address a mass audience.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ICAME journal : computers in English linguistics

自引率

0.00%

发文量

审稿时长

32 weeks