Das KOMMA-Korpus und seine vielseitige Nutzbarkeit: Die deutsche Standardvarietät in Südtirol
M. Leonardi
Linguistik Online, published 2023-11-24
DOI: https://doi.org/10.13092/lo.123.10547
Abstract
This paper describes the construction of the KOMMA corpus and presents and analyses selected case studies. The corpus contains written and spoken data from young adults attending the final year of a German-language high school in South Tyrol. In addition to the data collection and the methods used, the paper details the data processing steps (transcription, normalisation, lemmatisation and POS tagging). Finally, selected case studies from word formation and lexis illustrate how the corpus can be used.