{"title":"The DIG Mandarin Conversations (DMC) Corpus","authors":"Guodong Yu, Yaxin Wu, Paul Drew, C. W. Raymond","doi":"10.1075/cld.23001.guo","DOIUrl":null,"url":null,"abstract":"\n This paper introduces the DMC Corpus – a newly collected dataset of 150 mundane cell phone calls\n from Mainland China in Mandarin Chinese (audio and detailed transcripts) – which is now publicly available for use in research and\n teaching. In this report, we first describe the constitution and current contents of the DMC Corpus, as well as instructions for\n access. Additional calls will be added periodically to the Corpus, and so the quantitative overview presented here should be\n considered conservative. We then provide concrete examples of the sorts of phenomena that might be explored with these new data,\n underscoring how the Corpus offers researchers the ability to build systematic collections for analysis – no matter whether\n researchers prefer to begin with ‘forms’ (e.g., utterance-final particles), with ‘functions’ (e.g., complaining), and/or with the\n temporal organization of interaction itself (e.g., preference organization, repair). The paper concludes with an explicit call for\n increased research on Mandarin conversation, to which we hope the materials in the DMC Corpus will contribute.","PeriodicalId":42144,"journal":{"name":"Chinese Language and Discourse","volume":"2017 34","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Language and Discourse","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1075/cld.23001.guo","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces the DMC Corpus – a newly collected dataset of 150 mundane cell phone calls
from Mainland China in Mandarin Chinese (audio and detailed transcripts) – which is now publicly available for use in research and
teaching. In this report, we first describe the constitution and current contents of the DMC Corpus, as well as instructions for
access. Additional calls will be added periodically to the Corpus, and so the quantitative overview presented here should be
considered conservative. We then provide concrete examples of the sorts of phenomena that might be explored with these new data,
underscoring how the Corpus offers researchers the ability to build systematic collections for analysis – no matter whether
researchers prefer to begin with ‘forms’ (e.g., utterance-final particles), with ‘functions’ (e.g., complaining), and/or with the
temporal organization of interaction itself (e.g., preference organization, repair). The paper concludes with an explicit call for
increased research on Mandarin conversation, to which we hope the materials in the DMC Corpus will contribute.