International Journal of Corpus Linguistics最新文献

筛选
英文 中文
Strategies in tracing linguistic variation in a corpus of Old Irish texts (CorPH) 古爱尔兰语语料库中语言变异的追踪策略
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-09-20 DOI: 10.1075/ijcl.22018.sti
D. Stifter, Fangzhe Qiu, M. Aquino-López, Bernhard Bauer, E. Lash, Nora White
{"title":"Strategies in tracing linguistic variation in a corpus of Old Irish texts (CorPH)","authors":"D. Stifter, Fangzhe Qiu, M. Aquino-López, Bernhard Bauer, E. Lash, Nora White","doi":"10.1075/ijcl.22018.sti","DOIUrl":"https://doi.org/10.1075/ijcl.22018.sti","url":null,"abstract":"\u0000This article introduces Corpus PalaeoHibernicum (CorPH), a corpus currently consisting of 78 texts in Early Irish (c. 7th–10th cent.) created by the ERC-funded Chronologicon Hibernicum (ChronHib) project by bringing together pre-existing lexical and syntactic databases and adding further crucial texts from the period. In addition to being annotated for POS, morphological and syntactic information, another layer of annotation has been developed for CorPH – ‘Variation Tagging’, i.e. a tagset that numerically encodes synchronic language variation during the Early Irish period, thus allowing for much improved research on the chronological variation among the material. Another new pillar of studying linguistic variation is Bayesian Language Variation Analysis (BLaVA), in order to address the challenge that “not-so-big data” poses to statistical corpus methods. Instead of reflecting feature frequencies, BLaVA models language variation as probabilities of variation.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45170903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
“In barbarous times and in uncivilized countries” “在野蛮的时代和不文明的国家”
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-09-09 DOI: 10.1075/ijcl.22016.ale
Marc Alexander, Andrew Struan
{"title":"“In barbarous times and in uncivilized countries”","authors":"Marc Alexander, Andrew Struan","doi":"10.1075/ijcl.22016.ale","DOIUrl":"https://doi.org/10.1075/ijcl.22016.ale","url":null,"abstract":"\u0000The ways in which politicians have discussed who, what, and where was considered “uncivilized’” across the past two centuries gives an insight into how speakers in a position of authority classified and constructed the world around them, and how those in power in Britain see the country and themselves. This article uses the Hansard Corpus 1803–2003 of speeches in the UK Parliament alongside data from the Historical Thesaurus of English to analyse diachronic variation in usage of words for persons, places and practices considered uncivil. It proposes new methods and offers quantitative data to describe the period’s shift in political attitudes towards not just the so-called “uncivil” but also the country as a whole.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45305120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Volatile concepts 不稳定的概念
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-09-06 DOI: 10.1075/ijcl.22005.fit
S. Fitzmaurice, Seth Mehl
{"title":"Volatile concepts","authors":"S. Fitzmaurice, Seth Mehl","doi":"10.1075/ijcl.22005.fit","DOIUrl":"https://doi.org/10.1075/ijcl.22005.fit","url":null,"abstract":"\u0000This paper demonstrates the value of studying co-occurrence ‘quads’ – constellations of four non-adjacent lemmas that consistently co-occur across spans of up to 100 tokens – for understanding discursive change. We map meaning onto quads as ‘discursive concepts’, which encompass encyclopaedic semantics, pragmatics, and context. We investigate a high-frequency quad with high co-occurrence strength in EEBO-TCP: world-heaven-earth-power. We conduct semantic and pragmatic analysis to generate hypotheses regarding discursive change. The quad’s components are semantically underspecified; thus, although the quad indicates a discursive concept, each instantiation of the quad is variable, contingent, and dependent upon context and pragmatic processes for interpretation. We observe how the vague lexemes that constitute building blocks of religious discourse are employed to generate new, timely secular discourses; and we argue that semantic underspecification is the site and source of discursive change. Indeed, the volatile, unstable nature of the component lexical meanings renders them indispensable to early modern debate.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46980749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A corpus-based study of anglicized neologisms in Korea 基于语料库的韩语英语化新词研究
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-09-06 DOI: 10.1075/ijcl.20055.kim
E. Kim
{"title":"A corpus-based study of anglicized neologisms in Korea","authors":"E. Kim","doi":"10.1075/ijcl.20055.kim","DOIUrl":"https://doi.org/10.1075/ijcl.20055.kim","url":null,"abstract":"\u0000 This study examines usage changes of English-based loanwords and Korean replacement words promoted by the National\u0000 Institute of Korean Language in a six-year span, using two corpora. It focuses on 18 Korean and anglicized word pairs appearing on\u0000 the National Institute of Korean Language’s website that purportedly showcase the Institute’s successful efforts to curtail the\u0000 usage of English words by promoting Korean replacement words. The results indicate that promoting Korean does not necessarily\u0000 decrease the usage of English, and that the usage of English-based words seems to increase in conjunction with the Korean words.\u0000 Several Korean words promoted by the National Institute of Korean Language have extremely low frequencies, and some loanwords are\u0000 being used with various meanings. Commentaries are provided to explain various patterns of observed usage change.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43175169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Keywords through time 关键词通过时间
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-08-29 DOI: 10.1075/ijcl.22011.cla
Isobelle Clarke, Gavin Brookes, Tony McEnery
{"title":"Keywords through time","authors":"Isobelle Clarke, Gavin Brookes, Tony McEnery","doi":"10.1075/ijcl.22011.cla","DOIUrl":"https://doi.org/10.1075/ijcl.22011.cla","url":null,"abstract":"\u0000This paper applies a new approach to the identification of discourses, based on Multiple Correspondence Analysis (MCA), to the study of discourse variation over time. The MCA approach to keywords deals with a major issue with the use of keywords to identify discourses: the allocation of individual keywords to multiple discourses. Yet, as this paper demonstrates, the approach also allows us to observe variation in the prevalence of discourses over time. The MCA approach to keywords allows the allocation of individual texts to multiple discourses based on patterns of keyword co-occurrence. Metadata in the corpus data analysed (here, UK newspaper articles about Islam) can then be used to map those discourses over time, resulting in a clear view of how the discourses vary relative to one another as time progresses. The paper argues that the drivers for these fluctuations are language external; the real-world events reported on in the newspapers.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47439187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Register variation across text lengths 记录文本长度的变化
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-08-23 DOI: 10.1075/ijcl.20177.lii
A. Liimatta
{"title":"Register variation across text lengths","authors":"A. Liimatta","doi":"10.1075/ijcl.20177.lii","DOIUrl":"https://doi.org/10.1075/ijcl.20177.lii","url":null,"abstract":"\u0000This paper explores variation in lexico-grammatical register features across text lengths in a large-scale sample of Reddit comments. Very short texts are known to be problematic for many statistical methods, so understanding their nature is important for the corpus-linguistic study of social media, where most contributions are short. I show that the frequencies of linguistic features change with comment length, even between longer comments, although longer texts are often considered similar in statistical terms. Moreover, I classify the variation found between short comments of different lengths into two main patterns, although other patterns can also be found, and there is variation even within these patterns. Furthermore, I interpret the observed differences in terms of register variation. For example, shorter comments appear to be more casual and less edited in terms of their feature makeup, whereas narrative and informational registers seem to favor longer comments.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44427278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
New methods for analysing diachronic suffix competition across registers 跨语域历时后缀竞争分析的新方法
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-08-19 DOI: 10.1075/ijcl.22014.rod
Paula Rodríguez-Puente, Tanja Säily, J. Suomela
{"title":"New methods for analysing diachronic suffix competition across registers","authors":"Paula Rodríguez-Puente, Tanja Säily, J. Suomela","doi":"10.1075/ijcl.22014.rod","DOIUrl":"https://doi.org/10.1075/ijcl.22014.rod","url":null,"abstract":"\u0000This paper tracks stylistic variation in the use of two roughly synonymous suffixes, the Romance -ity and the native -ness, during the Early Modern English period. We seek to verify from a statistical viewpoint the claims of Rodríguez-Puente (2020), who reports on a decrease of -ness in favour of -ity in registers representative of the speech-written and formal-informal continua at that time. To this end, we develop new methods of statistical and visual analysis that enable diachronic comparisons of competing processes across subcorpora, building upon an earlier method by Säily and Suomela (2009). Our results confirm that -ity gained ground first in written registers and then spread towards speech-related registers, and we are able to time this change more accurately thanks to a novel periodisation. We also provide strong statistical support indicating that the proportion of -ity was significantly higher in legal registers than in other registers.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47926633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Annotating dialogue acts in speech data 语音数据中的对话行为注释
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-08-08 DOI: 10.1075/ijcl.20165.ver
D. Verdonik
{"title":"Annotating dialogue acts in speech data","authors":"D. Verdonik","doi":"10.1075/ijcl.20165.ver","DOIUrl":"https://doi.org/10.1075/ijcl.20165.ver","url":null,"abstract":"\u0000 The aims of this paper are to detect the most problematic issues related to dialogue act annotation in speech\u0000 corpora and to define basic categories of dialogue acts. I critically examine and test generic schemes that represent different\u0000 lines of dialogue act annotation: AMI, DART, ISO 24617–2 and SWBD-DAMSL. It is found that the most problematic issues regarding\u0000 dialogue act annotation are related to the distinction between the semantic and pragmatic meanings of utterances, the annotation\u0000 of metadiscourse, and the adequacy and informativeness of the tagset. The identified basic dialogue act categories are information\u0000 providing, information seeking, actions, social acts and metadiscourse. The findings help improve dialogue act annotation.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47202283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Derivation and semantic autonomy 派生与语义自主
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-07-21 DOI: 10.1075/ijcl.20074.kra
Iwona Kraska-Szlenk, Beata Wójtowicz
{"title":"Derivation and semantic autonomy","authors":"Iwona Kraska-Szlenk, Beata Wójtowicz","doi":"10.1075/ijcl.20074.kra","DOIUrl":"https://doi.org/10.1075/ijcl.20074.kra","url":null,"abstract":"\u0000 The article focuses on the polysemy and usage patterns of the Polish lexeme głowa “head” and its\u0000 diminutive główka. Based on corpus methodology and cognitive linguistics analysis, it is argued that the two\u0000 lexemes are too autonomous in their meanings than predicted by their morphological relatedness. As the two words cover different\u0000 semantic domains, we observe that the diminutive suffix has developed a new function which signals lexicalization of meaning\u0000 toward a non-human semantic domain, for example, material objects, plants, etc. Our research contributes to studies on Polish\u0000 morphology and lexical semantics and to theoretical research on the polysemy of body part terms.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46948133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Question illocutionary force indicating devices in academic writing 质疑学术写作中的言外之力指示手段
IF 1 2区 文学
International Journal of Corpus Linguistics Pub Date : 2022-07-18 DOI: 10.1075/ijcl.20065.cur
Niall Curry
{"title":"Question illocutionary force indicating devices in academic writing","authors":"Niall Curry","doi":"10.1075/ijcl.20065.cur","DOIUrl":"https://doi.org/10.1075/ijcl.20065.cur","url":null,"abstract":"\u0000Corpus research on questions as reader engagement markers in academic writing typically focuses on direct questions. Such questions are signalled by question marks and are relatively easily searchable in a corpus. However, indirect questions can be more challenging to identify, as they can be introduced by a range of forms. Based on a contrastive analysis of a corpus of English, French, and Spanish economics research articles, this paper provides pertinent evidence on direct and indirect questions as reader engagement markers. Firstly, it shows that direct and indirect questions as reader engagement markers are a rhetorical and generic feature of academic writing in the economics research article and, secondly, it presents a comprehensive list of indirect question illocutionary force indicating devices, valuable for future studies of indirect questions. Methodologically, this paper illustrates a replicable process for functional analysis and discusses the value of theoretically merging corpus and contrastive linguistic approaches.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49168052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信