Distant Co-occurrence Patterns of Connectives: a Corpus Study of Formulaicity in Japanese

Q3 Arts and Humanities
Andrej Bekeš, Bor Hodošček, K. Nishina, Takeshi Abekawa
Journal: Acta Linguistica Asiatica. Published: 2023-07-30. DOI: 10.4312/ala.13.2.9-38 (https://doi.org/10.4312/ala.13.2.9-38)
Citations: 0

Abstract

Using corpus research methods, this study aims to establish whether two-item and, more generally, multi-item distant co-occurrence patterns of connectives exist in written Japanese, and to clarify the role these patterns play in discourse. The study is based on a hybrid corpus of written Japanese comprising humanities and social science papers, science and technology papers, and general written language data. The co-occurrence threshold was set at co-occurrence frequency > 10, pointwise mutual information (PMI) > 2, and Dice coefficient > 0.01. The distribution of the observed co-occurring pairs differed by genre. Visualizing the connectivity potential of co-occurring pairs as directed graphs showed that these pairs form longer co-occurrence chains, which can be interpreted as ready-made co-occurrence patterns. Two-item and multi-item co-occurrence patterns are considered a type of Bourdieu’s habitus and contribute to both discourse development and discourse prediction.
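The thresholds and the directed-graph reading described in the abstract can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: PMI is computed with log base 2 from raw counts, N is the number of text units in which co-occurrence is counted, and all counts below are invented toy values rather than figures from the study. The function names (`pmi`, `dice`, `build_graph`, `chains`) are hypothetical and do not reproduce the authors' implementation.

```python
import math
from collections import defaultdict

def pmi(f_xy, f_x, f_y, n):
    """Pointwise mutual information (log base 2, an assumption) from raw counts."""
    return math.log2((f_xy * n) / (f_x * f_y))

def dice(f_xy, f_x, f_y):
    """Dice coefficient from raw counts."""
    return 2 * f_xy / (f_x + f_y)

def passes_thresholds(f_xy, f_x, f_y, n, min_freq=10, min_pmi=2.0, min_dice=0.01):
    """Apply the three thresholds from the abstract (all must hold)."""
    return (f_xy > min_freq
            and pmi(f_xy, f_x, f_y, n) > min_pmi
            and dice(f_xy, f_x, f_y) > min_dice)

def build_graph(pair_counts, unit_counts, n):
    """Directed graph: an edge first -> second for every pair that passes."""
    graph = defaultdict(list)
    for (first, second), f_xy in pair_counts.items():
        if passes_thresholds(f_xy, unit_counts[first], unit_counts[second], n):
            graph[first].append(second)
    return dict(graph)

def chains(graph, start, max_len=4):
    """Enumerate maximal simple paths (candidate chains) starting at `start`."""
    results = []

    def walk(path):
        extended = False
        for nxt in graph.get(path[-1], []):
            if nxt not in path and len(path) < max_len:
                extended = True
                walk(path + [nxt])
        if not extended and len(path) > 1:
            results.append(path)

    walk([start])
    return results

# Toy counts (invented for illustration only).
N = 10_000
unit_counts = {"まず": 200, "次に": 150, "最後に": 100, "しかし": 2000}
pair_counts = {
    ("まず", "次に"): 60,     # passes all three thresholds
    ("次に", "最後に"): 40,   # passes all three thresholds
    ("まず", "しかし"): 30,   # fails: PMI is below 2
    ("まず", "最後に"): 8,    # fails: frequency is not above 10
}

graph = build_graph(pair_counts, unit_counts, N)
found = chains(graph, "まず")
print(graph)
print(found)
```

In this toy setting, two surviving pairs that share a connective link up into a three-item chain, which is the kind of longer pattern the abstract reads off the directed graphs.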
Source journal: Acta Linguistica Asiatica (Arts and Humanities: Language and Linguistics). CiteScore: 0.40. Self-citation rate: 0.00%. Articles published: 14. Review time: 20 weeks.