基于语料库的小学科学图的多模态和体裁分析

IF 1.2 2区 文学 Q3 COMMUNICATION
Tuomo Hiippala
{"title":"基于语料库的小学科学图的多模态和体裁分析","authors":"Tuomo Hiippala","doi":"10.1177/14703572231161829","DOIUrl":null,"url":null,"abstract":"This article presents a data-driven analysis of multimodal genre in a corpus of primary school science diagrams that contains multiple layers of cross-referenced annotations for multimodal discourse structure. The aim is to identify diagram genres in the corpus and describe their multimodal characteristics. To do so, information about expressive resources used in the diagrams and the discourse relations between them is extracted from the corpus, and computer vision is used to approximate the visual appearance of the diagrams. The article also presents a new method for quantifying information about the use of layout space. The resulting description of multimodal discourse structure is processed using UMAP, an unsupervised machine-learning algorithm, in order to identify diagrams that exhibit similar structural characteristics. The analysis allows the identification and characterization of four diagram genres in the corpus, which adopt different rhetorical strategies in combining expressive resources into discourse structures. The analysis also reveals that layout plays a major role in shaping the genre space, which can be further refined using information about the discourse structure. Overall, the results suggest that computational methods can be used to characterize multimodal genre from a bottom-up perspective using low-level information about expressive resources and layout.","PeriodicalId":51671,"journal":{"name":"Visual Communication","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Corpus-based insights into multimodality and genre in primary school science diagrams\",\"authors\":\"Tuomo Hiippala\",\"doi\":\"10.1177/14703572231161829\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents a data-driven analysis of multimodal genre in a corpus of primary school science diagrams that contains multiple layers of cross-referenced annotations for multimodal discourse structure. The aim is to identify diagram genres in the corpus and describe their multimodal characteristics. To do so, information about expressive resources used in the diagrams and the discourse relations between them is extracted from the corpus, and computer vision is used to approximate the visual appearance of the diagrams. The article also presents a new method for quantifying information about the use of layout space. The resulting description of multimodal discourse structure is processed using UMAP, an unsupervised machine-learning algorithm, in order to identify diagrams that exhibit similar structural characteristics. The analysis allows the identification and characterization of four diagram genres in the corpus, which adopt different rhetorical strategies in combining expressive resources into discourse structures. The analysis also reveals that layout plays a major role in shaping the genre space, which can be further refined using information about the discourse structure. Overall, the results suggest that computational methods can be used to characterize multimodal genre from a bottom-up perspective using low-level information about expressive resources and layout.\",\"PeriodicalId\":51671,\"journal\":{\"name\":\"Visual Communication\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visual Communication\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1177/14703572231161829\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visual Communication","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1177/14703572231161829","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种数据驱动的小学科学图表语料库多模态体裁分析方法,该语料库包含多模态语篇结构的多层交叉引用注释。目的是识别语料库中的图表类型,并描述它们的多模态特征。为此,从语料库中提取有关图中使用的表达资源及其之间的话语关系的信息,并使用计算机视觉来近似图的视觉外观。本文还提出了一种量化布局空间使用信息的新方法。使用UMAP(一种无监督机器学习算法)处理多模态话语结构的结果描述,以识别表现出相似结构特征的图。通过分析,可以对语料库中的四种图表类型进行识别和表征,这四种类型在将表达资源组合成话语结构时采用了不同的修辞策略。分析还表明,语篇布局在语篇空间的形成中起着重要作用,可以利用语篇结构信息进一步细化语篇空间。总体而言,研究结果表明,计算方法可以从自下而上的角度利用表达资源和布局的底层信息来表征多模态类型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Corpus-based insights into multimodality and genre in primary school science diagrams
This article presents a data-driven analysis of multimodal genre in a corpus of primary school science diagrams that contains multiple layers of cross-referenced annotations for multimodal discourse structure. The aim is to identify diagram genres in the corpus and describe their multimodal characteristics. To do so, information about expressive resources used in the diagrams and the discourse relations between them is extracted from the corpus, and computer vision is used to approximate the visual appearance of the diagrams. The article also presents a new method for quantifying information about the use of layout space. The resulting description of multimodal discourse structure is processed using UMAP, an unsupervised machine-learning algorithm, in order to identify diagrams that exhibit similar structural characteristics. The analysis allows the identification and characterization of four diagram genres in the corpus, which adopt different rhetorical strategies in combining expressive resources into discourse structures. The analysis also reveals that layout plays a major role in shaping the genre space, which can be further refined using information about the discourse structure. Overall, the results suggest that computational methods can be used to characterize multimodal genre from a bottom-up perspective using low-level information about expressive resources and layout.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Visual Communication
Visual Communication COMMUNICATION-
CiteScore
3.40
自引率
13.30%
发文量
45
期刊介绍: Visual Communication provides an international forum for the growing body of work in numerous interrelated disciplines. Its broad coverage includes: still and moving images; graphic design and typography; visual phenomena such as fashion, professional vision, posture and interaction; the built and landscaped environment; the role of the visual in relation to language, music, sound and action.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信