Corpus-based insights into multimodality and genre in primary school science diagrams

IF 1.4 2区文学 Q3 COMMUNICATION

Visual Communication Pub Date : 2023-04-25 DOI:10.1177/14703572231161829

Tuomo Hiippala

{"title":"Corpus-based insights into multimodality and genre in primary school science diagrams","authors":"Tuomo Hiippala","doi":"10.1177/14703572231161829","DOIUrl":null,"url":null,"abstract":"This article presents a data-driven analysis of multimodal genre in a corpus of primary school science diagrams that contains multiple layers of cross-referenced annotations for multimodal discourse structure. The aim is to identify diagram genres in the corpus and describe their multimodal characteristics. To do so, information about expressive resources used in the diagrams and the discourse relations between them is extracted from the corpus, and computer vision is used to approximate the visual appearance of the diagrams. The article also presents a new method for quantifying information about the use of layout space. The resulting description of multimodal discourse structure is processed using UMAP, an unsupervised machine-learning algorithm, in order to identify diagrams that exhibit similar structural characteristics. The analysis allows the identification and characterization of four diagram genres in the corpus, which adopt different rhetorical strategies in combining expressive resources into discourse structures. The analysis also reveals that layout plays a major role in shaping the genre space, which can be further refined using information about the discourse structure. Overall, the results suggest that computational methods can be used to characterize multimodal genre from a bottom-up perspective using low-level information about expressive resources and layout.","PeriodicalId":51671,"journal":{"name":"Visual Communication","volume":"339 1","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visual Communication","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1177/14703572231161829","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMMUNICATION","Score":null,"Total":0}

引用次数: 0

Abstract

This article presents a data-driven analysis of multimodal genre in a corpus of primary school science diagrams that contains multiple layers of cross-referenced annotations for multimodal discourse structure. The aim is to identify diagram genres in the corpus and describe their multimodal characteristics. To do so, information about expressive resources used in the diagrams and the discourse relations between them is extracted from the corpus, and computer vision is used to approximate the visual appearance of the diagrams. The article also presents a new method for quantifying information about the use of layout space. The resulting description of multimodal discourse structure is processed using UMAP, an unsupervised machine-learning algorithm, in order to identify diagrams that exhibit similar structural characteristics. The analysis allows the identification and characterization of four diagram genres in the corpus, which adopt different rhetorical strategies in combining expressive resources into discourse structures. The analysis also reveals that layout plays a major role in shaping the genre space, which can be further refined using information about the discourse structure. Overall, the results suggest that computational methods can be used to characterize multimodal genre from a bottom-up perspective using low-level information about expressive resources and layout.

查看原文本刊更多论文

基于语料库的小学科学图的多模态和体裁分析

本文提出了一种数据驱动的小学科学图表语料库多模态体裁分析方法，该语料库包含多模态语篇结构的多层交叉引用注释。目的是识别语料库中的图表类型，并描述它们的多模态特征。为此，从语料库中提取有关图中使用的表达资源及其之间的话语关系的信息，并使用计算机视觉来近似图的视觉外观。本文还提出了一种量化布局空间使用信息的新方法。使用UMAP(一种无监督机器学习算法)处理多模态话语结构的结果描述，以识别表现出相似结构特征的图。通过分析，可以对语料库中的四种图表类型进行识别和表征，这四种类型在将表达资源组合成话语结构时采用了不同的修辞策略。分析还表明，语篇布局在语篇空间的形成中起着重要作用，可以利用语篇结构信息进一步细化语篇空间。总体而言，研究结果表明，计算方法可以从自下而上的角度利用表达资源和布局的底层信息来表征多模态类型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Visual Communication COMMUNICATION-

CiteScore

3.40

自引率

13.30%

发文量

期刊介绍： Visual Communication provides an international forum for the growing body of work in numerous interrelated disciplines. Its broad coverage includes: still and moving images; graphic design and typography; visual phenomena such as fashion, professional vision, posture and interaction; the built and landscaped environment; the role of the visual in relation to language, music, sound and action.