{"title":"Intelligent Multimodal Analysis Framework for Teacher-Student Interaction","authors":"Mengke Wang, Liang Luo, Zengzhao Chen, Qiuyu Zheng, Jiawen Li, Wei Gao","doi":"10.1109/IEIR56323.2022.10050044","DOIUrl":null,"url":null,"abstract":"This paper constructed a multi-modal analysis framework of teacher-student interaction based on intelligent technology. Voiceprint recognition was used to divide the teaching video into slices according to sentences and then used speech recognition, speech emotion analysis, gaze point estimation, and other technologies to recognize and encoded the multimodal behavior of each slice. We analyzed 10 lessons using the event sampling method proposed in the analysis framework in comparison with the classical temporal sampling analysis method and demonstrated the results of multimodal interaction analysis of an instructional video as an example. The results indicated that the event sampling method proposed not only reduces the number of analysis units but also has more complete information about the utterance of each unit, overcoming the incomplete information or information redundancy of analysis units caused by the mechanical segmentation of temporal sampling. The multimodal analysis showed that taking into account both teacher-student verbal and nonverbal interactions can reveal richer and deeper information about classroom teaching and learning. 
This framework provides an important reference for intelligent multimodal analysis of teacher-student interaction.","PeriodicalId":183709,"journal":{"name":"2022 International Conference on Intelligent Education and Intelligent Research (IEIR)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Education and Intelligent Research (IEIR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEIR56323.2022.10050044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
This paper constructs a multimodal analysis framework for teacher-student interaction based on intelligent technologies. Voiceprint recognition is used to segment the teaching video into sentence-level slices; speech recognition, speech emotion analysis, gaze point estimation, and other technologies are then applied to recognize and encode the multimodal behavior of each slice. We analyzed 10 lessons with the event sampling method proposed in the framework, compared it against the classical temporal sampling method, and demonstrated the results of multimodal interaction analysis on an instructional video as an example. The results indicate that the proposed event sampling method not only reduces the number of analysis units but also preserves more complete utterance information in each unit, overcoming the incomplete or redundant information that the mechanical segmentation of temporal sampling produces. The multimodal analysis shows that considering both verbal and nonverbal teacher-student interactions reveals richer and deeper information about classroom teaching and learning. This framework provides an important reference for intelligent multimodal analysis of teacher-student interaction.
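The contrast the abstract draws between event sampling and temporal sampling can be illustrated with a minimal sketch. This is not the paper's implementation: the utterance timestamps, function names, and the 3-second window are all hypothetical choices made for illustration (fixed short intervals are typical of classical Flanders-style temporal coding).

```python
# Illustrative sketch: sentence-boundary "event sampling" versus
# fixed-interval "temporal sampling" over hypothetical utterance
# timestamps, given as (start, end) pairs in seconds.

utterances = [(0.0, 4.2), (4.5, 6.1), (6.4, 12.8), (13.0, 14.5)]

def event_units(utts):
    """One analysis unit per utterance: no utterance is ever split."""
    return list(utts)

def temporal_units(total_end, interval=3.0):
    """Fixed windows of `interval` seconds, regardless of speech."""
    units, t = [], 0.0
    while t < total_end:
        units.append((t, min(t + interval, total_end)))
        t += interval
    return units

ev = event_units(utterances)
tp = temporal_units(utterances[-1][1])

# Event sampling yields one unit per utterance (4 here), each
# containing a complete sentence.  Temporal sampling yields 5
# windows, several of which cut an utterance in two (incomplete
# information) or span parts of two utterances (redundancy).
print(len(ev), len(tp))  # → 4 5
```

The point of the sketch is only the unit boundaries: event sampling aligns analysis units with utterances, while mechanical windows do not.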